P2322R6 accumulator types for reduce #509

gevtushenko · 2022-06-13T12:40:45Z

This PR addresses the following issue. I also found that we use copy assignment operator on uninitialized memory, which might lead to issues for not primitive types.

⚠️ Breaking changes:

AgentReduce:
1. now has extra template parameter representing the accumulator type.
2. ConsumeTile uses accumulator type for thread aggregate instead of output iterator value type.
3. ConsumeTile returns accumulator type instead of output iterator value type.
DeviceReduceKernel doesn't accept output iterator as a template parameter. Apart from that, it now accepts accumulator type.
DeviceReduceSingleTileKernel now accepts accumulator type.
DeviceSegmentedReduceKernel now accepts accumulator type.
DeviceReducePolicy now accepts accumulator type instead of input iterator value type. It also doesn't accept output iterator value type now.
DispatchReduce:
1. now accepts accumulator type as last parameter.
2. now accepts initializer type instead of output iterator value type.
3. doesn't work with device extended lambdas now.
4. Constructor now accepts init as initial type instead of output iterator value type.
DispatchSegmentedReduce:
1. accepts accumulator type as last parameter.
2. accepts initializer type instead of output iterator value type.
Thread operators now accept parameters using different types: Equality, Inequality, InequalityWrapper, Sum, Difference, Division, Max, ArgMax, Min, ArgMin.
ThreadReduce now accepts accumulator type and use different type for prefix
Default accumulator type is now selected by introspecting return type of the reduce operator.

gevtushenko · 2022-06-13T12:47:24Z

This PR should be merged after libcu++ supports device lambdas in invoke result: NVIDIA/libcudacxx#284

alliepiper · 2022-06-24T21:13:50Z

cub/device/dispatch/dispatch_reduce.cuh

+    cub::detail::non_void_value_t<
+      OutputIteratorT, 
+      cub::detail::value_t<InputIteratorT>>,
+  typename AccumT = 


Can you add the release: breaking change label and put a description of these changes to the Dispatch interface in the PR description? That way I'll make sure to call these changes out in the release notes.

Same goes for all of the accumulator-type changes in behavior -- I check that label when I'm building relnotes from the list of PRs.

alliepiper

This will need a couple of things before merging:

Rebase for debug_synchronous/DebugSyncStream/CDP changes
Add summary of breaking changes to PR description.

miscco

I am not sure if I am yet qualified to really approve it but I can complain

cub/agent/agent_reduce.cuh

cub/device/dispatch/dispatch_reduce.cuh

cub/thread/thread_operators.cuh

gevtushenko · 2022-08-01T08:55:25Z

⚠️ Breaking changes:

AgentReduce:
1. now has extra template parameter representing the accumulator type.
2. ConsumeTile uses accumulator type for thread aggregate instead of output iterator value type.
3. ConsumeTile returns accumulator type instead of output iterator value type.
DeviceReduceKernel doesn't accept output iterator as a template parameter. Apart from that, it now accepts accumulator type.
DeviceReduceSingleTileKernel now accepts accumulator type.
DeviceSegmentedReduceKernel now accepts accumulator type.
DeviceReducePolicy now accepts accumulator type instead of input iterator value type. It also doesn't accept output iterator value type now.
DispatchReduce:
1. now accepts accumulator type as last parameter.
2. now accepts initializer type instead of output iterator value type.
3. doesn't work with device extended lambdas now.
4. Constructor now accepts init as initial type instead of output iterator value type.
DispatchSegmentedReduce:
1. accepts accumulator type as last parameter.
2. accepts initializer type instead of output iterator value type.
Thread operators now accept parameters using different types: Equality, Inequality, InequalityWrapper, Sum, Difference, Division, Max, ArgMax, Min, ArgMin.
ThreadReduce now accepts accumulator type and use different type for prefix
Default accumulator type is now selected by introspecting return type of the reduce operator.

alliepiper · 2022-08-03T14:39:50Z

Breaking changes

Thanks for writing this up! Can you edit the PR description/first comment and add this to it? I'm less likely to overlook it that way :)

gevtushenko · 2022-08-03T15:39:44Z

Breaking changes

Thanks for writing this up! Can you edit the PR description/first comment and add this to it? I'm less likely to overlook it that way :)

Sorry, didn't think about this aspect 😄 I'll definitely update the description.

gevtushenko requested a review from alliepiper June 13, 2022 12:40

gevtushenko added testing: gpuCI in progress Started gpuCI testing. type: bug: functional Does not work as intended. labels Jun 13, 2022

gevtushenko added this to the 2.0.0 milestone Jun 13, 2022

gevtushenko added testing: gpuCI passed Passed gpuCI testing. and removed testing: gpuCI in progress Started gpuCI testing. labels Jun 13, 2022

gevtushenko mentioned this pull request Jun 16, 2022

P2322R6 accumulator types for scan and reduce by key #511

Merged

alliepiper added the P1: should have Necessary, but not critical. label Jun 22, 2022

alliepiper reviewed Jun 24, 2022

View reviewed changes

gevtushenko added the release: breaking change Include in "Breaking Changes" section of release notes. label Jun 24, 2022

alliepiper self-assigned this Jul 25, 2022

alliepiper approved these changes Jul 25, 2022

View reviewed changes

alliepiper removed their assignment Jul 25, 2022

gevtushenko force-pushed the fix-main/github/reduce_intermediate_type branch from a064b12 to 921885f Compare July 28, 2022 09:10

miscco reviewed Jul 28, 2022

View reviewed changes

P2322R6 accumulator types for reduce

1e98f6f

gevtushenko force-pushed the fix-main/github/reduce_intermediate_type branch from a87316b to 1e98f6f Compare August 1, 2022 17:00

gevtushenko merged commit 81a96c9 into NVIDIA:main Aug 1, 2022

gevtushenko mentioned this pull request Aug 3, 2022

Use P2322R6 to determine intermediate types for relevant algorithms #428

Closed

gevtushenko mentioned this pull request Mar 21, 2023

thrust::exclusive_scan deduces value type from identity value NVIDIA/cccl#836

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

P2322R6 accumulator types for reduce #509

P2322R6 accumulator types for reduce #509

gevtushenko commented Jun 13, 2022 •

edited

Loading

gevtushenko commented Jun 13, 2022

alliepiper Jun 24, 2022

alliepiper left a comment

miscco left a comment

gevtushenko commented Aug 1, 2022

alliepiper commented Aug 3, 2022

gevtushenko commented Aug 3, 2022

P2322R6 accumulator types for reduce #509

P2322R6 accumulator types for reduce #509

Conversation

gevtushenko commented Jun 13, 2022 • edited Loading

gevtushenko commented Jun 13, 2022

alliepiper Jun 24, 2022

Choose a reason for hiding this comment

alliepiper left a comment

Choose a reason for hiding this comment

miscco left a comment

Choose a reason for hiding this comment

gevtushenko commented Aug 1, 2022

alliepiper commented Aug 3, 2022

gevtushenko commented Aug 3, 2022

gevtushenko commented Jun 13, 2022 •

edited

Loading