Drop deprecated CUB macros by bernhardmgruber · Pull Request #3821 · NVIDIA/cccl · GitHub

Conversation

@bernhardmgruber
Contributor

@bernhardmgruber bernhardmgruber commented Feb 15, 2025

Split out of #3748, since it causes SASS changes in at least cub.bench.copy.memcpy.base. Several split-off PRs proposed sub-parts, accompanied by SASS diffs and benchmarks. The remainder of this PR is now only the removal of the unused macros (and the addition of a few missing includes).

@bernhardmgruber bernhardmgruber requested review from a team as code owners February 15, 2025 10:05
@bernhardmgruber bernhardmgruber force-pushed the drop_min_max branch 2 times, most recently from 5c959be to fa676c1 Compare February 15, 2025 10:59
@bernhardmgruber bernhardmgruber added the cub (For all items related to CUB) and breaking (Breaking change) labels Feb 15, 2025
Comment on lines 46 to 47
#include <limits>

Contributor

Should we move towards ::cuda::std::numeric_limits?

Contributor Author

I thought so too, but in the example it wasn't necessary, and maybe <limits> is more familiar. I really don't mind.

Contributor

Fine for me

@copy-pr-bot
Contributor

copy-pr-bot bot commented Feb 24, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions
Contributor

🟨 CI finished in 1h 51m: Pass: 94%/93 | Total: 2d 00h | Avg: 30m 59s | Max: 1h 14m | Hits: 84%/127854
  • 🟨 cub: Pass: 88%/45 | Total: 1d 11h | Avg: 47m 13s | Max: 1h 14m | Hits: 73%/47410

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  88%/43  | Total:  1d 09h | Avg: 47m 20s | Max:  1h 14m | Hits:  73%/44980 
      🟩 arm64              Pass: 100%/2   | Total:  1h 29m | Avg: 44m 49s | Max: 45m 32s | Hits:  80%/2430  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 49m | Avg: 45m 49s | Max: 58m 33s | Hits:  67%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 54s | Max: 59m 52s | Hits:  66%/2248  
      🔍 12.8               Pass:  86%/38  | Total:  1d 05h | Avg: 46m 51s | Max:  1h 14m | Hits:  75%/39254 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 46m | Avg: 53m 24s | Max: 53m 57s | Hits:  90%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 49m | Avg: 45m 49s | Max: 58m 33s | Hits:  67%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 55m | Avg: 57m 54s | Max: 59m 52s | Hits:  66%/2248  
      🔍 nvcc12.8           Pass:  86%/36  | Total:  1d 03h | Avg: 46m 29s | Max:  1h 14m | Hits:  74%/37154 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 24s | Max: 53m 57s | Hits:  90%/2100  
      🔍 nvcc               Pass:  88%/43  | Total:  1d 09h | Avg: 46m 56s | Max:  1h 14m | Hits:  73%/45310 
    🔍 sm: 90 🔍
      🔍 90                 Pass:  66%/3   | Total:  1h 31m | Avg: 30m 27s | Max: 52m 46s | Hits:  94%/2430  
      🟩 90;90a;100         Pass: 100%/1   | Total: 51m 56s | Avg: 51m 56s | Max: 51m 56s | Hits:  75%/1215  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 16h 22m | Avg: 49m 06s | Max:  1h 13m | Hits:  69%/23535 
      🔍 20                 Pass:  80%/25  | Total: 19h 03m | Avg: 45m 43s | Max:  1h 14m | Hits:  78%/23875 
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 48m | Avg: 42m 10s | Max: 42m 44s | Hits:  80%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 29m | Avg: 44m 38s | Max: 45m 23s | Hits:  78%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 25m | Avg: 42m 52s | Max: 44m 15s | Hits:  78%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 34m | Avg: 47m 16s | Max: 47m 19s | Hits:  77%/2430  
      🟨 Clang18            Pass:  85%/7   | Total:  5h 17m | Avg: 45m 22s | Max: 55m 47s | Hits:  86%/6960  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 27m | Avg: 43m 40s | Max: 43m 47s | Hits:  79%/2434  
      🟩 GCC8               Pass: 100%/1   | Total: 59m 52s | Avg: 59m 52s | Max: 59m 52s | Hits:  67%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 25m | Avg: 42m 55s | Max: 43m 00s | Hits:  78%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 34m | Avg: 47m 26s | Max: 49m 26s | Hits:  75%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 31m | Avg: 45m 52s | Max: 46m 39s | Hits:  77%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 41m | Avg: 50m 52s | Max: 56m 08s | Hits:  73%/2430  
      🟨 GCC13              Pass:  63%/11  | Total:  7h 39m | Avg: 41m 44s | Max: 57m 02s | Hits:  85%/8505  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 06m | Hits:  12%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 14m | Hits:  12%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 55m | Avg: 57m 54s | Max: 59m 52s | Hits:  66%/2248  
    🟨 cxx_family
      🟨 Clang              Pass:  94%/17  | Total: 12h 35m | Avg: 44m 27s | Max: 55m 47s | Hits:  81%/19118 
      🟨 GCC                Pass:  81%/22  | Total: 16h 20m | Avg: 44m 34s | Max: 59m 52s | Hits:  79%/21884 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 33m | Avg:  1h 08m | Max:  1h 14m | Hits:  12%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 54s | Max: 59m 52s | Hits:  66%/2248  
    🟨 gpu
      🟨 h100               Pass:  66%/3   | Total:  1h 31m | Avg: 30m 27s | Max: 52m 46s | Hits:  94%/2430  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 04h | Avg: 49m 29s | Max:  1h 14m | Hits:  70%/40120 
      🟨 rtxa6000           Pass:  50%/8   | Total:  5h 51m | Avg: 43m 54s | Max: 57m 02s | Hits:  88%/4860  
    🟨 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 05h | Avg: 48m 22s | Max:  1h 14m | Hits:  71%/43765 
      🟥 DeviceLaunch       Pass:   0%/1   | Total: 57m 02s | Avg: 57m 02s | Max: 57m 02s
      🟥 GraphCapture       Pass:   0%/1   | Total: 48m 39s | Avg: 48m 39s | Max: 48m 39s
      🟥 HostLaunch         Pass:   0%/3   | Total:  2h 44m | Avg: 54m 50s | Max: 55m 58s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 05m | Avg: 21m 48s | Max: 23m 02s | Hits:  99%/3645  
    
  • 🟩 thrust: Pass: 100%/45 | Total: 11h 39m | Avg: 15m 33s | Max: 52m 05s | Hits: 89%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 02s | Avg:  9m 01s | Max: 10m 56s | Hits:  98%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 11h 24m | Avg: 15m 54s | Max: 52m 05s | Hits:  89%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 15m 48s | Avg:  7m 54s | Max:  9m 54s | Hits:  95%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 25m | Avg: 17m 01s | Max: 43m 20s | Hits:  85%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  1h 13m | Avg: 36m 57s | Max: 38m 51s | Hits:  86%/3562  
      🟩 12.8               Pass: 100%/38  | Total:  9h 00m | Avg: 14m 13s | Max: 52m 05s | Hits:  90%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 15m 56s | Avg:  7m 58s | Max: 10m 35s | Hits:  97%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 25m | Avg: 17m 01s | Max: 43m 20s | Hits:  85%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 13m | Avg: 36m 57s | Max: 38m 51s | Hits:  86%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  8h 44m | Avg: 14m 34s | Max: 52m 05s | Hits:  90%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 15m 56s | Avg:  7m 58s | Max: 10m 35s | Hits:  97%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 11h 23m | Avg: 15m 54s | Max: 52m 05s | Hits:  89%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 40m 33s | Avg: 10m 08s | Max: 11m 06s | Hits:  93%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 22m 58s | Avg: 11m 29s | Max: 12m 43s | Hits:  92%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 25m 02s | Avg: 12m 31s | Max: 14m 32s | Hits:  92%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 21m 09s | Avg: 10m 34s | Max: 11m 14s | Hits:  92%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 00m | Avg:  8m 39s | Max: 11m 21s | Hits:  97%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 21m 12s | Avg: 10m 36s | Max: 11m 28s | Hits:  92%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 10m 21s | Avg: 10m 21s | Max: 10m 21s | Hits:  92%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 25m 42s | Avg: 12m 51s | Max: 13m 11s | Hits:  91%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 24m 40s | Avg: 12m 20s | Max: 12m 43s | Hits:  92%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 26m 11s | Avg: 13m 05s | Max: 13m 12s | Hits:  91%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 29m 32s | Avg: 14m 46s | Max: 15m 01s | Hits:  90%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 42m | Avg: 10m 17s | Max: 14m 23s | Hits:  96%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 26m | Avg: 43m 04s | Max: 43m 20s | Hits:  55%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 09m | Avg: 43m 01s | Max: 52m 05s | Hits:  60%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 13m | Avg: 36m 57s | Max: 38m 51s | Hits:  86%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 50m | Avg: 10m 01s | Max: 14m 32s | Hits:  94%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  4h 00m | Avg: 11m 27s | Max: 15m 01s | Hits:  94%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 35m | Avg: 43m 02s | Max: 52m 05s | Hits:  58%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 13m | Avg: 36m 57s | Max: 38m 51s | Hits:  86%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 26s | Avg:  8m 13s | Max: 11m 48s | Hits:  99%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total:  8h 40m | Avg: 15m 45s | Max: 43m 20s | Hits:  89%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 43m | Avg: 16m 20s | Max: 52m 05s | Hits:  90%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 10h 06m | Avg: 15m 58s | Max: 52m 05s | Hits:  88%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 50s | Avg: 16m 16s | Max: 33m 45s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 07s | Avg: 11m 01s | Max: 11m 48s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 26s | Avg:  8m 13s | Max: 11m 48s | Hits:  99%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s | Hits:  91%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  5h 45m | Avg: 17m 15s | Max: 43m 20s | Hits:  87%/35611 
      🟩 20                 Pass: 100%/23  | Total:  5h 36m | Avg: 14m 38s | Max: 52m 05s | Hits:  91%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 57s | Avg: 7m 58s | Max: 13m 40s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max: 13m 40s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max: 13m 40s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max: 13m 40s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max: 13m 40s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max: 13m 40s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max: 13m 40s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max: 13m 40s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 40m 27s | Avg: 40m 27s | Max: 40m 27s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
+/- CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

  // Implement part of MemBoundScaling
- items_per_thread = CUB_MAX(1, CUB_MIN(items_per_thread * 4 / accumulator_type.size, items_per_thread * 2));
- block_size = CUB_MIN(block_size, (((1024 * 48) / (accumulator_type.size * items_per_thread)) + 31) / 32 * 32);
+ items_per_thread = cuda::std::clamp(items_per_thread * 4 / accumulator_type.size, 1, items_per_thread * 2);
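
For reference, the nested CUB_MAX/CUB_MIN expression and the clamp call in the hunk above agree whenever the lower bound does not exceed the upper bound (here, 1 <= items_per_thread * 2). The following is a minimal host-only sketch, not the code from this PR. It assumes plain std:: functions from <algorithm> as stand-ins for cuda::std, an int accum_size in place of accumulator_type.size, and hypothetical function names. The std::min form of the block-size line is also an assumption, since the thread does not show its replacement.

```cpp
#include <algorithm>
#include <cassert>

// Old style: nested max/min, the shape the CUB_MAX/CUB_MIN macros expressed.
constexpr int scale_items_old(int items_per_thread, int accum_size)
{
  return std::max(1, std::min(items_per_thread * 4 / accum_size, items_per_thread * 2));
}

// New style: one clamp into the range [1, items_per_thread * 2].
constexpr int scale_items_new(int items_per_thread, int accum_size)
{
  return std::clamp(items_per_thread * 4 / accum_size, 1, items_per_thread * 2);
}

// Block-size cap (assumed std::min replacement for CUB_MIN): the quotient
// 48 KiB / (bytes per thread) is rounded up to a multiple of 32 and then
// used as an upper bound on the block size.
constexpr int scale_block_size(int block_size, int items_per_thread, int accum_size)
{
  return std::min(block_size, (((1024 * 48) / (accum_size * items_per_thread)) + 31) / 32 * 32);
}

// Both spellings agree for values below, inside, and above the range.
static_assert(scale_items_old(1, 8) == scale_items_new(1, 8), "below the lower bound");
static_assert(scale_items_old(16, 4) == scale_items_new(16, 4), "inside the range");
static_assert(scale_items_old(16, 1) == scale_items_new(16, 1), "above the upper bound");
```

One difference worth keeping in mind: std::clamp requires that the lower bound is not greater than the upper bound, while the nested max/min form simply returns the lower bound in that case. With items_per_thread >= 1 the precondition holds.
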
Contributor

I think we need to be consistent with the namespace usage _CUDA_VSTD vs. cuda::std::. Also, the global namespace is missing ::cuda...

Contributor Author

@bernhardmgruber bernhardmgruber Feb 25, 2025

I am following the guideline to use ::cuda::std in all library headers, and cuda::std in all test code and non-includable source files. The latter is the preference of @elstehle. I really don't care. Nobody sat down and put #2635 into a document yet.
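
The practical difference between the two spellings comes down to ordinary C++ name lookup: without the leading ::, the name cuda is searched for starting in the enclosing namespaces, so a nested namespace that happens to be called cuda can shadow the top-level one. A minimal sketch with mock namespaces follows. None of it is CCCL code; user_lib, which(), and the mock cuda::std are hypothetical names used only to show the lookup.

```cpp
#include <cassert>

// Mock of a top-level library namespace (a stand-in, not the real libcu++).
namespace cuda { namespace std {
constexpr int which() { return 1; }
}} // namespace cuda::std

namespace user_lib {

// A nested namespace that happens to share the name `cuda`.
namespace cuda { namespace std {
constexpr int which() { return 2; }
}} // namespace user_lib::cuda::std

// Relative spelling: lookup starts in user_lib and finds user_lib::cuda first.
constexpr int relative_lookup() { return cuda::std::which(); }

// Absolute spelling: the leading :: always names the top-level namespace.
constexpr int absolute_lookup() { return ::cuda::std::which(); }

} // namespace user_lib

static_assert(user_lib::relative_lookup() == 2, "nested namespace shadows the top-level one");
static_assert(user_lib::absolute_lookup() == 1, "leading :: bypasses the nested namespace");
```

Headers are included into arbitrary enclosing contexts, which is where the absolute spelling pays off. In test code and standalone source files there is usually no enclosing namespace that could shadow the name, so the shorter cuda::std spelling is typically harmless there.
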

@bernhardmgruber bernhardmgruber force-pushed the drop_min_max branch 2 times, most recently from b635217 to d01fe85 Compare March 3, 2025 08:34
@bernhardmgruber bernhardmgruber marked this pull request as ready for review March 4, 2025 10:40
@cccl-authenticator-app cccl-authenticator-app bot moved this from In Progress to In Review in CCCL Mar 4, 2025
@bernhardmgruber bernhardmgruber enabled auto-merge (squash) March 4, 2025 11:25
@github-actions
Contributor

github-actions bot commented Mar 4, 2025

🟨 CI finished in 6h 04m: Pass: 98%/93 | Total: 3d 02h | Avg: 48m 10s | Max: 5h 59m | Hits: 41%/133724
  • 🟨 cccl_c_parallel: Pass: 50%/2 | Total: 6h 02m | Avg: 3h 01m | Max: 5h 59m | Hits: 95%/154

    🚨 jobs: Test 🚨
      🟩 Build              Pass: 100%/1   | Total:  2m 37s | Avg:  2m 37s | Max:  2m 37s | Hits:  95%/154   
      🔥 Test               Pass:   0%/1   | Total:  5h 59m | Avg:  5h 59m | Max:  5h 59m
    🟨 cpu
      🟨 amd64              Pass:  50%/2   | Total:  6h 02m | Avg:  3h 01m | Max:  5h 59m | Hits:  95%/154   
    🟨 ctk
      🟨 12.8               Pass:  50%/2   | Total:  6h 02m | Avg:  3h 01m | Max:  5h 59m | Hits:  95%/154   
    🟨 cudacxx
      🟨 nvcc12.8           Pass:  50%/2   | Total:  6h 02m | Avg:  3h 01m | Max:  5h 59m | Hits:  95%/154   
    🟨 cudacxx_family
      🟨 nvcc               Pass:  50%/2   | Total:  6h 02m | Avg:  3h 01m | Max:  5h 59m | Hits:  95%/154   
    🟨 cxx
      🟨 GCC13              Pass:  50%/2   | Total:  6h 02m | Avg:  3h 01m | Max:  5h 59m | Hits:  95%/154   
    🟨 cxx_family
      🟨 GCC                Pass:  50%/2   | Total:  6h 02m | Avg:  3h 01m | Max:  5h 59m | Hits:  95%/154   
    🟨 gpu
      🟨 rtx2080            Pass:  50%/2   | Total:  6h 02m | Avg:  3h 01m | Max:  5h 59m | Hits:  95%/154   
    
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 56m 45s | Max: 1h 24m | Hits: 30%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 20s | Max:  1h 24m | Hits:  31%/51178 
      🟩 arm64              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m | Hits:  16%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 18m | Avg:  1h 03m | Max:  1h 07m | Hits:  15%/5922  
      🟩 12.5               Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  1d 10h | Avg: 55m 07s | Max:  1h 24m | Hits:  34%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  15%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 18m | Avg:  1h 03m | Max:  1h 07m | Hits:  15%/5922  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 40s | Max:  1h 24m | Hits:  34%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  15%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 27s | Max:  1h 24m | Hits:  31%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 08m | Avg:  1h 02m | Max:  1h 04m | Hits:  16%/4880  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m | Hits:  16%/2436  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m | Hits:  16%/2436  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  16%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 59m | Avg: 51m 21s | Max:  1h 05m | Hits:  41%/8194  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  16%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:  16%/1220  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m | Hits:  16%/2440  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  16%/2440  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 08m | Hits:  16%/2436  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  16%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 03m | Avg: 38m 32s | Max:  1h 14m | Hits:  61%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 13m | Hits:  12%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 24m | Hits:  12%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 23m | Avg: 57m 49s | Max:  1h 05m | Hits:  26%/20382 
      🟩 GCC                Pass: 100%/22  | Total: 18h 47m | Avg: 51m 13s | Max:  1h 14m | Hits:  38%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 03m | Avg:  1h 15m | Max:  1h 24m | Hits:  12%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 25s | Max: 28m 43s | Hits:  71%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 13h | Avg:  1h 05m | Max:  1h 24m | Hits:  15%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 12m | Avg: 31m 36s | Max:  1h 00m | Hits:  78%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 15h | Avg:  1h 04m | Max:  1h 24m | Hits:  15%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 43s | Avg: 22m 43s | Max: 22m 43s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 37s | Avg: 17m 37s | Max: 17m 37s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 50s | Max: 23m 30s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 06m | Avg: 22m 15s | Max: 23m 18s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 25s | Max: 28m 43s | Hits:  71%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 14m | Avg:  1h 14m | Max:  1h 14m | Hits:  16%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 46m | Avg:  1h 05m | Max:  1h 24m | Hits:  15%/23591 
      🟩 20                 Pass: 100%/25  | Total: 20h 47m | Avg: 49m 53s | Max:  1h 17m | Hits:  43%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 1d 01h | Avg: 33m 24s | Max: 1h 09m | Hits: 48%/79956

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 42m 38s | Avg: 21m 19s | Max: 31m 30s | Hits:  70%/3556  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 00h | Avg: 33m 34s | Max:  1h 09m | Hits:  49%/76401 
      🟩 arm64              Pass: 100%/2   | Total: 59m 15s | Avg: 29m 37s | Max: 31m 04s | Hits:  41%/3555  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits:  41%/8881  
      🟩 12.5               Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
      🟩 12.8               Pass: 100%/38  | Total: 19h 33m | Avg: 30m 53s | Max:  1h 06m | Hits:  52%/67521 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 54m 22s | Avg: 27m 11s | Max: 27m 19s | Hits:  41%/3554  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits:  41%/8881  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 18h 39m | Avg: 31m 05s | Max:  1h 06m | Hits:  52%/63967 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 54m 22s | Avg: 27m 11s | Max: 27m 19s | Hits:  41%/3554  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 00h | Avg: 33m 41s | Max:  1h 09m | Hits:  49%/76402 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 05m | Avg: 31m 21s | Max: 32m 42s | Hits:  53%/7108  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 13s | Max: 34m 32s | Hits:  41%/3554  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 04s | Max: 35m 12s | Hits:  41%/3554  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 31s | Max: 33m 45s | Hits:  41%/3554  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 42m | Avg: 23m 12s | Max: 31m 17s | Hits:  61%/12439 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 11s | Max: 35m 01s | Hits:  43%/3556  
      🟩 GCC8               Pass: 100%/1   | Total: 35m 39s | Avg: 35m 39s | Max: 35m 39s | Hits:  41%/1778  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 07s | Hits:  48%/3556  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 48s | Max: 33m 55s | Hits:  41%/3556  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 07s | Max: 33m 15s | Hits:  41%/3556  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 56s | Max: 36m 55s | Hits:  41%/3556  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 42m | Avg: 22m 12s | Max: 33m 58s | Hits:  71%/17780 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 00m | Hits:  21%/3542  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 39m | Avg: 53m 02s | Max:  1h 06m | Hits:  25%/5313  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 07m | Avg: 28m 40s | Max: 35m 12s | Hits:  52%/30209 
      🟩 GCC                Pass: 100%/21  | Total:  9h 57m | Avg: 28m 28s | Max: 36m 55s | Hits:  56%/37338 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 39m | Avg: 55m 48s | Max:  1h 06m | Hits:  23%/8855  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 31m 04s | Avg: 15m 32s | Max: 20m 20s | Hits:  70%/3556  
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 31m | Avg: 37m 19s | Max:  1h 09m | Hits:  39%/58637 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 00m | Avg: 24m 03s | Max:  1h 06m | Hits:  74%/17763 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 23h 32m | Avg: 37m 10s | Max:  1h 09m | Hits:  40%/67519 
      🟩 TestCPU            Pass: 100%/3   | Total: 46m 58s | Avg: 15m 39s | Max: 32m 02s | Hits:  90%/5326  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 15s | Avg: 10m 48s | Max: 11m 08s | Hits:  99%/7111  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 31m 04s | Avg: 15m 32s | Max: 20m 20s | Hits:  70%/3556  
      🟩 90;90a;100         Pass: 100%/1   | Total: 33m 01s | Avg: 33m 01s | Max: 33m 01s | Hits:  75%/1778  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 13h 02m | Avg: 39m 07s | Max:  1h 09m | Hits:  38%/35531 
      🟩 20                 Pass: 100%/23  | Total: 11h 17m | Avg: 29m 28s | Max:  1h 08m | Hits:  56%/40869 
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 00m | Avg: 1h 00m | Max: 1h 00m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@github-actions
Contributor

github-actions bot commented Mar 4, 2025

🟩 CI finished in 6h 26m: Pass: 100%/93 | Total: 2d 20h | Avg: 44m 26s | Max: 1h 24m | Hits: 41%/133878
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 56m 45s | Max: 1h 24m | Hits: 30%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 20s | Max:  1h 24m | Hits:  31%/51178 
      🟩 arm64              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m | Hits:  16%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 18m | Avg:  1h 03m | Max:  1h 07m | Hits:  15%/5922  
      🟩 12.5               Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  1d 10h | Avg: 55m 07s | Max:  1h 24m | Hits:  34%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  15%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 18m | Avg:  1h 03m | Max:  1h 07m | Hits:  15%/5922  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 40s | Max:  1h 24m | Hits:  34%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  15%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 27s | Max:  1h 24m | Hits:  31%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 08m | Avg:  1h 02m | Max:  1h 04m | Hits:  16%/4880  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m | Hits:  16%/2436  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m | Hits:  16%/2436  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  16%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 59m | Avg: 51m 21s | Max:  1h 05m | Hits:  41%/8194  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  16%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:  16%/1220  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m | Hits:  16%/2440  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  16%/2440  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 08m | Hits:  16%/2436  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  16%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 03m | Avg: 38m 32s | Max:  1h 14m | Hits:  61%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 13m | Hits:  12%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 24m | Hits:  12%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 23m | Avg: 57m 49s | Max:  1h 05m | Hits:  26%/20382 
      🟩 GCC                Pass: 100%/22  | Total: 18h 47m | Avg: 51m 13s | Max:  1h 14m | Hits:  38%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 03m | Avg:  1h 15m | Max:  1h 24m | Hits:  12%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 25s | Max: 28m 43s | Hits:  71%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 13h | Avg:  1h 05m | Max:  1h 24m | Hits:  15%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 12m | Avg: 31m 36s | Max:  1h 00m | Hits:  78%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 15h | Avg:  1h 04m | Max:  1h 24m | Hits:  15%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 43s | Avg: 22m 43s | Max: 22m 43s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 37s | Avg: 17m 37s | Max: 17m 37s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 50s | Max: 23m 30s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 06m | Avg: 22m 15s | Max: 23m 18s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 25s | Max: 28m 43s | Hits:  71%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 14m | Avg:  1h 14m | Max:  1h 14m | Hits:  16%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 46m | Avg:  1h 05m | Max:  1h 24m | Hits:  15%/23591 
      🟩 20                 Pass: 100%/25  | Total: 20h 47m | Avg: 49m 53s | Max:  1h 17m | Hits:  43%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 1d 01h | Avg: 33m 24s | Max: 1h 09m | Hits: 48%/79956

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 42m 38s | Avg: 21m 19s | Max: 31m 30s | Hits:  70%/3556  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 00h | Avg: 33m 34s | Max:  1h 09m | Hits:  49%/76401 
      🟩 arm64              Pass: 100%/2   | Total: 59m 15s | Avg: 29m 37s | Max: 31m 04s | Hits:  41%/3555  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits:  41%/8881  
      🟩 12.5               Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
      🟩 12.8               Pass: 100%/38  | Total: 19h 33m | Avg: 30m 53s | Max:  1h 06m | Hits:  52%/67521 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 54m 22s | Avg: 27m 11s | Max: 27m 19s | Hits:  41%/3554  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits:  41%/8881  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 18h 39m | Avg: 31m 05s | Max:  1h 06m | Hits:  52%/63967 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 54m 22s | Avg: 27m 11s | Max: 27m 19s | Hits:  41%/3554  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 00h | Avg: 33m 41s | Max:  1h 09m | Hits:  49%/76402 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 05m | Avg: 31m 21s | Max: 32m 42s | Hits:  53%/7108  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 13s | Max: 34m 32s | Hits:  41%/3554  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 04s | Max: 35m 12s | Hits:  41%/3554  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 31s | Max: 33m 45s | Hits:  41%/3554  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 42m | Avg: 23m 12s | Max: 31m 17s | Hits:  61%/12439 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 11s | Max: 35m 01s | Hits:  43%/3556  
      🟩 GCC8               Pass: 100%/1   | Total: 35m 39s | Avg: 35m 39s | Max: 35m 39s | Hits:  41%/1778  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 07s | Hits:  48%/3556  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 48s | Max: 33m 55s | Hits:  41%/3556  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 07s | Max: 33m 15s | Hits:  41%/3556  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 56s | Max: 36m 55s | Hits:  41%/3556  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 42m | Avg: 22m 12s | Max: 33m 58s | Hits:  71%/17780 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 00m | Hits:  21%/3542  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 39m | Avg: 53m 02s | Max:  1h 06m | Hits:  25%/5313  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 07m | Avg: 28m 40s | Max: 35m 12s | Hits:  52%/30209 
      🟩 GCC                Pass: 100%/21  | Total:  9h 57m | Avg: 28m 28s | Max: 36m 55s | Hits:  56%/37338 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 39m | Avg: 55m 48s | Max:  1h 06m | Hits:  23%/8855  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 09m | Hits:   0%/3554  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 31m 04s | Avg: 15m 32s | Max: 20m 20s | Hits:  70%/3556  
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 31m | Avg: 37m 19s | Max:  1h 09m | Hits:  39%/58637 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 00m | Avg: 24m 03s | Max:  1h 06m | Hits:  74%/17763 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 23h 32m | Avg: 37m 10s | Max:  1h 09m | Hits:  40%/67519 
      🟩 TestCPU            Pass: 100%/3   | Total: 46m 58s | Avg: 15m 39s | Max: 32m 02s | Hits:  90%/5326  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 15s | Avg: 10m 48s | Max: 11m 08s | Hits:  99%/7111  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 31m 04s | Avg: 15m 32s | Max: 20m 20s | Hits:  70%/3556  
      🟩 90;90a;100         Pass: 100%/1   | Total: 33m 01s | Avg: 33m 01s | Max: 33m 01s | Hits:  75%/1778  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 13h 02m | Avg: 39m 07s | Max:  1h 09m | Hits:  38%/35531 
      🟩 20                 Pass: 100%/23  | Total: 11h 17m | Avg: 29m 28s | Max:  1h 08m | Hits:  56%/40869 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 30s | Avg: 7m 45s | Max: 12m 53s | Hits: 97%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 12m 53s | Hits:  97%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 12m 53s | Hits:  97%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 12m 53s | Hits:  97%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 12m 53s | Hits:  97%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 12m 53s | Hits:  97%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 12m 53s | Hits:  97%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 12m 53s | Hits:  97%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 37s | Avg:  2m 37s | Max:  2m 37s | Hits:  95%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 53s | Avg: 12m 53s | Max: 12m 53s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 00m | Avg: 1h 00m | Max: 1h 00m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber merged commit 93a6505 into NVIDIA:main Mar 4, 2025
109 of 112 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 4, 2025
@bernhardmgruber bernhardmgruber deleted the drop_min_max branch March 4, 2025 17:13
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025

Labels

breaking: Breaking change
cub: For all items related to CUB

4 participants