KEMBAR78
Add PTX `elect.sync` by fbusato · Pull Request #4445 · NVIDIA/cccl · GitHub
Skip to content

Conversation

@fbusato
Copy link
Contributor

@fbusato fbusato commented Apr 14, 2025

Description

The PR adds (automatically generated) PTX elect.sync code and docs.

@fbusato fbusato added the 3.0 Targeted for 3.0 release label Apr 14, 2025
@fbusato fbusato requested a review from ahendriksen April 14, 2025 19:54
@fbusato fbusato self-assigned this Apr 14, 2025
@fbusato fbusato requested review from a team as code owners April 14, 2025 19:54
@fbusato fbusato added this to CCCL Apr 14, 2025
@fbusato fbusato requested review from gonidelis and wmaxey April 14, 2025 19:54
@github-project-automation github-project-automation bot moved this to Todo in CCCL Apr 14, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Apr 14, 2025
@fbusato fbusato added 3.1.0 Targeted for 3.1 release and removed 3.0 Targeted for 3.0 release labels Apr 14, 2025
@fbusato fbusato enabled auto-merge (squash) April 14, 2025 21:21
@github-actions
Copy link
Contributor

🟩 CI finished in 1h 39m: Pass: 100%/174 | Total: 3d 11h | Avg: 28m 56s | Max: 1h 26m | Hits: 75%/271143
  • 🟩 cub: Pass: 100%/47 | Total: 1d 23h | Avg: 1h 01m | Max: 1h 26m | Hits: 44%/56545

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 21h | Avg:  1h 00m | Max:  1h 26m | Hits:  45%/54087 
      🟩 arm64              Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 12m | Hits:  34%/2458  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 44m | Avg:  1h 08m | Max:  1h 11m | Hits:  34%/5974  
      🟩 12.8               Pass: 100%/42  | Total:  1d 18h | Avg:  1h 00m | Max:  1h 26m | Hits:  45%/50571 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  35%/2120  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 44m | Avg:  1h 08m | Max:  1h 11m | Hits:  34%/5974  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  1d 16h | Avg:  1h 00m | Max:  1h 26m | Hits:  46%/48451 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  35%/2120  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 21h | Avg:  1h 01m | Max:  1h 26m | Hits:  45%/54425 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 31m | Avg:  1h 07m | Max:  1h 10m | Hits:  34%/4924  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 11m | Hits:  34%/2458  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m | Hits:  34%/2458  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 12m | Hits:  34%/2458  
      🟩 Clang18            Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m | Hits:  34%/2458  
      🟩 Clang19            Pass: 100%/7   | Total:  6h 12m | Avg: 53m 10s | Max:  1h 08m | Hits:  54%/8265  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 08m | Hits:  34%/2462  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  34%/1231  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:  34%/2462  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 05m | Hits:  34%/2462  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 11m | Hits:  34%/2458  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m | Hits:  34%/2458  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 54m | Avg: 43m 06s | Max:  1h 17m | Hits:  69%/13519 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 17m | Hits:  34%/2100  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 25m | Hits:   5%/2100  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 48m | Avg:  1h 24m | Max:  1h 26m | Hits:  31%/2272  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 19h 46m | Avg:  1h 02m | Max:  1h 12m | Hits:  41%/23021 
      🟩 GCC                Pass: 100%/22  | Total: 20h 06m | Avg: 54m 50s | Max:  1h 17m | Hits:  52%/27052 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 11m | Avg:  1h 17m | Max:  1h 25m | Hits:  19%/4200  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 48m | Avg:  1h 24m | Max:  1h 26m | Hits:  31%/2272  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 17m | Avg: 25m 52s | Max: 28m 28s | Hits:  77%/3687  
      🟩 rtx2080            Pass: 100%/36  | Total:  1d 17h | Avg:  1h 09m | Max:  1h 26m | Hits:  32%/43026 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 54m | Avg: 36m 45s | Max:  1h 06m | Hits:  83%/9832  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  1d 20h | Avg:  1h 08m | Max:  1h 26m | Hits:  33%/46713 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 30m 08s | Avg: 30m 08s | Max: 30m 08s | Hits:  99%/1229  
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 10s | Avg: 24m 10s | Max: 24m 10s | Hits:  99%/1229  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 25m | Avg: 28m 33s | Max: 30m 21s | Hits:  99%/3687  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 10m | Avg: 23m 31s | Max: 25m 50s | Hits:  99%/3687  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 17m | Avg: 25m 52s | Max: 28m 28s | Hits:  77%/3687  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m | Hits:  34%/1229  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  1d 00h | Avg:  1h 09m | Max:  1h 26m | Hits:  32%/25026 
      🟩 20                 Pass: 100%/26  | Total: 23h 35m | Avg: 54m 27s | Max:  1h 25m | Hits:  53%/31519 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 00h | Avg: 31m 39s | Max: 1h 03m | Hits: 77%/83463

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 38m 45s | Avg: 19m 22s | Max: 27m 17s | Hits:  88%/3554  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 23h 51m | Avg: 31m 49s | Max:  1h 03m | Hits:  77%/79910 
      🟩 arm64              Pass: 100%/2   | Total: 55m 45s | Avg: 27m 52s | Max: 29m 58s | Hits:  77%/3553  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 05m | Avg: 37m 06s | Max: 56m 28s | Hits:  74%/8876  
      🟩 12.8               Pass: 100%/42  | Total: 21h 42m | Avg: 31m 00s | Max:  1h 03m | Hits:  78%/74587 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 49m 05s | Avg: 24m 32s | Max: 25m 06s | Hits:  77%/3552  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 05m | Avg: 37m 06s | Max: 56m 28s | Hits:  74%/8876  
      🟩 nvcc12.8           Pass: 100%/40  | Total: 20h 52m | Avg: 31m 19s | Max:  1h 03m | Hits:  78%/71035 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 49m 05s | Avg: 24m 32s | Max: 25m 06s | Hits:  77%/3552  
      🟩 nvcc               Pass: 100%/45  | Total: 23h 58m | Avg: 31m 57s | Max:  1h 03m | Hits:  77%/79911 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 12s | Max: 33m 07s | Hits:  77%/7104  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 01m | Avg: 30m 58s | Max: 33m 17s | Hits:  77%/3552  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 15s | Max: 34m 17s | Hits:  77%/3552  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 14s | Max: 32m 08s | Hits:  77%/3552  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 07s | Max: 31m 35s | Hits:  77%/3552  
      🟩 Clang19            Pass: 100%/7   | Total:  2h 29m | Avg: 21m 23s | Max: 28m 33s | Hits:  83%/12432 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 41s | Max: 34m 54s | Hits:  77%/3554  
      🟩 GCC8               Pass: 100%/1   | Total: 33m 57s | Avg: 33m 57s | Max: 33m 57s | Hits:  77%/1777  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 55s | Max: 32m 33s | Hits:  77%/3554  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 41s | Max: 31m 23s | Hits:  77%/3554  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 18m | Avg: 39m 22s | Max: 39m 27s | Hits:  77%/3554  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 39s | Max: 34m 12s | Hits:  77%/3554  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 34m | Avg: 21m 24s | Max: 32m 20s | Hits:  86%/17770 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 51m | Avg: 55m 30s | Max: 56m 28s | Hits:  65%/3540  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 28m | Avg: 49m 24s | Max:  1h 03m | Hits:  54%/5310  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 59s | Max:  1h 00m | Hits:  65%/3552  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  8h 43m | Avg: 27m 33s | Max: 34m 17s | Hits:  79%/33744 
      🟩 GCC                Pass: 100%/21  | Total:  9h 44m | Avg: 27m 50s | Max: 39m 27s | Hits:  81%/37317 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 19m | Avg: 51m 51s | Max:  1h 03m | Hits:  59%/8850  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 59s | Max:  1h 00m | Hits:  65%/3552  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 29m 15s | Avg: 14m 37s | Max: 17m 34s | Hits:  88%/3554  
      🟩 rtx2080            Pass: 100%/35  | Total: 20h 36m | Avg: 35m 20s | Max:  1h 03m | Hits:  73%/62156 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 41m | Avg: 22m 08s | Max: 54m 29s | Hits:  89%/17753 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 23h 16m | Avg: 34m 54s | Max:  1h 03m | Hits:  73%/71033 
      🟩 TestCPU            Pass: 100%/3   | Total: 46m 17s | Avg: 15m 25s | Max: 30m 17s | Hits:  99%/5323  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 11s | Avg: 11m 17s | Max: 11m 41s | Hits:  99%/7107  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 29m 15s | Avg: 14m 37s | Max: 17m 34s | Hits:  88%/3554  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 13s | Avg: 32m 13s | Max: 32m 13s | Hits:  77%/1777  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 12h 56m | Avg: 36m 57s | Max:  1h 03m | Hits:  71%/37287 
      🟩 20                 Pass: 100%/24  | Total: 11h 12m | Avg: 28m 01s | Max:  1h 00m | Hits:  81%/42622 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 7h 00m | Avg: 9m 20s | Max: 28m 48s | Hits: 86%/116267

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 47m | Avg:  9m 28s | Max: 28m 48s | Hits:  86%/110236
      🟩 arm64              Pass: 100%/2   | Total: 13m 02s | Avg:  6m 31s | Max:  8m 27s | Hits:  91%/6031  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 36m 58s | Avg:  7m 23s | Max: 20m 31s | Hits:  97%/14694 
      🟩 12.8               Pass: 100%/40  | Total:  6h 23m | Avg:  9m 35s | Max: 28m 48s | Hits:  85%/101573
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 44m 49s | Avg: 22m 24s | Max: 23m 16s | Hits:  27%/5991  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 36m 58s | Avg:  7m 23s | Max: 20m 31s | Hits:  97%/14694 
      🟩 nvcc12.8           Pass: 100%/38  | Total:  5h 38m | Avg:  8m 54s | Max: 28m 48s | Hits:  88%/95582 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 44m 49s | Avg: 22m 24s | Max: 23m 16s | Hits:  27%/5991  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 15m | Avg:  8m 43s | Max: 28m 48s | Hits:  89%/110276
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 19m 15s | Avg:  4m 48s | Max:  5m 20s | Hits:  97%/11950 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 27s | Hits:  97%/5987  
      🟩 Clang16            Pass: 100%/2   | Total: 16m 45s | Avg:  8m 22s | Max: 11m 10s | Hits:  88%/5987  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 34s | Hits:  97%/5987  
      🟩 Clang18            Pass: 100%/2   | Total: 10m 05s | Avg:  5m 02s | Max:  5m 14s | Hits:  96%/5987  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 08m | Avg: 11m 28s | Max: 23m 16s | Hits:  69%/14993 
      🟩 GCC7               Pass: 100%/2   | Total:  7m 54s | Avg:  3m 57s | Max:  4m 07s | Hits:  97%/5923  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s | Hits:  97%/2972  
      🟩 GCC9               Pass: 100%/2   | Total:  8m 13s | Avg:  4m 06s | Max:  4m 35s | Hits:  97%/5935  
      🟩 GCC10              Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 11s | Hits:  96%/5993  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 21s | Avg:  4m 40s | Max:  5m 00s | Hits:  96%/5989  
      🟩 GCC12              Pass: 100%/2   | Total:  9m 16s | Avg:  4m 38s | Max:  4m 39s | Hits:  97%/5989  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 53m | Avg: 11m 20s | Max: 23m 54s | Hits:  94%/15255 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 45m 27s | Avg: 22m 43s | Max: 24m 56s | Hits:  64%/5635  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 53m 35s | Avg: 26m 47s | Max: 28m 48s | Hits:   2%/5708  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 23m 11s | Avg: 11m 35s | Max: 11m 37s | Hits:  97%/5977  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 16m | Avg:  7m 33s | Max: 23m 16s | Hits:  88%/50891 
      🟩 GCC                Pass: 100%/21  | Total:  2h 42m | Avg:  7m 42s | Max: 23m 54s | Hits:  96%/48056 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 39m | Avg: 24m 45s | Max: 28m 48s | Hits:  33%/11343 
      🟩 NVHPC              Pass: 100%/2   | Total: 23m 11s | Avg: 11m 35s | Max: 11m 37s | Hits:  97%/5977  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 23m 29s | Avg: 11m 44s | Max: 14m 07s | Hits:  95%/3105  
      🟩 rtx2080            Pass: 100%/43  | Total:  6h 36m | Avg:  9m 13s | Max: 28m 48s | Hits:  86%/113162
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  5h 40m | Avg:  8m 43s | Max: 28m 48s | Hits:  86%/116227
      🟩 NVRTC              Pass: 100%/2   | Total: 44m 25s | Avg: 22m 12s | Max: 23m 54s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 33m 22s | Avg: 11m 07s | Max: 14m 07s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 44m 25s | Avg: 22m 12s | Max: 23m 54s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 23m 29s | Avg: 11m 44s | Max: 14m 07s | Hits:  95%/3105  
      🟩 90;90a;100         Pass: 100%/1   | Total: 14m 57s | Avg: 14m 57s | Max: 14m 57s | Hits:  95%/3105  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  3h 18m | Avg:  9m 02s | Max: 24m 56s | Hits:  86%/61936 
      🟩 20                 Pass: 100%/22  | Total:  3h 39m | Avg:  9m 58s | Max: 28m 48s | Hits:  86%/54331 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 3h 05m | Avg: 7m 07s | Max: 14m 16s | Hits: 89%/14540

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 45m | Avg:  7m 32s | Max: 14m 16s | Hits:  89%/12212 
      🟩 arm64              Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 05s | Hits:  90%/2328  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 21m 30s | Avg:  7m 10s | Max: 11m 57s | Hits:  87%/1452  
      🟩 12.8               Pass: 100%/23  | Total:  2h 43m | Avg:  7m 07s | Max: 14m 16s | Hits:  90%/13088 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 21m 30s | Avg:  7m 10s | Max: 11m 57s | Hits:  87%/1452  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 43m | Avg:  7m 07s | Max: 14m 16s | Hits:  90%/13088 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  3h 05m | Avg:  7m 07s | Max: 14m 16s | Hits:  89%/14540 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 07s | Hits:  90%/1168  
      🟩 Clang15            Pass: 100%/1   | Total:  5m 29s | Avg:  5m 29s | Max:  5m 29s | Hits:  90%/582   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 45s | Avg:  5m 45s | Max:  5m 45s | Hits:  90%/582   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s | Hits:  90%/582   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 07s | Avg:  5m 07s | Max:  5m 07s | Hits:  90%/582   
      🟩 Clang19            Pass: 100%/4   | Total: 28m 03s | Avg:  7m 00s | Max: 12m 17s | Hits:  92%/2328  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 18s | Avg:  5m 09s | Max:  5m 20s | Hits:  90%/1168  
      🟩 GCC11              Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s | Hits:  90%/582   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 15s | Avg:  6m 15s | Max:  6m 15s | Hits:  90%/582   
      🟩 GCC13              Pass: 100%/8   | Total: 57m 00s | Avg:  7m 07s | Max: 14m 16s | Hits:  92%/4656  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 57s | Avg: 11m 57s | Max: 11m 57s | Hits:  77%/284   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 14m 15s | Avg: 14m 15s | Max: 14m 15s | Hits:  34%/284   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 20m 48s | Avg: 10m 24s | Max: 10m 41s | Hits:  88%/1160  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 59m 20s | Avg:  5m 56s | Max: 12m 17s | Hits:  91%/5824  
      🟩 GCC                Pass: 100%/12  | Total:  1h 18m | Avg:  6m 34s | Max: 14m 16s | Hits:  91%/6988  
      🟩 MSVC               Pass: 100%/2   | Total: 26m 12s | Avg: 13m 06s | Max: 14m 15s | Hits:  56%/568   
      🟩 NVHPC              Pass: 100%/2   | Total: 20m 48s | Avg: 10m 24s | Max: 10m 41s | Hits:  88%/1160  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 19m 14s | Avg:  9m 37s | Max: 14m 16s | Hits:  94%/1164  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 46m | Avg:  6m 55s | Max: 14m 15s | Hits:  89%/13376 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  2h 25m | Avg:  6m 20s | Max: 14m 15s | Hits:  88%/12794 
      🟩 Test               Pass: 100%/3   | Total: 39m 31s | Avg: 13m 10s | Max: 14m 16s | Hits:  99%/1746  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 23m 25s | Avg:  7m 48s | Max: 14m 16s | Hits:  93%/1746  
      🟩 90a                Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s | Hits:  90%/582   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 24m 27s | Avg:  6m 06s | Max: 10m 41s | Hits:  89%/2326  
      🟩 20                 Pass: 100%/22  | Total:  2h 40m | Avg:  7m 18s | Max: 14m 16s | Hits:  90%/12214 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 19m 08s | Avg: 4m 47s | Max: 5m 34s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  5m 34s
      🟩 arm64              Pass: 100%/2   | Total:  8m 08s | Avg:  4m 04s | Max:  4m 17s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 34s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 34s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 34s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 34s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 34s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 34s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 34s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  9m 25s | Avg:  4m 42s | Max:  5m 34s
      🟩 20                 Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  5m 26s
    
  • 🟩 python: Pass: 100%/3 | Total: 25m 33s | Avg: 8m 31s | Max: 16m 32s

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 25m 33s | Avg:  8m 31s | Max: 16m 32s
    🟩 ctk
      🟩 12.8               Pass: 100%/3   | Total: 25m 33s | Avg:  8m 31s | Max: 16m 32s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/3   | Total: 25m 33s | Avg:  8m 31s | Max: 16m 32s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 25m 33s | Avg:  8m 31s | Max: 16m 32s
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 25m 33s | Avg:  8m 31s | Max: 16m 32s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 25m 33s | Avg:  8m 31s | Max: 16m 32s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/3   | Total: 25m 33s | Avg:  8m 31s | Max: 16m 32s
    🟩 jobs
      🟩 cuda.cccl          Pass: 100%/1   | Total:  3m 06s | Avg:  3m 06s | Max:  3m 06s
      🟩 cuda.cooperative   Pass: 100%/1   | Total: 16m 32s | Avg: 16m 32s | Max: 16m 32s
      🟩 cuda.parallel      Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits: 96%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits:  96%/328   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits:  96%/328   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits:  96%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits:  96%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits:  96%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits:  96%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 24m 41s | Avg: 12m 20s | Max: 22m 14s | Hits:  96%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 27s | Avg:  2m 27s | Max:  2m 27s | Hits:  94%/164   
      🟩 Test               Pass: 100%/1   | Total: 22m 14s | Avg: 22m 14s | Max: 22m 14s | Hits:  98%/164   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 174)

# Runner
123 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
10 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Copy link
Contributor

@ahendriksen ahendriksen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@fbusato fbusato merged commit a4a4049 into NVIDIA:main Apr 15, 2025
192 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Apr 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

3.1.0 Targeted for 3.1 release

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

4 participants