KEMBAR78
Simplify Thrust assign_value by bernhardmgruber · Pull Request #4227 · NVIDIA/cccl · GitHub
Skip to content

Conversation

@bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Mar 21, 2025

This avoids instantiating thrust::copy and thrust::transform when assigning a single element between thrust references.

Copy link
Contributor

@miscco miscco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:sad approve:

@github-actions
Copy link
Contributor

🟨 CI finished in 1h 40m: Pass: 95%/97 | Total: 2d 20h | Avg: 42m 40s | Max: 1h 27m | Hits: 63%/128351
  • 🟨 cub: Pass: 95%/45 | Total: 1d 19h | Avg: 57m 48s | Max: 1h 27m | Hits: 51%/51672

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/43  | Total:  1d 17h | Avg: 57m 29s | Max:  1h 27m | Hits:  51%/49228 
      🟩 arm64              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 08m | Hits:  46%/2444  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  5h 46m | Avg:  1h 09m | Max:  1h 17m | Hits:  42%/5940  
      🟩 12.6               Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m | Hits:  10%/2260  
      🔍 12.8               Pass:  94%/38  | Total:  1d 10h | Avg: 55m 14s | Max:  1h 27m | Hits:  54%/43472 
    🚨 cudacxx: ClangCUDA18 🚨
      🔥 ClangCUDA18        Pass:   0%/2   | Total: 19m 13s | Avg:  9m 36s | Max:  9m 41s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 46m | Avg:  1h 09m | Max:  1h 17m | Hits:  42%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m | Hits:  10%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 10h | Avg: 57m 46s | Max:  1h 27m | Hits:  54%/43472 
    🚨 cudacxx_family: ClangCUDA 🚨
      🔥 ClangCUDA          Pass:   0%/2   | Total: 19m 13s | Avg:  9m 36s | Max:  9m 41s
      🟩 nvcc               Pass: 100%/43  | Total:  1d 19h | Avg:  1h 00m | Max:  1h 27m | Hits:  51%/51672 
    🔍 cxx: Clang18 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  4h 14m | Avg:  1h 03m | Max:  1h 06m | Hits:  69%/4896  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  69%/2444  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  69%/2444  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  69%/2444  
      🔍 Clang18            Pass:  71%/7   | Total:  4h 21m | Avg: 37m 17s | Max:  1h 04m | Hits:  81%/6110  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 17m | Hits:  27%/2448  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:  23%/1224  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 07m | Hits:  24%/2448  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 08m | Hits:  23%/2448  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m | Hits:  24%/2444  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 08m | Hits:  24%/2444  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 59m | Avg: 43m 35s | Max:  1h 15m | Hits:  66%/13442 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 41m | Avg:  1h 20m | Max:  1h 27m | Hits:  18%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 41m | Avg:  1h 20m | Max:  1h 23m | Hits:  13%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m | Hits:  10%/2260  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  88%/17  | Total: 14h 54m | Avg: 52m 38s | Max:  1h 07m | Hits:  73%/18338 
      🟩 GCC                Pass: 100%/22  | Total: 20h 27m | Avg: 55m 48s | Max:  1h 17m | Hits:  45%/26898 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 23m | Avg:  1h 20m | Max:  1h 27m | Hits:  16%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m | Hits:  10%/2260  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/3   | Total:  1h 25m | Avg: 28m 35s | Max: 30m 23s | Hits:  78%/3666  
      🔍 rtx2080            Pass:  94%/34  | Total:  1d 12h | Avg:  1h 05m | Max:  1h 27m | Hits:  40%/38230 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 58m | Avg: 37m 21s | Max:  1h 05m | Hits:  86%/9776  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  94%/37  | Total:  1d 15h | Avg:  1h 04m | Max:  1h 27m | Hits:  40%/41896 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 32m 16s | Avg: 32m 16s | Max: 32m 16s | Hits:  99%/1222  
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 18s | Avg: 24m 18s | Max: 24m 18s | Hits:  99%/1222  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 30m | Avg: 30m 15s | Max: 32m 34s | Hits:  99%/3666  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 17m | Avg: 25m 54s | Max: 26m 28s | Hits:  99%/3666  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 25m | Avg: 28m 35s | Max: 30m 23s | Hits:  78%/3666  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 15m | Avg:  1h 15m | Max:  1h 15m | Hits:  23%/1222  
    🟨 std
      🟨 17                 Pass:  95%/20  | Total: 22h 15m | Avg:  1h 06m | Max:  1h 27m | Hits:  37%/22608 
      🟨 20                 Pass:  96%/25  | Total: 21h 05m | Avg: 50m 37s | Max:  1h 23m | Hits:  62%/29064 
    
  • 🟨 thrust: Pass: 95%/45 | Total: 23h 44m | Avg: 31m 39s | Max: 1h 09m | Hits: 71%/76359

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/43  | Total: 22h 49m | Avg: 31m 50s | Max:  1h 09m | Hits:  70%/72806 
      🟩 arm64              Pass: 100%/2   | Total: 55m 14s | Avg: 27m 37s | Max: 29m 24s | Hits:  77%/3553  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 12m | Avg: 38m 26s | Max:  1h 08m | Hits:  69%/8876  
      🟩 12.6               Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m | Hits:   3%/3552  
      🔍 12.8               Pass:  94%/38  | Total: 18h 15m | Avg: 28m 49s | Max:  1h 08m | Hits:  75%/63931 
    🚨 cudacxx: ClangCUDA18 🚨
      🔥 ClangCUDA18        Pass:   0%/2   | Total: 15m 29s | Avg:  7m 44s | Max:  8m 04s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 12m | Avg: 38m 26s | Max:  1h 08m | Hits:  69%/8876  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m | Hits:   3%/3552  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 17h 59m | Avg: 29m 59s | Max:  1h 08m | Hits:  75%/63931 
    🚨 cudacxx_family: ClangCUDA 🚨
      🔥 ClangCUDA          Pass:   0%/2   | Total: 15m 29s | Avg:  7m 44s | Max:  8m 04s
      🟩 nvcc               Pass: 100%/43  | Total: 23h 28m | Avg: 32m 45s | Max:  1h 09m | Hits:  71%/76359 
    🔍 cxx: Clang18 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 45s | Max: 30m 30s | Hits:  77%/7104  
      🟩 Clang15            Pass: 100%/2   | Total: 57m 14s | Avg: 28m 37s | Max: 29m 24s | Hits:  77%/3552  
      🟩 Clang16            Pass: 100%/2   | Total: 59m 31s | Avg: 29m 45s | Max: 31m 17s | Hits:  77%/3552  
      🟩 Clang17            Pass: 100%/2   | Total: 58m 50s | Avg: 29m 25s | Max: 29m 51s | Hits:  77%/3552  
      🔍 Clang18            Pass:  71%/7   | Total:  1h 59m | Avg: 17m 04s | Max: 31m 06s | Hits:  86%/8880  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 05m | Avg: 32m 33s | Max: 34m 25s | Hits:  77%/3554  
      🟩 GCC8               Pass: 100%/1   | Total: 32m 37s | Avg: 32m 37s | Max: 32m 37s | Hits:  77%/1777  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 41s | Max: 32m 49s | Hits:  77%/3554  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 07s | Max: 32m 08s | Hits:  77%/3554  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 46s | Max: 35m 43s | Hits:  62%/3554  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 36s | Max: 33m 02s | Hits:  77%/3554  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 40m | Avg: 22m 03s | Max: 33m 45s | Hits:  83%/17770 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 08m | Hits:  32%/3540  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 42m | Avg: 54m 00s | Max:  1h 07m | Hits:  38%/5310  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m | Hits:   3%/3552  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  88%/17  | Total:  6h 54m | Avg: 24m 21s | Max: 31m 17s | Hits:  80%/26640 
      🟩 GCC                Pass: 100%/21  | Total:  9h 34m | Avg: 27m 21s | Max: 35m 43s | Hits:  79%/37317 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 58m | Avg: 59m 43s | Max:  1h 08m | Hits:  36%/8850  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m | Hits:   3%/3552  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 31m 12s | Avg: 15m 36s | Max: 18m 21s | Hits:  88%/3554  
      🔍 rtx2080            Pass:  93%/33  | Total: 19h 14m | Avg: 34m 58s | Max:  1h 09m | Hits:  66%/55052 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 58m | Avg: 23m 53s | Max:  1h 07m | Hits:  80%/17753 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  94%/38  | Total: 22h 07m | Avg: 34m 56s | Max:  1h 09m | Hits:  66%/63929 
      🟩 TestCPU            Pass: 100%/3   | Total: 42m 21s | Avg: 14m 07s | Max: 26m 25s | Hits:  99%/5323  
      🟩 TestGPU            Pass: 100%/4   | Total: 54m 13s | Avg: 13m 33s | Max: 18m 07s | Hits:  92%/7107  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 42m 51s | Avg: 21m 25s | Max: 24m 44s | Hits:  74%/3554  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 31m 12s | Avg: 15m 36s | Max: 18m 21s | Hits:  88%/3554  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 04s | Avg: 32m 04s | Max: 32m 04s | Hits:  77%/1777  
    🟨 std
      🟨 17                 Pass:  95%/20  | Total: 12h 31m | Avg: 37m 35s | Max:  1h 09m | Hits:  65%/33735 
      🟨 20                 Pass:  95%/23  | Total: 10h 29m | Avg: 27m 22s | Max:  1h 07m | Hits:  75%/39070 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 28s | Avg: 4m 07s | Max: 4m 48s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  4m 48s
      🟩 arm64              Pass: 100%/2   | Total:  7m 02s | Avg:  3m 31s | Max:  3m 35s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 28s | Avg:  4m 07s | Max:  4m 48s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 28s | Avg:  4m 07s | Max:  4m 48s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 28s | Avg:  4m 07s | Max:  4m 48s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 28s | Avg:  4m 07s | Max:  4m 48s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 28s | Avg:  4m 07s | Max:  4m 48s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 28s | Avg:  4m 07s | Max:  4m 48s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 28s | Avg:  4m 07s | Max:  4m 48s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 15s | Avg:  4m 07s | Max:  4m 48s
      🟩 20                 Pass: 100%/2   | Total:  8m 13s | Avg:  4m 06s | Max:  4m 38s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 19m 24s | Avg: 9m 42s | Max: 17m 02s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 19m 24s | Avg:  9m 42s | Max: 17m 02s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 19m 24s | Avg:  9m 42s | Max: 17m 02s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 19m 24s | Avg:  9m 42s | Max: 17m 02s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 19m 24s | Avg:  9m 42s | Max: 17m 02s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 19m 24s | Avg:  9m 42s | Max: 17m 02s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 19m 24s | Avg:  9m 42s | Max: 17m 02s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 19m 24s | Avg:  9m 42s | Max: 17m 02s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 22s | Avg:  2m 22s | Max:  2m 22s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 17m 02s | Avg: 17m 02s | Max: 17m 02s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 17m | Avg: 1h 17m | Max: 1h 17m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 97)

# Runner
68 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@github-actions
Copy link
Contributor

🟨 CI finished in 1h 34m: Pass: 95%/97 | Total: 2d 19h | Avg: 41m 27s | Max: 1h 25m | Hits: 76%/126904
  • 🟨 thrust: Pass: 91%/45 | Total: 22h 31m | Avg: 30m 02s | Max: 56m 19s | Hits: 77%/72804

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  90%/43  | Total: 21h 37m | Avg: 30m 10s | Max: 56m 19s | Hits:  77%/69251 
      🟩 arm64              Pass: 100%/2   | Total: 54m 05s | Avg: 27m 02s | Max: 28m 54s | Hits:  77%/3553  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 01m | Avg: 36m 23s | Max: 56m 19s | Hits:  75%/8876  
      🟩 12.6               Pass: 100%/2   | Total:  1h 41m | Avg: 50m 54s | Max: 51m 48s | Hits:  66%/3552  
      🔍 12.8               Pass:  89%/38  | Total: 17h 48m | Avg: 28m 06s | Max: 53m 27s | Hits:  78%/60376 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 48m 08s | Avg: 24m 04s | Max: 24m 22s | Hits:  77%/3552  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 01m | Avg: 36m 23s | Max: 56m 19s | Hits:  75%/8876  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  1h 41m | Avg: 50m 54s | Max: 51m 48s | Hits:  66%/3552  
      🔍 nvcc12.8           Pass:  88%/36  | Total: 17h 00m | Avg: 28m 20s | Max: 53m 27s | Hits:  78%/56824 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 48m 08s | Avg: 24m 04s | Max: 24m 22s | Hits:  77%/3552  
      🔍 nvcc               Pass:  90%/43  | Total: 21h 43m | Avg: 30m 19s | Max: 56m 19s | Hits:  77%/69252 
    🚨 jobs: TestGPU 🚨
      🟩 Build              Pass: 100%/38  | Total: 21h 01m | Avg: 33m 11s | Max: 56m 19s | Hits:  75%/67481 
      🟩 TestCPU            Pass: 100%/3   | Total: 42m 14s | Avg: 14m 04s | Max: 26m 41s | Hits:  99%/5323  
      🔥 TestGPU            Pass:   0%/4   | Total: 48m 16s | Avg: 12m 04s | Max: 12m 35s
    🔍 sm: 90 🔍
      🔍 90                 Pass:  50%/2   | Total: 31m 02s | Avg: 15m 31s | Max: 18m 27s | Hits:  77%/1777  
      🟩 90;90a;100         Pass: 100%/1   | Total: 29m 41s | Avg: 29m 41s | Max: 29m 41s | Hits:  77%/1777  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 11h 41m | Avg: 35m 05s | Max: 56m 19s | Hits:  75%/35511 
      🔍 20                 Pass:  86%/23  | Total: 10h 10m | Avg: 26m 31s | Max: 53m 27s | Hits:  79%/35516 
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 57m | Avg: 29m 18s | Max: 30m 02s | Hits:  77%/7104  
      🟩 Clang15            Pass: 100%/2   | Total: 58m 56s | Avg: 29m 28s | Max: 29m 47s | Hits:  77%/3552  
      🟩 Clang16            Pass: 100%/2   | Total: 57m 17s | Avg: 28m 38s | Max: 28m 58s | Hits:  77%/3552  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 01m | Avg: 30m 32s | Max: 31m 08s | Hits:  77%/3552  
      🟨 Clang18            Pass:  85%/7   | Total:  2h 31m | Avg: 21m 35s | Max: 29m 34s | Hits:  81%/10656 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 07s | Max: 33m 35s | Hits:  77%/3554  
      🟩 GCC8               Pass: 100%/1   | Total: 30m 12s | Avg: 30m 12s | Max: 30m 12s | Hits:  77%/1777  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 06s | Max: 32m 18s | Hits:  77%/3554  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 16s | Max: 32m 48s | Hits:  77%/3554  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 01s | Max: 34m 10s | Hits:  77%/3554  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 45s | Max: 32m 34s | Hits:  77%/3554  
      🟨 GCC13              Pass:  70%/10  | Total:  3h 37m | Avg: 21m 46s | Max: 35m 34s | Hits:  80%/12439 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 04s | Max: 56m 19s | Hits:  66%/3540  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 11m | Avg: 43m 54s | Max: 53m 27s | Hits:  77%/5310  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  1h 41m | Avg: 50m 54s | Max: 51m 48s | Hits:  66%/3552  
    🟨 cxx_family
      🟨 Clang              Pass:  94%/17  | Total:  7h 25m | Avg: 26m 12s | Max: 31m 08s | Hits:  79%/28416 
      🟨 GCC                Pass:  85%/21  | Total:  9h 24m | Avg: 26m 52s | Max: 35m 34s | Hits:  78%/31986 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 59m | Avg: 47m 58s | Max: 56m 19s | Hits:  73%/8850  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 41m | Avg: 50m 54s | Max: 51m 48s | Hits:  66%/3552  
    🟨 gpu
      🟨 h100               Pass:  50%/2   | Total: 31m 02s | Avg: 15m 31s | Max: 18m 27s | Hits:  77%/1777  
      🟩 rtx2080            Pass: 100%/33  | Total: 18h 19m | Avg: 33m 19s | Max: 56m 19s | Hits:  75%/58604 
      🟨 rtx4090            Pass:  70%/10  | Total:  3h 41m | Avg: 22m 07s | Max: 53m 27s | Hits:  85%/12423 
    🟨 cmake_options
      🟨 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:  50%/2   | Total: 40m 03s | Avg: 20m 01s | Max: 27m 51s | Hits:  77%/1777  
    
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 56m 47s | Max: 1h 25m | Hits: 75%/53780

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 25s | Max:  1h 25m | Hits:  75%/51336 
      🟩 arm64              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  69%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 16m | Avg:  1h 03m | Max:  1h 05m | Hits:  70%/5940  
      🟩 12.6               Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  69%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  1d 11h | Avg: 55m 17s | Max:  1h 25m | Hits:  76%/45580 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  75%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 16m | Avg:  1h 03m | Max:  1h 05m | Hits:  70%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  69%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 59s | Max:  1h 25m | Hits:  76%/43472 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  75%/2108  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 36s | Max:  1h 25m | Hits:  75%/51672 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 02m | Avg:  1h 00m | Max:  1h 03m | Hits:  69%/4896  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 58m | Avg: 59m 01s | Max:  1h 00m | Hits:  69%/2444  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  69%/2444  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 06m | Hits:  69%/2444  
      🟩 Clang18            Pass: 100%/7   | Total:  6h 25m | Avg: 55m 04s | Max:  1h 25m | Hits:  80%/8218  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  69%/2448  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m | Hits:  69%/1224  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  69%/2448  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m | Hits:  69%/2448  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  69%/2444  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:  69%/2444  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 27m | Avg: 40m 43s | Max:  1h 08m | Hits:  85%/13442 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 14m | Hits:  75%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 35m | Avg:  1h 17m | Max:  1h 22m | Hits:  75%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  69%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 36m | Avg: 58m 36s | Max:  1h 25m | Hits:  73%/20446 
      🟩 GCC                Pass: 100%/22  | Total: 18h 46m | Avg: 51m 11s | Max:  1h 08m | Hits:  77%/26898 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 55m | Avg:  1h 13m | Max:  1h 22m | Hits:  75%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  69%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 25m | Avg: 28m 35s | Max: 29m 52s | Hits:  89%/3666  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 04m | Max:  1h 25m | Hits:  70%/40338 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 42m | Avg: 35m 20s | Max:  1h 02m | Hits:  92%/9776  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 03m | Max:  1h 25m | Hits:  70%/44004 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 44s | Avg: 28m 44s | Max: 28m 44s | Hits:  99%/1222  
      🟩 GraphCapture       Pass: 100%/1   | Total: 22m 54s | Avg: 22m 54s | Max: 22m 54s | Hits:  99%/1222  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 28m | Avg: 29m 28s | Max: 30m 13s | Hits:  99%/3666  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 16m | Avg: 25m 25s | Max: 27m 13s | Hits:  99%/3666  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 25m | Avg: 28m 35s | Max: 29m 52s | Hits:  89%/3666  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits:  69%/1222  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 39m | Avg:  1h 04m | Max:  1h 25m | Hits:  70%/23662 
      🟩 20                 Pass: 100%/25  | Total: 20h 56m | Avg: 50m 15s | Max:  1h 22m | Hits:  79%/30118 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 02s | Avg: 4m 00s | Max: 4m 30s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 57s | Avg:  4m 28s | Max:  4m 30s
      🟩 arm64              Pass: 100%/2   | Total:  7m 05s | Avg:  3m 32s | Max:  3m 34s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 02s | Avg:  4m 00s | Max:  4m 30s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 02s | Avg:  4m 00s | Max:  4m 30s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 02s | Avg:  4m 00s | Max:  4m 30s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 02s | Avg:  4m 00s | Max:  4m 30s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 02s | Avg:  4m 00s | Max:  4m 30s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 02s | Avg:  4m 00s | Max:  4m 30s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 02s | Avg:  4m 00s | Max:  4m 30s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 04s | Avg:  4m 02s | Max:  4m 30s
      🟩 20                 Pass: 100%/2   | Total:  7m 58s | Avg:  3m 59s | Max:  4m 27s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 18m 24s | Avg: 9m 12s | Max: 16m 07s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 16m 07s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 16m 07s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 16m 07s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 16m 07s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 16m 07s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 16m 07s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 16m 07s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 16m 07s | Avg: 16m 07s | Max: 16m 07s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 20m | Avg: 1h 20m | Max: 1h 20m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 97)

# Runner
68 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

This avoids instantiating thrust::copy and thrust::transform when
assigning a single element between thrust references.
@github-actions
Copy link
Contributor

🟩 CI finished in 1h 33m: Pass: 100%/97 | Total: 2d 19h | Avg: 41m 57s | Max: 1h 20m | Hits: 78%/134011
  • 🟩 cub: Pass: 100%/45 | Total: 1d 19h | Avg: 57m 39s | Max: 1h 14m | Hits: 75%/53780

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 17h | Avg: 57m 22s | Max:  1h 14m | Hits:  75%/51336 
      🟩 arm64              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 04m | Hits:  69%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 39m | Avg:  1h 07m | Max:  1h 11m | Hits:  70%/5940  
      🟩 12.6               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  1d 11h | Avg: 55m 33s | Max:  1h 14m | Hits:  76%/45580 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  75%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 39m | Avg:  1h 07m | Max:  1h 11m | Hits:  70%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 09h | Avg: 55m 07s | Max:  1h 14m | Hits:  76%/43472 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  75%/2108  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 17h | Avg: 57m 23s | Max:  1h 14m | Hits:  75%/51672 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 25m | Avg:  1h 06m | Max:  1h 11m | Hits:  69%/4896  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 05m | Hits:  69%/2444  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  69%/2444  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 36s | Max: 59m 56s | Hits:  69%/2444  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 56m | Avg: 50m 56s | Max:  1h 05m | Hits:  80%/8218  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m | Hits:  69%/2448  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m | Hits:  69%/1224  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 09m | Hits:  69%/2448  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  69%/2448  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  69%/2444  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 10m | Hits:  69%/2444  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 26m | Avg: 40m 37s | Max:  1h 13m | Hits:  85%/13442 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 14m | Hits:  75%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 14m | Hits:  75%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 30m | Avg: 58m 15s | Max:  1h 11m | Hits:  73%/20446 
      🟩 GCC                Pass: 100%/22  | Total: 19h 28m | Avg: 53m 07s | Max:  1h 13m | Hits:  77%/26898 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 51m | Avg:  1h 12m | Max:  1h 14m | Hits:  75%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 21m | Avg: 27m 16s | Max: 29m 59s | Hits:  89%/3666  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 13h | Avg:  1h 05m | Max:  1h 14m | Hits:  70%/40338 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 28m | Avg: 33m 36s | Max:  1h 04m | Hits:  92%/9776  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 16h | Avg:  1h 04m | Max:  1h 14m | Hits:  70%/44004 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 27m 06s | Avg: 27m 06s | Max: 27m 06s | Hits:  99%/1222  
      🟩 GraphCapture       Pass: 100%/1   | Total: 20m 21s | Avg: 20m 21s | Max: 20m 21s | Hits:  99%/1222  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 20m | Avg: 26m 47s | Max: 28m 31s | Hits:  99%/3666  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 06m | Avg: 22m 07s | Max: 23m 18s | Hits:  99%/3666  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 21m | Avg: 27m 16s | Max: 29m 59s | Hits:  89%/3666  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m | Hits:  69%/1222  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 22h 17m | Avg:  1h 06m | Max:  1h 14m | Hits:  70%/23662 
      🟩 20                 Pass: 100%/25  | Total: 20h 57m | Avg: 50m 17s | Max:  1h 14m | Hits:  79%/30118 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 22h 39m | Avg: 30m 12s | Max: 55m 51s | Hits: 79%/79911

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 37m 19s | Avg: 18m 39s | Max: 25m 51s | Hits:  88%/3554  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 21h 44m | Avg: 30m 20s | Max: 55m 51s | Hits:  79%/76358 
      🟩 arm64              Pass: 100%/2   | Total: 55m 01s | Avg: 27m 30s | Max: 28m 42s | Hits:  77%/3553  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 51m | Avg: 34m 22s | Max: 50m 29s | Hits:  75%/8876  
      🟩 12.6               Pass: 100%/2   | Total:  1h 48m | Avg: 54m 25s | Max: 55m 51s | Hits:  66%/3552  
      🟩 12.8               Pass: 100%/38  | Total: 17h 58m | Avg: 28m 23s | Max: 53m 59s | Hits:  80%/67483 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 51m 39s | Avg: 25m 49s | Max: 27m 33s | Hits:  77%/3552  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 51m | Avg: 34m 22s | Max: 50m 29s | Hits:  75%/8876  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  1h 48m | Avg: 54m 25s | Max: 55m 51s | Hits:  66%/3552  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 17h 07m | Avg: 28m 31s | Max: 53m 59s | Hits:  81%/63931 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 51m 39s | Avg: 25m 49s | Max: 27m 33s | Hits:  77%/3552  
      🟩 nvcc               Pass: 100%/43  | Total: 21h 47m | Avg: 30m 24s | Max: 55m 51s | Hits:  79%/76359 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 44s | Max: 32m 44s | Hits:  77%/7104  
      🟩 Clang15            Pass: 100%/2   | Total: 58m 21s | Avg: 29m 10s | Max: 29m 40s | Hits:  77%/3552  
      🟩 Clang16            Pass: 100%/2   | Total: 57m 32s | Avg: 28m 46s | Max: 29m 23s | Hits:  77%/3552  
      🟩 Clang17            Pass: 100%/2   | Total: 58m 06s | Avg: 29m 03s | Max: 29m 31s | Hits:  77%/3552  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 34m | Avg: 22m 03s | Max: 30m 35s | Hits:  84%/12432 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 05m | Avg: 32m 37s | Max: 34m 18s | Hits:  77%/3554  
      🟩 GCC8               Pass: 100%/1   | Total: 30m 28s | Avg: 30m 28s | Max: 30m 28s | Hits:  77%/1777  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 05s | Max: 31m 51s | Hits:  77%/3554  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 24s | Max: 31m 59s | Hits:  77%/3554  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 57s | Max: 33m 11s | Hits:  77%/3554  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 28s | Max: 31m 41s | Hits:  77%/3554  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 37m | Avg: 21m 42s | Max: 35m 14s | Hits:  86%/17770 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 14s | Max: 53m 59s | Hits:  66%/3540  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 14m | Avg: 44m 43s | Max: 53m 52s | Hits:  77%/5310  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 25s | Max: 55m 51s | Hits:  66%/3552  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 27m | Avg: 26m 18s | Max: 32m 44s | Hits:  80%/30192 
      🟩 GCC                Pass: 100%/21  | Total:  9h 24m | Avg: 26m 53s | Max: 35m 14s | Hits:  81%/37317 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 58m | Avg: 47m 43s | Max: 53m 59s | Hits:  73%/8850  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 48m | Avg: 54m 25s | Max: 55m 51s | Hits:  66%/3552  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 29m 59s | Avg: 14m 59s | Max: 18m 14s | Hits:  88%/3554  
      🟩 rtx2080            Pass: 100%/33  | Total: 18h 34m | Avg: 33m 45s | Max: 55m 51s | Hits:  75%/58604 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 35m | Avg: 21m 31s | Max: 52m 47s | Hits:  89%/17753 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 21h 10m | Avg: 33m 26s | Max: 55m 51s | Hits:  75%/67481 
      🟩 TestCPU            Pass: 100%/3   | Total: 43m 19s | Avg: 14m 26s | Max: 27m 32s | Hits:  99%/5323  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 29s | Avg: 11m 22s | Max: 11m 49s | Hits:  99%/7107  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 29m 59s | Avg: 14m 59s | Max: 18m 14s | Hits:  88%/3554  
      🟩 90;90a;100         Pass: 100%/1   | Total: 34m 04s | Avg: 34m 04s | Max: 34m 04s | Hits:  77%/1777  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 49m | Avg: 35m 28s | Max: 53m 59s | Hits:  75%/35511 
      🟩 20                 Pass: 100%/23  | Total: 10h 12m | Avg: 26m 37s | Max: 55m 51s | Hits:  82%/40846 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 47s | Avg: 4m 11s | Max: 5m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  5m 01s
      🟩 arm64              Pass: 100%/2   | Total:  6m 53s | Avg:  3m 26s | Max:  3m 30s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 01s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 01s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 01s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 01s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 01s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 23s | Avg:  4m 11s | Max:  4m 53s
      🟩 20                 Pass: 100%/2   | Total:  8m 24s | Avg:  4m 12s | Max:  5m 01s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 18m 32s | Avg: 9m 16s | Max: 16m 08s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 16m 08s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 16m 08s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 16m 08s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 16m 08s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 16m 08s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 16m 08s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 16m 08s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 24s | Avg:  2m 24s | Max:  2m 24s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 16m 08s | Avg: 16m 08s | Max: 16m 08s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 20m | Avg: 1h 20m | Max: 1h 20m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 20m | Avg:  1h 20m | Max:  1h 20m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 97)

# Runner
68 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber merged commit 56e8dac into NVIDIA:main Mar 21, 2025
108 of 110 checks passed
@bernhardmgruber bernhardmgruber deleted the ref_assign_value branch March 21, 2025 23:01
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 21, 2025
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025
This avoids instantiating thrust::copy and thrust::transform when
assigning a single element between thrust references.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

2 participants