KEMBAR78
Add a warning that we cannot tune transform by bernhardmgruber · Pull Request #3896 · NVIDIA/cccl · GitHub
Skip to content

Conversation

@bernhardmgruber
Copy link
Contributor

No description provided.

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner February 21, 2025 15:59
@github-actions
Copy link
Contributor

🟩 CI finished in 1h 05m: Pass: 100%/93 | Total: 15h 39m | Avg: 10m 05s | Max: 39m 19s | Hits: 95%/133745
  • 🟩 cub: Pass: 100%/45 | Total: 8h 19m | Avg: 11m 05s | Max: 32m 33s | Hits: 92%/53305

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 07m | Avg: 11m 20s | Max: 32m 33s | Hits:  92%/50883 
      🟩 arm64              Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 51s | Hits:  99%/2422  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 52m 08s | Avg: 10m 25s | Max: 29m 06s | Hits:  84%/5888  
      🟩 12.5               Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 10m 12s | Hits:  98%/2240  
      🟩 12.8               Pass: 100%/38  | Total:  7h 06m | Avg: 11m 13s | Max: 32m 33s | Hits:  93%/45177 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 06s | Hits: 100%/2092  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 52m 08s | Avg: 10m 25s | Max: 29m 06s | Hits:  84%/5888  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 10m 12s | Hits:  98%/2240  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 56m | Avg: 11m 34s | Max: 32m 33s | Hits:  93%/43085 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 06s | Hits: 100%/2092  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 08m | Avg: 11m 22s | Max: 32m 33s | Hits:  92%/51213 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 03s | Avg:  5m 45s | Max:  6m 03s | Hits:  99%/4852  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 03s | Avg:  6m 31s | Max:  6m 33s | Hits:  99%/2422  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 13s | Avg:  6m 06s | Max:  6m 19s | Hits:  99%/2422  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 11s | Avg:  6m 05s | Max:  6m 14s | Hits:  99%/2422  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 09m | Avg:  9m 55s | Max: 22m 19s | Hits:  99%/8147  
      🟩 GCC7               Pass: 100%/2   | Total: 11m 50s | Avg:  5m 55s | Max:  5m 55s | Hits:  99%/2426  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 57s | Avg:  5m 57s | Max:  5m 57s | Hits:  99%/1213  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 16s | Hits:  99%/2426  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 01s | Avg:  6m 30s | Max:  6m 39s | Hits:  99%/2426  
      🟩 GCC11              Pass: 100%/2   | Total: 12m 18s | Avg:  6m 09s | Max:  6m 09s | Hits:  99%/2422  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max:  7m 10s | Hits:  99%/2422  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 39m | Avg: 14m 30s | Max: 23m 26s | Hits:  99%/13321 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 57m 48s | Avg: 28m 54s | Max: 29m 06s | Hits:  16%/2072  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 12s | Max: 32m 33s | Hits:  16%/2072  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 10m 12s | Hits:  98%/2240  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 09m | Avg:  7m 38s | Max: 22m 19s | Hits:  99%/20265 
      🟩 GCC                Pass: 100%/22  | Total:  3h 48m | Avg: 10m 24s | Max: 23m 26s | Hits:  99%/26656 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 00m | Avg: 30m 03s | Max: 32m 33s | Hits:  16%/4144  
      🟩 NVHPC              Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 10m 12s | Hits:  98%/2240  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 50m 13s | Avg: 16m 44s | Max: 23m 26s | Hits:  99%/3633  
      🟩 rtx2080            Pass: 100%/34  | Total:  5h 11m | Avg:  9m 09s | Max: 32m 33s | Hits:  90%/39984 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 17m | Avg: 17m 09s | Max: 23m 07s | Hits:  99%/9688  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 29m | Avg:  8m 54s | Max: 32m 33s | Hits:  91%/43617 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 43s | Avg: 21m 43s | Max: 21m 43s | Hits:  99%/1211  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 08s | Avg: 16m 08s | Max: 16m 08s | Hits:  99%/1211  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 57s | Max: 23m 26s | Hits:  99%/3633  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 02m | Avg: 20m 58s | Max: 22m 12s | Hits:  99%/3633  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 50m 13s | Avg: 16m 44s | Max: 23m 26s | Hits:  99%/3633  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 46s | Avg:  6m 46s | Max:  6m 46s | Hits:  99%/1211  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 15m | Avg:  9m 46s | Max: 29m 52s | Hits:  88%/23455 
      🟩 20                 Pass: 100%/25  | Total:  5h 03m | Avg: 12m 08s | Max: 32m 33s | Hits:  96%/29850 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 25m | Avg: 8m 33s | Max: 32m 02s | Hits: 96%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 03s | Avg:  8m 31s | Max: 11m 18s | Hits:  99%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 15m | Avg:  8m 43s | Max: 32m 02s | Hits:  96%/76573 
      🟩 arm64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  4m 59s | Hits:  99%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 42s | Avg:  8m 20s | Max: 22m 06s | Hits:  94%/8901  
      🟩 12.5               Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 14m 28s | Hits:  99%/3562  
      🟩 12.8               Pass: 100%/38  | Total:  5h 15m | Avg:  8m 18s | Max: 32m 02s | Hits:  96%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 08s | Hits: 100%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 42s | Avg:  8m 20s | Max: 22m 06s | Hits:  94%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 14m 28s | Hits:  99%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  5h 05m | Avg:  8m 29s | Max: 32m 02s | Hits:  96%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 08s | Hits: 100%/3562  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 15m | Avg:  8m 43s | Max: 32m 02s | Hits:  96%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  5m 39s | Hits: 100%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 40s | Hits: 100%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 30s | Avg:  5m 15s | Max:  5m 20s | Hits: 100%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  5m 31s | Hits: 100%/3562  
      🟩 Clang18            Pass: 100%/7   | Total: 43m 54s | Avg:  6m 16s | Max: 10m 43s | Hits: 100%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 18s | Hits:  99%/3564  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s | Hits:  99%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 59s | Avg:  5m 29s | Max:  6m 07s | Hits:  99%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 39s | Hits:  99%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 22s | Avg:  5m 41s | Max:  6m 03s | Hits:  99%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 53s | Hits:  99%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 14m | Avg:  7m 27s | Max: 11m 19s | Hits:  99%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 45m 13s | Avg: 22m 36s | Max: 23m 07s | Hits:  70%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 19m | Avg: 26m 37s | Max: 32m 02s | Hits:  70%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 14m 28s | Hits:  99%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 36m | Avg:  5m 41s | Max: 10m 43s | Hits: 100%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  2h 15m | Avg:  6m 26s | Max: 11m 19s | Hits:  99%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 05m | Avg: 25m 01s | Max: 32m 02s | Hits:  70%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 14m 28s | Hits:  99%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 15m 26s | Avg:  7m 43s | Max: 10m 34s | Hits:  99%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 06m | Avg:  7m 28s | Max: 23m 07s | Hits:  97%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 02m | Avg: 12m 17s | Max: 32m 02s | Hits:  94%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  4h 53m | Avg:  7m 44s | Max: 24m 46s | Hits:  96%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 47m 25s | Avg: 15m 48s | Max: 32m 02s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 54s | Avg: 10m 58s | Max: 11m 19s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 15m 26s | Avg:  7m 43s | Max: 10m 34s | Hits:  99%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 10s | Avg:  6m 10s | Max:  6m 10s | Hits:  99%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 48m | Avg:  8m 26s | Max: 23m 07s | Hits:  95%/35611 
      🟩 20                 Pass: 100%/23  | Total:  3h 19m | Avg:  8m 39s | Max: 32m 02s | Hits:  97%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 29s | Avg: 7m 44s | Max: 13m 07s | Hits: 98%/304

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 29s | Avg:  7m 44s | Max: 13m 07s | Hits:  98%/304   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 29s | Avg:  7m 44s | Max: 13m 07s | Hits:  98%/304   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 29s | Avg:  7m 44s | Max: 13m 07s | Hits:  98%/304   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 29s | Avg:  7m 44s | Max: 13m 07s | Hits:  98%/304   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 29s | Avg:  7m 44s | Max: 13m 07s | Hits:  98%/304   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 29s | Avg:  7m 44s | Max: 13m 07s | Hits:  98%/304   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 29s | Avg:  7m 44s | Max: 13m 07s | Hits:  98%/304   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 22s | Avg:  2m 22s | Max:  2m 22s | Hits:  98%/152   
      🟩 Test               Pass: 100%/1   | Total: 13m 07s | Avg: 13m 07s | Max: 13m 07s | Hits:  98%/152   
    
  • 🟩 python: Pass: 100%/1 | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 39m 19s | Avg: 39m 19s | Max: 39m 19s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) February 21, 2025 17:22
@bernhardmgruber bernhardmgruber merged commit da1c270 into NVIDIA:main Feb 21, 2025
105 of 108 checks passed
@bernhardmgruber bernhardmgruber deleted the bench_transf_warning branch February 21, 2025 22:00
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

2 participants