Make sure that nested `tuple` and `pair` have the expected size by miscco · Pull Request #5246 · NVIDIA/cccl · GitHub

Conversation

@miscco
Contributor

@miscco miscco commented Jul 14, 2025

We were inheriting from a base class that was not unique to the instantiation of the tuple.

That led the compiler to insert padding for it.

We need to make sure that the SFINAE helper class is unique to the instantiated tuple.

Fixes #5226
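
The padding effect described above can be reproduced outside of the library. The following is a minimal sketch, not the actual libcu++ change: `shared_helper`, `unique_helper`, `shared_box` and `unique_box` are hypothetical names invented for illustration. When an outer object and its first member both derive from the same empty base type, the two base subobjects must have distinct addresses, so the member cannot start at offset 0. Making the empty base depend on the template arguments avoids the clash.

```cpp
#include <cstddef>

// Hypothetical stand-in for an empty helper base that is the same type
// for every instantiation (the problematic pattern).
struct shared_helper {};

// Hypothetical empty helper base that is a distinct type per element type.
template <class T>
struct unique_helper {};

// Wrapper whose empty base is identical for every T.
template <class T>
struct shared_box : shared_helper {
  T value;
};

// Wrapper whose empty base depends on T.
template <class T>
struct unique_box : unique_helper<T> {
  T value;
};

// shared_box<shared_box<int>> holds two shared_helper subobjects: its own
// base and the base of `value`. Two objects of the same type may not share
// an address, so `value` is pushed past offset 0 and the nested type grows.
static_assert(sizeof(shared_box<shared_box<int>>) > sizeof(shared_box<int>),
              "a shared empty base forces padding in the nested wrapper");

// Sizes exposed for inspection. With unique bases the empty base optimization
// can apply at every level; on common ABIs (for example the Itanium ABI used
// by GCC and Clang on Linux) the nested wrapper is then no larger than int.
constexpr std::size_t nested_shared_size = sizeof(shared_box<shared_box<int>>);
constexpr std::size_t nested_unique_size = sizeof(unique_box<unique_box<int>>);
```

The first `static_assert` follows from the language rule on distinct addresses. The exact value of `nested_unique_size` is ABI-dependent, so the sketch only exposes it rather than asserting it in the code itself.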

@miscco miscco requested a review from a team as a code owner July 14, 2025 11:15
@miscco miscco requested a review from bernhardmgruber July 14, 2025 11:15
@github-project-automation github-project-automation bot moved this to Todo in CCCL Jul 14, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Jul 14, 2025
@miscco miscco requested a review from ericniebler July 14, 2025 11:15
@miscco miscco force-pushed the fix_tuple_size branch 2 times, most recently from aa840fb to 5732a4d Compare July 14, 2025 14:20
@github-actions
Contributor

🟩 CI finished in 16h 33m: Pass: 100%/205 | Total: 4d 11h | Avg: 31m 34s | Max: 1h 46m | Hits: 72%/338147
  • 🟩 cub: Pass: 100%/50 | Total: 1d 22h | Avg: 55m 48s | Max: 1h 40m | Hits: 70%/61706

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  1d 20h | Avg: 55m 46s | Max:  1h 40m | Hits:  70%/59190 
      🟩 arm64              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 42s | Max:  1h 00m | Hits:  66%/2516  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 23m | Avg:  1h 04m | Max:  1h 40m | Hits:  56%/6186  
      🟩 12.9               Pass: 100%/45  | Total:  1d 17h | Avg: 54m 49s | Max:  1h 24m | Hits:  71%/55520 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 02m | Avg: 31m 02s | Max: 32m 00s | Hits:  72%/2165  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 23m | Avg:  1h 04m | Max:  1h 40m | Hits:  56%/6186  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 16h | Avg: 55m 56s | Max:  1h 24m | Hits:  71%/53355 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 02s | Max: 32m 00s | Hits:  72%/2165  
      🟩 nvcc               Pass: 100%/48  | Total:  1d 21h | Avg: 56m 50s | Max:  1h 40m | Hits:  70%/59541 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 47m | Avg: 56m 46s | Max:  1h 01m | Hits:  66%/5034  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 54s | Max:  1h 01m | Hits:  66%/2513  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 08m | Hits:  66%/2513  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  66%/2513  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 56m | Avg: 58m 14s | Max:  1h 01m | Hits:  66%/2513  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 56m | Avg: 42m 20s | Max:  1h 01m | Hits:  77%/8449  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 57m | Avg: 58m 36s | Max:  1h 00m | Hits:  65%/2516  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:  65%/1258  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 04m | Hits:  65%/2516  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m | Hits:  65%/2517  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 11m | Hits:  65%/2513  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 11m | Hits:  65%/2513  
      🟩 GCC13              Pass: 100%/12  | Total:  8h 59m | Avg: 44m 58s | Max:  1h 03m | Hits:  82%/15105 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 38m | Avg:  1h 19m | Max:  1h 40m | Hits:  48%/2306  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  3h 28m | Avg: 52m 09s | Max:  1h 07m | Hits:  84%/4612  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 47m | Avg:  1h 23m | Max:  1h 24m | Hits:  11%/2315  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 16h 54m | Avg: 53m 23s | Max:  1h 08m | Hits:  70%/23535 
      🟩 GCC                Pass: 100%/23  | Total: 20h 41m | Avg: 53m 58s | Max:  1h 11m | Hits:  74%/28938 
      🟩 MSVC               Pass: 100%/6   | Total:  6h 07m | Avg:  1h 01m | Max:  1h 40m | Hits:  72%/6918  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 47m | Avg:  1h 23m | Max:  1h 24m | Hits:  11%/2315  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 32m | Avg: 30m 54s | Max: 36m 23s | Hits:  88%/3777  
      🟩 rtx2080            Pass: 100%/39  | Total:  1d 15h | Avg:  1h 00m | Max:  1h 40m | Hits:  64%/47863 
      🟩 rtxa6000           Pass: 100%/8   | Total:  5h 41m | Avg: 42m 43s | Max:  1h 02m | Hits:  91%/10066 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  1d 17h | Avg: 59m 46s | Max:  1h 40m | Hits:  64%/51638 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 42m 19s | Avg: 42m 19s | Max: 42m 19s | Hits:  99%/1259  
      🟩 GraphCapture       Pass: 100%/1   | Total: 34m 31s | Avg: 34m 31s | Max: 34m 31s | Hits:  99%/1259  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 53m | Avg: 37m 45s | Max: 44m 17s | Hits:  99%/3775  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 29m | Avg: 29m 57s | Max: 35m 26s | Hits:  99%/3775  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 32m | Avg: 30m 54s | Max: 36m 23s | Hits:  88%/3777  
      🟩 90;90a             Pass: 100%/2   | Total:  1h 29m | Avg: 44m 36s | Max: 52m 11s | Hits:  77%/2412  
      🟩 100;120            Pass: 100%/2   | Total:  1h 32m | Avg: 46m 25s | Max: 52m 01s | Hits:  76%/2412  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 21h 54m | Avg:  1h 02m | Max:  1h 40m | Hits:  62%/25810 
      🟩 20                 Pass: 100%/29  | Total:  1d 00h | Avg: 50m 53s | Max:  1h 24m | Hits:  75%/35896 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 1d 07h | Avg: 37m 17s | Max: 1h 46m | Hits: 79%/95621

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 51m 15s | Avg: 25m 37s | Max: 36m 38s | Hits:  89%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  1d 05h | Avg: 37m 22s | Max:  1h 46m | Hits:  79%/91794 
      🟩 arm64              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 13s | Max: 37m 14s | Hits:  79%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 48m | Avg: 33m 46s | Max: 36m 38s | Hits:  81%/9560  
      🟩 12.9               Pass: 100%/45  | Total:  1d 04h | Avg: 37m 40s | Max:  1h 46m | Hits:  79%/86061 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 55m 43s | Avg: 27m 51s | Max: 28m 45s | Hits:  79%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 48m | Avg: 33m 46s | Max: 36m 38s | Hits:  81%/9560  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 03h | Avg: 38m 08s | Max:  1h 46m | Hits:  79%/82235 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 55m 43s | Avg: 27m 51s | Max: 28m 45s | Hits:  79%/3826  
      🟩 nvcc               Pass: 100%/48  | Total:  1d 06h | Avg: 37m 41s | Max:  1h 46m | Hits:  79%/91795 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 03s | Max: 37m 12s | Hits:  79%/7652  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 06s | Max: 38m 16s | Hits:  79%/3826  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 05s | Max: 37m 07s | Hits:  79%/3826  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 17m | Avg: 38m 35s | Max: 40m 30s | Hits:  79%/3826  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 09m | Avg: 34m 58s | Max: 35m 23s | Hits:  79%/3826  
      🟩 Clang19            Pass: 100%/7   | Total:  3h 02m | Avg: 26m 05s | Max: 38m 16s | Hits:  85%/13391 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 14m | Avg: 37m 23s | Max: 39m 09s | Hits:  79%/3828  
      🟩 GCC8               Pass: 100%/1   | Total: 42m 12s | Avg: 42m 12s | Max: 42m 12s | Hits:  79%/1914  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 19m | Avg: 39m 59s | Max: 43m 20s | Hits:  79%/3828  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 20m | Avg: 40m 18s | Max: 42m 49s | Hits:  79%/3828  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 21m | Avg: 40m 37s | Max: 41m 30s | Hits:  79%/3828  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 26m | Avg: 43m 03s | Max: 43m 15s | Hits:  79%/3828  
      🟩 GCC13              Pass: 100%/11  | Total:  4h 55m | Avg: 26m 53s | Max: 42m 26s | Hits:  87%/21054 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 35s | Max: 34m 49s | Hits:  88%/3812  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  4h 03m | Avg: 48m 42s | Max:  1h 16m | Hits:  74%/9530  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  3h 20m | Avg:  1h 40m | Max:  1h 46m | Hits:  11%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 10h 10m | Avg: 32m 07s | Max: 40m 30s | Hits:  81%/36347 
      🟩 GCC                Pass: 100%/22  | Total: 12h 20m | Avg: 33m 40s | Max: 43m 20s | Hits:  83%/42108 
      🟩 MSVC               Pass: 100%/7   | Total:  5h 12m | Avg: 44m 40s | Max:  1h 16m | Hits:  78%/13342 
      🟩 NVHPC              Pass: 100%/2   | Total:  3h 20m | Avg:  1h 40m | Max:  1h 46m | Hits:  11%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 36m 19s | Avg: 18m 09s | Max: 21m 43s | Hits:  89%/3828  
      🟩 rtx2080            Pass: 100%/38  | Total:  1d 01h | Avg: 40m 39s | Max:  1h 46m | Hits:  76%/72672 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 42m | Avg: 28m 17s | Max:  1h 16m | Hits:  89%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  1d 05h | Avg: 40m 54s | Max:  1h 46m | Hits:  75%/82233 
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 35s | Avg: 16m 51s | Max: 33m 10s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 55m 05s | Avg: 13m 46s | Max: 14m 46s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 36m 19s | Avg: 18m 09s | Max: 21m 43s | Hits:  89%/3828  
      🟩 90;90a             Pass: 100%/2   | Total:  1h 03m | Avg: 31m 32s | Max: 33m 11s | Hits:  83%/3820  
      🟩 100;120            Pass: 100%/2   | Total:  1h 29m | Avg: 44m 30s | Max: 56m 51s | Hits:  67%/3820  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 14h 27m | Avg: 41m 19s | Max:  1h 46m | Hits:  77%/40160 
      🟩 20                 Pass: 100%/27  | Total: 15h 45m | Avg: 35m 01s | Max:  1h 34m | Hits:  80%/51633 
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 17h 23m | Avg: 21m 44s | Max: 1h 16m | Hits: 72%/164079

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 16h 55m | Avg: 22m 04s | Max:  1h 16m | Hits:  71%/156740
      🟩 arm64              Pass: 100%/2   | Total: 27m 49s | Avg: 13m 54s | Max: 23m 12s | Hits:  77%/7339  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 42m | Avg: 20m 31s | Max: 49m 37s | Hits:  73%/17982 
      🟩 12.9               Pass: 100%/43  | Total: 15h 41m | Avg: 21m 53s | Max:  1h 16m | Hits:  71%/146097
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 53m 22s | Avg: 26m 41s | Max: 27m 52s | Hits:  28%/7303  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 42m | Avg: 20m 31s | Max: 49m 37s | Hits:  73%/17982 
      🟩 nvcc12.9           Pass: 100%/41  | Total: 14h 47m | Avg: 21m 38s | Max:  1h 16m | Hits:  74%/138794
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 22s | Avg: 26m 41s | Max: 27m 52s | Hits:  28%/7303  
      🟩 nvcc               Pass: 100%/46  | Total: 16h 30m | Avg: 21m 31s | Max:  1h 16m | Hits:  74%/156776
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 47m 18s | Avg: 11m 49s | Max: 22m 18s | Hits:  83%/14562 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 08s | Hits:  99%/7299  
      🟩 Clang16            Pass: 100%/2   | Total: 37m 52s | Avg: 18m 56s | Max: 22m 06s | Hits:  67%/7299  
      🟩 Clang17            Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 24m 41s | Hits:  78%/7299  
      🟩 Clang18            Pass: 100%/2   | Total: 40m 33s | Avg: 20m 16s | Max: 24m 34s | Hits:  67%/7299  
      🟩 Clang19            Pass: 100%/6   | Total:  2h 26m | Avg: 24m 26s | Max: 27m 52s | Hits:  54%/21940 
      🟩 GCC7               Pass: 100%/2   | Total: 24m 47s | Avg: 12m 23s | Max: 19m 50s | Hits:  78%/7235  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 04s | Avg:  5m 04s | Max:  5m 04s | Hits:  99%/3628  
      🟩 GCC9               Pass: 100%/2   | Total: 49m 58s | Avg: 24m 59s | Max: 25m 57s | Hits:  56%/7247  
      🟩 GCC10              Pass: 100%/2   | Total: 27m 45s | Avg: 13m 52s | Max: 22m 41s | Hits:  78%/7301  
      🟩 GCC11              Pass: 100%/2   | Total: 49m 57s | Avg: 24m 58s | Max: 25m 10s | Hits:  56%/7297  
      🟩 GCC12              Pass: 100%/2   | Total: 23m 13s | Avg: 11m 36s | Max: 18m 06s | Hits:  88%/7301  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 10m | Avg: 17m 19s | Max:  1h 16m | Hits:  88%/29695 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 38m | Avg: 49m 01s | Max: 49m 37s | Hits:  52%/6971  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  3h 10m | Avg: 47m 30s | Max: 52m 50s | Hits:  53%/14421 
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 39s | Max: 36m 34s | Hits:  53%/7285  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  5h 12m | Avg: 17m 23s | Max: 27m 52s | Hits:  71%/65698 
      🟩 GCC                Pass: 100%/22  | Total:  6h 11m | Avg: 16m 52s | Max:  1h 16m | Hits:  80%/69704 
      🟩 MSVC               Pass: 100%/6   | Total:  4h 48m | Avg: 48m 00s | Max: 52m 50s | Hits:  53%/21392 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 39s | Max: 36m 34s | Hits:  53%/7285  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 53s | Avg: 11m 26s | Max: 17m 51s | Hits:  99%/7506  
      🟩 rtx2080            Pass: 100%/46  | Total: 17h 00m | Avg: 22m 11s | Max:  1h 16m | Hits:  70%/156573
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total: 14h 42m | Avg: 21m 00s | Max: 52m 50s | Hits:  71%/152945
      🟩 NVRTC              Pass: 100%/2   | Total: 43m 01s | Avg: 21m 30s | Max: 21m 33s | Hits:  90%/42    
      🟩 Test               Pass: 100%/3   | Total:  1h 56m | Avg: 38m 48s | Max:  1h 16m | Hits:  85%/11092 
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 51s | Avg:  1m 51s | Max:  1m 51s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 43m 01s | Avg: 21m 30s | Max: 21m 33s | Hits:  90%/42    
      🟩 90                 Pass: 100%/2   | Total: 22m 53s | Avg: 11m 26s | Max: 17m 51s | Hits:  99%/7506  
      🟩 90;90a             Pass: 100%/2   | Total: 59m 00s | Avg: 29m 30s | Max: 44m 44s | Hits:  66%/7452  
      🟩 100;120            Pass: 100%/2   | Total: 49m 44s | Avg: 24m 52s | Max: 44m 41s | Hits:  76%/7452  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  8h 07m | Avg: 22m 10s | Max: 49m 37s | Hits:  67%/75763 
      🟩 20                 Pass: 100%/25  | Total:  9h 13m | Avg: 22m 09s | Max:  1h 16m | Hits:  75%/88316 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 8h 35m | Avg: 18m 24s | Max: 34m 54s | Hits: 53%/16246

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  7h 25m | Avg: 18m 32s | Max: 34m 54s | Hits:  54%/13754 
      🟩 arm64              Pass: 100%/4   | Total:  1h 10m | Avg: 17m 32s | Max: 19m 10s | Hits:  50%/2492  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 49m 47s | Avg: 16m 35s | Max: 17m 41s | Hits:  50%/1568  
      🟩 12.9               Pass: 100%/25  | Total:  7h 45m | Avg: 18m 37s | Max: 34m 54s | Hits:  54%/14678 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 49m 47s | Avg: 16m 35s | Max: 17m 41s | Hits:  50%/1568  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  7h 45m | Avg: 18m 37s | Max: 34m 54s | Hits:  54%/14678 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  8h 35m | Avg: 18m 24s | Max: 34m 54s | Hits:  53%/16246 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 34m 20s | Avg: 17m 10s | Max: 17m 37s | Hits:  50%/1248  
      🟩 Clang15            Pass: 100%/1   | Total: 19m 38s | Avg: 19m 38s | Max: 19m 38s | Hits:  50%/623   
      🟩 Clang16            Pass: 100%/1   | Total: 20m 09s | Avg: 20m 09s | Max: 20m 09s | Hits:  50%/623   
      🟩 Clang17            Pass: 100%/1   | Total: 17m 54s | Avg: 17m 54s | Max: 17m 54s | Hits:  50%/623   
      🟩 Clang18            Pass: 100%/1   | Total: 18m 27s | Avg: 18m 27s | Max: 18m 27s | Hits:  50%/623   
      🟩 Clang19            Pass: 100%/4   | Total: 59m 12s | Avg: 14m 48s | Max: 18m 17s | Hits:  62%/2492  
      🟩 GCC10              Pass: 100%/2   | Total: 40m 45s | Avg: 20m 22s | Max: 23m 04s | Hits:  50%/1248  
      🟩 GCC11              Pass: 100%/1   | Total: 20m 35s | Avg: 20m 35s | Max: 20m 35s | Hits:  49%/623   
      🟩 GCC12              Pass: 100%/1   | Total: 24m 03s | Avg: 24m 03s | Max: 24m 03s | Hits:  49%/623   
      🟩 GCC13              Pass: 100%/8   | Total:  2h 10m | Avg: 16m 17s | Max: 21m 50s | Hits:  62%/4984  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s | Hits:  47%/322   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 44m 45s | Avg: 14m 55s | Max: 15m 25s | Hits:  46%/972   
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 51s | Max: 34m 54s | Hits:  26%/1242  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total:  2h 49m | Avg: 16m 58s | Max: 20m 09s | Hits:  55%/6232  
      🟩 GCC                Pass: 100%/12  | Total:  3h 35m | Avg: 17m 58s | Max: 24m 03s | Hits:  58%/7478  
      🟩 MSVC               Pass: 100%/4   | Total:  1h 00m | Avg: 15m 02s | Max: 15m 25s | Hits:  47%/1294  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 51s | Max: 34m 54s | Hits:  26%/1242  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 59s | Avg: 11m 29s | Max: 15m 37s | Hits:  74%/1246  
      🟩 rtx2080            Pass: 100%/26  | Total:  8h 12m | Avg: 18m 55s | Max: 34m 54s | Hits:  52%/15000 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  8h 07m | Avg: 19m 30s | Max: 34m 54s | Hits:  47%/14377 
      🟩 Test               Pass: 100%/3   | Total: 27m 28s | Avg:  9m 09s | Max: 11m 49s | Hits:  99%/1869  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 22m 59s | Avg: 11m 29s | Max: 15m 37s | Hits:  74%/1246  
      🟩 90;90a             Pass: 100%/2   | Total: 34m 20s | Avg: 17m 10s | Max: 18m 55s | Hits:  48%/947   
      🟩 100;120            Pass: 100%/2   | Total: 31m 28s | Avg: 15m 44s | Max: 17m 14s | Hits:  48%/947   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total:  1h 08m | Avg: 22m 51s | Max: 34m 49s | Hits:  42%/1867  
      🟩 20                 Pass: 100%/25  | Total:  7h 26m | Avg: 17m 51s | Max: 34m 54s | Hits:  55%/14379 
    
  • 🟩 python: Pass: 100%/18 | Total: 3h 16m | Avg: 10m 53s | Max: 23m 13s

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  3h 16m | Avg: 10m 53s | Max: 23m 13s
    🟩 ctk
      🟩 12.9               Pass: 100%/18  | Total:  3h 16m | Avg: 10m 53s | Max: 23m 13s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/18  | Total:  3h 16m | Avg: 10m 53s | Max: 23m 13s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/18  | Total:  3h 16m | Avg: 10m 53s | Max: 23m 13s
    🟩 cxx
      🟩 GCC13              Pass: 100%/18  | Total:  3h 16m | Avg: 10m 53s | Max: 23m 13s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/18  | Total:  3h 16m | Avg: 10m 53s | Max: 23m 13s
    🟩 gpu
      🟩 h100               Pass: 100%/8   | Total:  1h 18m | Avg:  9m 50s | Max: 16m 57s
      🟩 rtxa6000           Pass: 100%/10  | Total:  1h 57m | Avg: 11m 44s | Max: 23m 13s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total: 20m 37s | Avg: 10m 18s | Max: 10m 43s
      🟩 Test cuda.cccl.cooperative Pass: 100%/4   | Total:  1h 10m | Avg: 17m 37s | Max: 23m 13s
      🟩 Test cuda.cccl.examples Pass: 100%/4   | Total: 18m 12s | Avg:  4m 33s | Max:  4m 37s
      🟩 Test cuda.cccl.headers Pass: 100%/4   | Total: 19m 14s | Avg:  4m 48s | Max:  5m 40s
      🟩 Test cuda.cccl.parallel Pass: 100%/4   | Total:  1h 07m | Avg: 16m 53s | Max: 17m 28s
    🟩 py_version
      🟩 3.10               Pass: 100%/9   | Total:  1h 38m | Avg: 10m 53s | Max: 22m 35s
      🟩 3.13               Pass: 100%/9   | Total:  1h 38m | Avg: 10m 54s | Max: 23m 13s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 19m 52s | Avg: 4m 58s | Max: 5m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 19m 52s | Avg:  4m 58s | Max:  5m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  9m 16s | Avg:  4m 38s | Max:  4m 45s
      🟩 12.9               Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 55s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 16s | Avg:  4m 38s | Max:  4m 45s
      🟩 nvcc12.9           Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 52s | Avg:  4m 58s | Max:  5m 55s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
      🟩 Clang19            Pass: 100%/1   | Total:  4m 41s | Avg:  4m 41s | Max:  4m 41s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
      🟩 GCC13              Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 41s
      🟩 GCC                Pass: 100%/2   | Total: 10m 40s | Avg:  5m 20s | Max:  5m 55s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 52s | Avg:  4m 58s | Max:  5m 55s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 19m 52s | Avg:  4m 58s | Max:  5m 55s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 15m 57s | Avg: 3m 59s | Max: 4m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 18s | Avg:  4m 09s | Max:  4m 14s
      🟩 arm64              Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 52s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 15m 57s | Avg:  3m 59s | Max:  4m 14s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 15m 57s | Avg:  3m 59s | Max:  4m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 15m 57s | Avg:  3m 59s | Max:  4m 14s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 15m 57s | Avg:  3m 59s | Max:  4m 14s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 15m 57s | Avg:  3m 59s | Max:  4m 14s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 15m 57s | Avg:  3m 59s | Max:  4m 14s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 15m 57s | Avg:  3m 59s | Max:  4m 14s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 04s
      🟩 20                 Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 14s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 26m 14s | Avg: 8m 44s | Max: 14m 19s | Hits: 98%/495

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 26m 14s | Avg:  8m 44s | Max: 14m 19s | Hits:  98%/495   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 26m 14s | Avg:  8m 44s | Max: 14m 19s | Hits:  98%/495   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 26m 14s | Avg:  8m 44s | Max: 14m 19s | Hits:  98%/495   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 26m 14s | Avg:  8m 44s | Max: 14m 19s | Hits:  98%/495   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 26m 14s | Avg:  8m 44s | Max: 14m 19s | Hits:  98%/495   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 26m 14s | Avg:  8m 44s | Max: 14m 19s | Hits:  98%/495   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s | Hits:  98%/165   
      🟩 rtx2080            Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  9m 33s | Hits:  98%/330   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 22s | Avg:  2m 22s | Max:  2m 22s | Hits:  98%/165   
      🟩 Test               Pass: 100%/2   | Total: 23m 52s | Avg: 11m 56s | Max: 14m 19s | Hits:  98%/330   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco miscco merged commit df3dcf4 into NVIDIA:main Jul 15, 2025
404 of 429 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Jul 15, 2025
@miscco miscco deleted the fix_tuple_size branch July 15, 2025 10:52
miscco added a commit to miscco/cccl that referenced this pull request Jul 15, 2025
miscco added a commit that referenced this pull request Jul 15, 2025
@tomilov

tomilov commented Jul 17, 2025

What is the expected time to market of the fixes? When will they be added to the CUDA installer?

@bernhardmgruber
Contributor

> What is the expected time to market of the fixes? When will they be added to the CUDA installer?

The changes will ship in CCCL 3.1, which is scheduled for CUDA Toolkit 13.1. We don't know when that will ship. In the meantime, you can use CCCL from GitHub directly. See here.

@miscco
Contributor Author

miscco commented Jul 17, 2025

Actually, we have backported them to the 3.0 release branch, whenever that release may be.

Development

Successfully merging this pull request may close these issues.

[BUG]: size of tuple of tuples is not a sum of sizes of tuples