KEMBAR78
Implement fp properties by davebayer · Pull Request #4213 · NVIDIA/cccl · GitHub
Skip to content

Conversation

@davebayer
Copy link
Contributor

This PR implements internal fp properties module.

@davebayer davebayer requested a review from a team as a code owner March 20, 2025 14:49
@davebayer davebayer requested a review from griwes March 20, 2025 14:50
@github-project-automation github-project-automation bot moved this to Todo in CCCL Mar 20, 2025
@copy-pr-bot
Copy link
Contributor

copy-pr-bot bot commented Mar 20, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Mar 20, 2025
// __fp_exp_nbits_v

template <__fp_format _Fmt>
inline constexpr int __fp_exp_nbits_v = 0;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am wondering whether this is something we want to derive from numeric_limits

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are some fundamental differences between the variables defined here and in numeric_limits. Example:

  • the numeric_limits::digits does not specify if the type has the implict bit or not
  • the numeric_limits::min_exponent and numeric_limits::max_exponent are defined in a different way than the actual minimum and maximum exponent values

For these reasons I've decided to implement our own fp properties. We can then use them to simplify the implementation of numeric_limits by a lot

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, I am trying to avoid circular dependencies because __cccl_fp requires conversion rank which requires numeric_limits which requires __fp_from_storage which requires __cccl_fp

@miscco
Copy link
Contributor

miscco commented Mar 20, 2025

/ok to test

@github-actions
Copy link
Contributor

🟩 CI finished in 2h 01m: Pass: 100%/162 | Total: 3d 08h | Avg: 29m 42s | Max: 1h 26m | Hits: 63%/252622
  • 🟩 cub: Pass: 100%/45 | Total: 1d 17h | Avg: 55m 17s | Max: 1h 26m | Hits: 30%/53780

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 15h | Avg: 55m 07s | Max:  1h 26m | Hits:  31%/51336 
      🟩 arm64              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 03s | Max:  1h 07m | Hits:  16%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 07m | Avg:  1h 01m | Max:  1h 07m | Hits:  15%/5940  
      🟩 12.6               Pass: 100%/2   | Total:  2h 49m | Avg:  1h 24m | Max:  1h 26m | Hits:  12%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  1d 09h | Avg: 52m 56s | Max:  1h 14m | Hits:  33%/45580 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m | Hits:  15%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 07m | Avg:  1h 01m | Max:  1h 07m | Hits:  15%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 49m | Avg:  1h 24m | Max:  1h 26m | Hits:  12%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 07h | Avg: 52m 11s | Max:  1h 14m | Hits:  34%/43472 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m | Hits:  15%/2108  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 15h | Avg: 54m 46s | Max:  1h 26m | Hits:  31%/51672 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 25m | Avg:  1h 06m | Max:  1h 07m | Hits:  16%/4896  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 14m | Hits:  16%/2444  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:  16%/2444  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m | Hits:  16%/2444  
      🟩 Clang18            Pass: 100%/7   | Total:  6h 22m | Avg: 54m 41s | Max:  1h 08m | Hits:  41%/8218  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 47m | Avg: 53m 38s | Max: 54m 56s | Hits:  16%/2448  
      🟩 GCC8               Pass: 100%/1   | Total: 56m 48s | Avg: 56m 48s | Max: 56m 48s | Hits:  16%/1224  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 53m | Avg: 56m 40s | Max: 57m 14s | Hits:  16%/2448  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 46m | Avg: 53m 09s | Max: 53m 32s | Hits:  16%/2448  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 43m | Avg: 51m 33s | Max: 51m 57s | Hits:  16%/2444  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 49m | Avg: 54m 53s | Max: 55m 12s | Hits:  16%/2444  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 55m | Avg: 37m 46s | Max: 54m 31s | Hits:  58%/13442 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  12%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  12%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 49m | Avg:  1h 24m | Max:  1h 26m | Hits:  12%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 17h 38m | Avg:  1h 02m | Max:  1h 14m | Hits:  26%/20446 
      🟩 GCC                Pass: 100%/22  | Total: 16h 52m | Avg: 46m 00s | Max: 57m 14s | Hits:  37%/26898 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 08m | Avg:  1h 02m | Max:  1h 03m | Hits:  12%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 49m | Avg:  1h 24m | Max:  1h 26m | Hits:  12%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 45m | Avg: 35m 00s | Max: 47m 55s | Hits:  61%/3666  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 11h | Avg:  1h 02m | Max:  1h 26m | Hits:  15%/40338 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 29m | Avg: 33m 41s | Max:  1h 05m | Hits:  78%/9776  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 13h | Avg:  1h 01m | Max:  1h 26m | Hits:  15%/44004 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 15s | Avg: 28m 15s | Max: 28m 15s | Hits:  99%/1222  
      🟩 GraphCapture       Pass: 100%/1   | Total: 21m 40s | Avg: 21m 40s | Max: 21m 40s | Hits:  99%/1222  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 20m | Avg: 26m 57s | Max: 27m 58s | Hits:  99%/3666  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 34m | Avg: 31m 33s | Max: 47m 55s | Hits:  89%/3666  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 45m | Avg: 35m 00s | Max: 47m 55s | Hits:  61%/3666  
      🟩 90;90a;100         Pass: 100%/1   | Total: 51m 11s | Avg: 51m 11s | Max: 51m 11s | Hits:  16%/1222  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 20h 26m | Avg:  1h 01m | Max:  1h 23m | Hits:  15%/23662 
      🟩 20                 Pass: 100%/25  | Total: 21h 01m | Avg: 50m 28s | Max:  1h 26m | Hits:  41%/30118 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 1d 02h | Avg: 34m 56s | Max: 1h 09m | Hits: 56%/79911

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 43m 19s | Avg: 21m 39s | Max: 31m 52s | Hits:  73%/3554  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 01h | Avg: 35m 05s | Max:  1h 09m | Hits:  57%/76358 
      🟩 arm64              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 41s | Max: 33m 12s | Hits:  47%/3553  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 31m | Avg: 42m 20s | Max:  1h 09m | Hits:  57%/8876  
      🟩 12.6               Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m | Hits:  25%/3552  
      🟩 12.8               Pass: 100%/38  | Total: 20h 29m | Avg: 32m 21s | Max:  1h 04m | Hits:  58%/67483 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 01m | Avg: 30m 54s | Max: 30m 59s | Hits:  48%/3552  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 31m | Avg: 42m 20s | Max:  1h 09m | Hits:  57%/8876  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m | Hits:  25%/3552  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 19h 27m | Avg: 32m 26s | Max:  1h 04m | Hits:  58%/63931 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 54s | Max: 30m 59s | Hits:  48%/3552  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 01h | Avg: 35m 08s | Max:  1h 09m | Hits:  57%/76359 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 07s | Max: 34m 44s | Hits:  56%/7104  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 10m | Avg: 35m 28s | Max: 36m 26s | Hits:  48%/3552  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 21s | Max: 34m 52s | Hits:  48%/3552  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 09m | Avg: 34m 32s | Max: 34m 34s | Hits:  48%/3552  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 58m | Avg: 25m 25s | Max: 34m 19s | Hits:  64%/12432 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 43s | Max: 35m 26s | Hits:  60%/3554  
      🟩 GCC8               Pass: 100%/1   | Total: 34m 50s | Avg: 34m 50s | Max: 34m 50s | Hits:  47%/1777  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 17m | Avg: 38m 37s | Max: 38m 40s | Hits:  54%/3554  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 48s | Max: 39m 40s | Hits:  47%/3554  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 14s | Max: 38m 02s | Hits:  47%/3554  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 12m | Avg: 36m 01s | Max: 36m 03s | Hits:  47%/3554  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 51m | Avg: 23m 07s | Max: 35m 49s | Hits:  74%/17770 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 09m | Hits:  41%/3540  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 32m | Avg: 50m 52s | Max:  1h 04m | Hits:  50%/5310  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m | Hits:  25%/3552  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 43m | Avg: 30m 46s | Max: 36m 26s | Hits:  56%/30192 
      🟩 GCC                Pass: 100%/21  | Total: 10h 34m | Avg: 30m 13s | Max: 39m 40s | Hits:  62%/37317 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 43m | Avg: 56m 36s | Max:  1h 09m | Hits:  46%/8850  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m | Hits:  25%/3552  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 36m 13s | Avg: 18m 06s | Max: 24m 24s | Hits:  73%/3554  
      🟩 rtx2080            Pass: 100%/33  | Total: 21h 41m | Avg: 39m 25s | Max:  1h 09m | Hits:  48%/58604 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 55m | Avg: 23m 32s | Max:  1h 04m | Hits:  80%/17753 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  1d 00h | Avg: 39m 08s | Max:  1h 09m | Hits:  48%/67481 
      🟩 TestCPU            Pass: 100%/3   | Total: 39m 42s | Avg: 13m 14s | Max: 24m 19s | Hits:  99%/5323  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 16s | Avg: 11m 19s | Max: 11m 49s | Hits:  99%/7107  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 36m 13s | Avg: 18m 06s | Max: 24m 24s | Hits:  73%/3554  
      🟩 90;90a;100         Pass: 100%/1   | Total: 31m 00s | Avg: 31m 00s | Max: 31m 00s | Hits:  75%/1777  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 13h 46m | Avg: 41m 18s | Max:  1h 09m | Hits:  48%/35511 
      🟩 20                 Pass: 100%/23  | Total: 11h 43m | Avg: 30m 34s | Max:  1h 04m | Hits:  62%/40846 
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 8h 27m | Avg: 11m 47s | Max: 36m 30s | Hits: 81%/106759

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  8h 16m | Avg: 12m 05s | Max: 36m 30s | Hits:  81%/100920
      🟩 arm64              Pass: 100%/2   | Total: 11m 01s | Avg:  5m 30s | Max:  5m 38s | Hits:  92%/5839  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 51m 18s | Avg: 10m 15s | Max: 22m 12s | Hits:  89%/14223 
      🟩 12.6               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 36s | Max: 36m 30s | Hits:  28%/5786  
      🟩 12.8               Pass: 100%/36  | Total:  6h 24m | Avg: 10m 40s | Max: 24m 22s | Hits:  83%/86750 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 43m 52s | Avg: 21m 56s | Max: 23m 01s | Hits:  27%/5800  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 51m 18s | Avg: 10m 15s | Max: 22m 12s | Hits:  89%/14223 
      🟩 nvcc12.6           Pass: 100%/2   | Total:  1h 11m | Avg: 35m 36s | Max: 36m 30s | Hits:  28%/5786  
      🟩 nvcc12.8           Pass: 100%/34  | Total:  5h 40m | Avg: 10m 01s | Max: 24m 22s | Hits:  87%/80950 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 43m 52s | Avg: 21m 56s | Max: 23m 01s | Hits:  27%/5800  
      🟩 nvcc               Pass: 100%/41  | Total:  7h 43m | Avg: 11m 17s | Max: 36m 30s | Hits:  84%/100959
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 32m 42s | Avg:  8m 10s | Max: 11m 07s | Hits:  88%/11568 
      🟩 Clang15            Pass: 100%/2   | Total: 16m 05s | Avg:  8m 02s | Max: 10m 02s | Hits:  91%/5796  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 08s | Avg:  6m 04s | Max:  6m 09s | Hits:  93%/5796  
      🟩 Clang17            Pass: 100%/2   | Total: 13m 54s | Avg:  6m 57s | Max:  7m 36s | Hits:  92%/5796  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 10m | Avg: 11m 47s | Max: 23m 01s | Hits:  66%/14515 
      🟩 GCC7               Pass: 100%/2   | Total: 31m 28s | Avg: 15m 44s | Max: 21m 22s | Hits:  59%/5734  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 25s | Avg:  7m 25s | Max:  7m 25s | Hits:  88%/2877  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 50s | Hits:  92%/5746  
      🟩 GCC10              Pass: 100%/2   | Total: 26m 40s | Avg: 13m 20s | Max: 20m 16s | Hits:  67%/5802  
      🟩 GCC11              Pass: 100%/2   | Total: 15m 42s | Avg:  7m 51s | Max: 10m 03s | Hits:  89%/5798  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max:  6m 28s | Hits:  92%/5798  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 35m | Avg:  9m 32s | Max: 22m 11s | Hits:  92%/14776 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 44m 21s | Avg: 22m 10s | Max: 22m 12s | Hits:  93%/5450  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 46m 03s | Avg: 23m 01s | Max: 24m 22s | Hits:  93%/5521  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 36s | Max: 36m 30s | Hits:  28%/5786  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  2h 25m | Avg:  9m 05s | Max: 23m 01s | Hits:  82%/43471 
      🟩 GCC                Pass: 100%/21  | Total:  3h 19m | Avg:  9m 30s | Max: 22m 11s | Hits:  84%/46531 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 30m | Avg: 22m 36s | Max: 24m 22s | Hits:  93%/10971 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 36s | Max: 36m 30s | Hits:  28%/5786  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 29s | Avg:  9m 14s | Max: 13m 40s | Hits:  93%/3009  
      🟩 rtx2080            Pass: 100%/41  | Total:  8h 08m | Avg: 11m 54s | Max: 36m 30s | Hits:  81%/103750
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  7h 12m | Avg: 11m 41s | Max: 36m 30s | Hits:  81%/106719
      🟩 NVRTC              Pass: 100%/2   | Total: 37m 51s | Avg: 18m 55s | Max: 22m 11s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 34m 02s | Avg: 11m 20s | Max: 13m 40s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 24s | Avg:  2m 24s | Max:  2m 24s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 37m 51s | Avg: 18m 55s | Max: 22m 11s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 18m 29s | Avg:  9m 14s | Max: 13m 40s | Hits:  93%/3009  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 01s | Avg:  7m 01s | Max:  7m 01s | Hits:  91%/3009  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 34m | Avg: 13m 03s | Max: 34m 43s | Hits:  79%/57073 
      🟩 20                 Pass: 100%/21  | Total:  3h 50m | Avg: 10m 58s | Max: 36m 30s | Hits:  84%/49686 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 17m | Avg: 6m 14s | Max: 12m 52s | Hits: 95%/11852

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  2h 01m | Avg:  6m 45s | Max: 12m 52s | Hits:  95%/9512  
      🟩 arm64              Pass: 100%/4   | Total: 15m 50s | Avg:  3m 57s | Max:  4m 07s | Hits:  95%/2340  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 40s | Avg: 10m 40s | Max: 10m 40s | Hits:  87%/282   
      🟩 12.6               Pass: 100%/2   | Total: 14m 50s | Avg:  7m 25s | Max:  7m 40s | Hits:  90%/754   
      🟩 12.8               Pass: 100%/19  | Total:  1h 51m | Avg:  5m 53s | Max: 12m 52s | Hits:  95%/10816 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 40s | Avg: 10m 40s | Max: 10m 40s | Hits:  87%/282   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 14m 50s | Avg:  7m 25s | Max:  7m 40s | Hits:  90%/754   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 51m | Avg:  5m 53s | Max: 12m 52s | Hits:  95%/10816 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 17m | Avg:  6m 14s | Max: 12m 52s | Hits:  95%/11852 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s | Hits:  95%/587   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 58s | Avg:  4m 58s | Max:  4m 58s | Hits:  95%/585   
      🟩 Clang16            Pass: 100%/1   | Total:  4m 43s | Avg:  4m 43s | Max:  4m 43s | Hits:  95%/585   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s | Hits:  95%/585   
      🟩 Clang18            Pass: 100%/4   | Total: 24m 43s | Avg:  6m 10s | Max: 12m 22s | Hits:  96%/2340  
      🟩 GCC10              Pass: 100%/1   | Total:  4m 49s | Avg:  4m 49s | Max:  4m 49s | Hits:  94%/587   
      🟩 GCC11              Pass: 100%/1   | Total:  4m 26s | Avg:  4m 26s | Max:  4m 26s | Hits:  94%/585   
      🟩 GCC12              Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 12m 52s | Hits:  97%/1170  
      🟩 GCC13              Pass: 100%/6   | Total: 31m 05s | Avg:  5m 10s | Max: 11m 33s | Hits:  95%/3510  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 40s | Avg: 10m 40s | Max: 10m 40s | Hits:  87%/282   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 10m 13s | Avg: 10m 13s | Max: 10m 13s | Hits:  87%/282   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 14m 50s | Avg:  7m 25s | Max:  7m 40s | Hits:  90%/754   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 43m 49s | Avg:  5m 28s | Max: 12m 22s | Hits:  95%/4682  
      🟩 GCC                Pass: 100%/10  | Total: 57m 56s | Avg:  5m 47s | Max: 12m 52s | Hits:  95%/5852  
      🟩 MSVC               Pass: 100%/2   | Total: 20m 53s | Avg: 10m 26s | Max: 10m 40s | Hits:  87%/564   
      🟩 NVHPC              Pass: 100%/2   | Total: 14m 50s | Avg:  7m 25s | Max:  7m 40s | Hits:  90%/754   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 15m 22s | Avg:  7m 41s | Max: 11m 33s | Hits:  97%/1170  
      🟩 rtx2080            Pass: 100%/20  | Total:  2h 02m | Avg:  6m 06s | Max: 12m 52s | Hits:  94%/10682 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 40m | Avg:  5m 17s | Max: 10m 40s | Hits:  94%/10097 
      🟩 Test               Pass: 100%/3   | Total: 36m 47s | Avg: 12m 15s | Max: 12m 52s | Hits:  99%/1755  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 19m 04s | Avg:  6m 21s | Max: 11m 33s | Hits:  96%/1755  
      🟩 90a                Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s | Hits:  94%/585   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 19m 14s | Avg:  4m 48s | Max:  7m 40s | Hits:  94%/2132  
      🟩 20                 Pass: 100%/18  | Total:  1h 58m | Avg:  6m 34s | Max: 12m 52s | Hits:  95%/9720  
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 35s | Avg: 4m 23s | Max: 5m 05s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  5m 05s
      🟩 arm64              Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  3m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  5m 05s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  5m 05s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  5m 05s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  5m 05s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  5m 05s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  5m 05s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  5m 05s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 35s | Avg:  4m 17s | Max:  4m 47s
      🟩 20                 Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  5m 05s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 18m 59s | Avg: 9m 29s | Max: 16m 17s | Hits: 96%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max: 16m 17s | Hits:  96%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max: 16m 17s | Hits:  96%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max: 16m 17s | Hits:  96%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max: 16m 17s | Hits:  96%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max: 16m 17s | Hits:  96%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max: 16m 17s | Hits:  96%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max: 16m 17s | Hits:  96%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 42s | Avg:  2m 42s | Max:  2m 42s | Hits:  94%/160   
      🟩 Test               Pass: 100%/1   | Total: 16m 17s | Avg: 16m 17s | Max: 16m 17s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 10m | Avg: 1h 10m | Max: 1h 10m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco miscco merged commit 810b189 into NVIDIA:main Mar 20, 2025
180 of 181 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 20, 2025
davebayer added a commit to davebayer/cccl that referenced this pull request Apr 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

2 participants