diff options
author | modiking <mmo@nvidia.com> | 2025-10-03 09:48:59 -0700 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-10-03 09:48:59 -0700 |
commit | 74180eb024f3e45c4e0ebeb5dd07f34f85ff6539 (patch) | |
tree | 2a9c76c9225e184b7da17b754c4f7e4c4c49bb9e /libcxx/include/__algorithm/comp.h | |
parent | d0e98909d28be377408b1e52fa35423a2236036c (diff) | |
download | llvm-74180eb024f3e45c4e0ebeb5dd07f34f85ff6539.zip llvm-74180eb024f3e45c4e0ebeb5dd07f34f85ff6539.tar.gz llvm-74180eb024f3e45c4e0ebeb5dd07f34f85ff6539.tar.bz2 |
[flang][rt] Add noinline attributes for CUDA compile path for successful compilation (#161760)
NVCC does more aggressive inlining than Clang/GCC causing the exported
functions in extrema.cpp and findloc.cpp to become extremely large from
function specializations leading to compilation timeouts. Marking the 2
functions in this change as noinline for NVCC alleviates this problem as
it removes the worst of the cross-matrix argument specializations.
Also remove the workaround in
https://github.com/llvm/llvm-project/pull/156542 that opted out
findloc.cpp from the CUDA flang-rt build
Testing:
ninja flang-rt builds in ~30 minutes, these 2 files build in ~3 minutes
Diffstat (limited to 'libcxx/include/__algorithm/comp.h')
0 files changed, 0 insertions, 0 deletions