riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	modiking <mmo@nvidia.com>	2025-10-03 09:48:59 -0700
committer	GitHub <noreply@github.com>	2025-10-03 09:48:59 -0700
commit	74180eb024f3e45c4e0ebeb5dd07f34f85ff6539 (patch)
tree	2a9c76c9225e184b7da17b754c4f7e4c4c49bb9e /libcxx/include/__algorithm/comp.h
parent	d0e98909d28be377408b1e52fa35423a2236036c (diff)
download	llvm-74180eb024f3e45c4e0ebeb5dd07f34f85ff6539.zip llvm-74180eb024f3e45c4e0ebeb5dd07f34f85ff6539.tar.gz llvm-74180eb024f3e45c4e0ebeb5dd07f34f85ff6539.tar.bz2

[flang][rt] Add noinline attributes for CUDA compile path for successful compilation (#161760)

NVCC does more aggressive inlining than Clang/GCC causing the exported functions in extrema.cpp and findloc.cpp to become extremely large from function specializations leading to compilation timeouts. Marking the 2 functions in this change as noinline for NVCC alleviates this problem as it removes the worst of the cross-matrix argument specializations. Also remove the workaround in https://github.com/llvm/llvm-project/pull/156542 that opted out findloc.cpp from the CUDA flang-rt build Testing: ninja flang-rt builds in ~30 minutes, these 2 files build in ~3 minutes

Diffstat (limited to 'libcxx/include/__algorithm/comp.h')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: