aboutsummaryrefslogtreecommitdiff
path: root/clang/lib/Frontend/CompilerInvocation.cpp
diff options
context:
space:
mode:
authorJohannes Doerfert <johannes@jdoerfert.de>2020-08-16 10:49:37 -0500
committerJohannes Doerfert <johannes@jdoerfert.de>2020-08-16 14:38:33 -0500
commitaa27cfc1e7d7456325e951a4ba3ced405027f7d0 (patch)
treed83bd80ef78294c169876b31b1f323e6cca6da5c /clang/lib/Frontend/CompilerInvocation.cpp
parent95a25e4c3203f35e9f57f9fac620b4a21bffd6e1 (diff)
downloadllvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.zip
llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.tar.gz
llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.tar.bz2
[OpenMP][CUDA] Cache the maximal number of threads per block (per kernel)
Instead of calling `cuFuncGetAttribute` with `CU_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK` for every kernel invocation, we can do it for the first one and cache the result as part of the `KernelInfo` struct. The only functional change is that we now expect `cuFuncGetAttribute` to succeed and otherwise propagate the error. Ignoring any error seems like a slippery slope... Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D86038
Diffstat (limited to 'clang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions