diff options
author | Johannes Doerfert <johannes@jdoerfert.de> | 2020-08-16 10:49:37 -0500 |
---|---|---|
committer | Johannes Doerfert <johannes@jdoerfert.de> | 2020-08-16 14:38:33 -0500 |
commit | aa27cfc1e7d7456325e951a4ba3ced405027f7d0 (patch) | |
tree | d83bd80ef78294c169876b31b1f323e6cca6da5c /clang/lib/Frontend/CompilerInvocation.cpp | |
parent | 95a25e4c3203f35e9f57f9fac620b4a21bffd6e1 (diff) | |
download | llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.zip llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.tar.gz llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.tar.bz2 |
[OpenMP][CUDA] Cache the maximal number of threads per block (per kernel)
Instead of calling `cuFuncGetAttribute` with
`CU_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK` for every kernel invocation,
we can do it for the first one and cache the result as part of the
`KernelInfo` struct. The only functional change is that we now expect
`cuFuncGetAttribute` to succeed and otherwise propagate the error.
Ignoring any error seems like a slippery slope...
Reviewed By: JonChesterfield
Differential Revision: https://reviews.llvm.org/D86038
Diffstat (limited to 'clang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions