riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Johannes Doerfert <johannes@jdoerfert.de>	2020-08-16 10:49:37 -0500
committer	Johannes Doerfert <johannes@jdoerfert.de>	2020-08-16 14:38:33 -0500
commit	aa27cfc1e7d7456325e951a4ba3ced405027f7d0 (patch)
tree	d83bd80ef78294c169876b31b1f323e6cca6da5c /clang/lib/Frontend/CompilerInvocation.cpp
parent	95a25e4c3203f35e9f57f9fac620b4a21bffd6e1 (diff)
download	llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.zip llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.tar.gz llvm-aa27cfc1e7d7456325e951a4ba3ced405027f7d0.tar.bz2

[OpenMP][CUDA] Cache the maximal number of threads per block (per kernel)

Instead of calling `cuFuncGetAttribute` with `CU_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK` for every kernel invocation, we can do it for the first one and cache the result as part of the `KernelInfo` struct. The only functional change is that we now expect `cuFuncGetAttribute` to succeed and otherwise propagate the error. Ignoring any error seems like a slippery slope... Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D86038

Diffstat (limited to 'clang/lib/Frontend/CompilerInvocation.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: