aboutsummaryrefslogtreecommitdiff
path: root/clang/lib/Sema/SemaCodeComplete.cpp
diff options
context:
space:
mode:
authorLewis Crawford <lcrawford@nvidia.com>2024-11-15 12:40:53 +0000
committerGitHub <noreply@github.com>2024-11-15 12:40:53 +0000
commit6d058317e60c25b71df8b8dc45b69e5202362678 (patch)
tree9fcbb118387c0f1219773afa97be362146c5a98c /clang/lib/Sema/SemaCodeComplete.cpp
parent3d474738df573b89eedf344463a0c9a005078f1d (diff)
downloadllvm-6d058317e60c25b71df8b8dc45b69e5202362678.zip
llvm-6d058317e60c25b71df8b8dc45b69e5202362678.tar.gz
llvm-6d058317e60c25b71df8b8dc45b69e5202362678.tar.bz2
Enable .ptr .global .align attributes for kernel attributes for CUDA (#114874)
Emit .ptr, .address-space, and .align attributes for kernel args in CUDA (previously handled only for OpenCL). This allows for more vectorization opportunities if the PTX consumer is able to know about the pointer alignments. If no alignment is explicitly specified, .align 1 will be emitted to match the LLVM IR semantics in this case. PTX ISA doc - https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#kernel-parameter-attribute-ptr This is a rework of the original patch proposed in #79646 --------- Co-authored-by: Vandana <vandanak@nvidia.com>
Diffstat (limited to 'clang/lib/Sema/SemaCodeComplete.cpp')
0 files changed, 0 insertions, 0 deletions