diff options
author | Lewis Crawford <lcrawford@nvidia.com> | 2024-11-15 12:40:53 +0000 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-11-15 12:40:53 +0000 |
commit | 6d058317e60c25b71df8b8dc45b69e5202362678 (patch) | |
tree | 9fcbb118387c0f1219773afa97be362146c5a98c /clang/lib/Sema/SemaCodeComplete.cpp | |
parent | 3d474738df573b89eedf344463a0c9a005078f1d (diff) | |
download | llvm-6d058317e60c25b71df8b8dc45b69e5202362678.zip llvm-6d058317e60c25b71df8b8dc45b69e5202362678.tar.gz llvm-6d058317e60c25b71df8b8dc45b69e5202362678.tar.bz2 |
Enable .ptr .global .align attributes for kernel attributes for CUDA (#114874)
Emit .ptr, .address-space, and .align attributes for kernel
args in CUDA (previously handled only for OpenCL).
This allows for more vectorization opportunities if the PTX consumer
is able to know about the pointer alignments.
If no alignment is explicitly specified, .align 1 will be emitted
to match the LLVM IR semantics in this case.
PTX ISA doc -
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#kernel-parameter-attribute-ptr
This is a rework of the original patch proposed in #79646
---------
Co-authored-by: Vandana <vandanak@nvidia.com>
Diffstat (limited to 'clang/lib/Sema/SemaCodeComplete.cpp')
0 files changed, 0 insertions, 0 deletions