aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Transforms/Utils/LoopUtils.cpp
diff options
context:
space:
mode:
authorPradeep Kumar <pradeepku@nvidia.com>2025-03-17 20:44:52 +0530
committerGitHub <noreply@github.com>2025-03-17 20:44:52 +0530
commit52e7ca9279b4cbe30cacca67548347ef5f96b120 (patch)
tree6cfe601d068211fd6f055c30879c69990f896d59 /llvm/lib/Transforms/Utils/LoopUtils.cpp
parent269c40fafc80576ab4efcd7fba954fd5588ea118 (diff)
downloadllvm-52e7ca9279b4cbe30cacca67548347ef5f96b120.zip
llvm-52e7ca9279b4cbe30cacca67548347ef5f96b120.tar.gz
llvm-52e7ca9279b4cbe30cacca67548347ef5f96b120.tar.bz2
[LLVM][NVPTX] Add support for ldmatrix extensions introduced in PTX 8.6 (#124899)
This commit adds support for the following ldmatrix extensions introduced in PTX 8.6 - Support for m16n16 with b8 type with mandatory transpose - Support for m16n16 with m8n16 with source and desitination formats The above extensions are only supported on sm_100a, sm_101a, sm_120a Please refer the PTX ISA for more information: https://docs.nvidia.com/cuda/parallel-thread-execution/#warp-level-matrix-instructions-ldmatrix
Diffstat (limited to 'llvm/lib/Transforms/Utils/LoopUtils.cpp')
0 files changed, 0 insertions, 0 deletions