aboutsummaryrefslogtreecommitdiff
path: root/llvm/tools/llvm-objdump/llvm-objdump.cpp
diff options
context:
space:
mode:
authorNavdeep Kumar <navdeep.navdeep37@gmail.com>2021-05-06 12:05:07 +0530
committerUday Bondhugula <uday@polymagelabs.com>2021-05-06 12:06:25 +0530
commit875eb523c13249114507cb8facd797773e278d9e (patch)
tree8076679a1df69ec424de7f8d78914dcbdeb5f5a0 /llvm/tools/llvm-objdump/llvm-objdump.cpp
parent16c7829784f071d9fd4ae9da4cc8b3786a58018e (diff)
downloadllvm-875eb523c13249114507cb8facd797773e278d9e.zip
llvm-875eb523c13249114507cb8facd797773e278d9e.tar.gz
llvm-875eb523c13249114507cb8facd797773e278d9e.tar.bz2
[MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops
Add warp synchronous matrix-multiply accumulate ops in GPU and NVVM dialect. Add following three ops to GPU dialect :- 1.) subgroup_mma_load_matrix 2.) subgroup_mma_store_matrix 3.) subgroup_mma_compute Add following three ops to NVVM dialect :- 1.) wmma.m16n16k16.load.[a,b,c].[f16,f32].row.stride 2.) wmma.m16n16k16.store.d.[f16,f32].row.stride 3.) wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32] Reviewed By: bondhugula, ftynse, ThomasRaoux Differential Revision: https://reviews.llvm.org/D95330
Diffstat (limited to 'llvm/tools/llvm-objdump/llvm-objdump.cpp')
0 files changed, 0 insertions, 0 deletions