riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Guray Ozen <guray.ozen@gmail.com>	2022-10-28 13:45:44 +0200
committer	Guray Ozen <guray.ozen@gmail.com>	2022-10-28 14:02:40 +0200
commit	3ac17449cf988bfcde804a4cc532420ed1657595 (patch)
tree	16c7461a82f00e532630d326f721048f70301693 /llvm/lib/Object/Archive.cpp
parent	63e3fe10882476696d8a05907dfe627bd61638a3 (diff)
download	llvm-3ac17449cf988bfcde804a4cc532420ed1657595.zip llvm-3ac17449cf988bfcde804a4cc532420ed1657595.tar.gz llvm-3ac17449cf988bfcde804a4cc532420ed1657595.tar.bz2

[mlir][nvvm] Introduce performance tuning directives

PTX programming models provides some performance tuning directives; see https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#performance-tuning-directives The downstream compiler namely `ptxas` leverages these information for better register allocation or to handle other resource management that improves the performance. This revision introduce all the kernel based directives to MLIR's NVVM dialect. The list is below ``` maxnreg -> max register per thread in CTA maxntid -> max threads per CTA reqntid -> exact number of threads per CTA minnctapersm -> min CTA per SM ``` Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D136931

Diffstat (limited to 'llvm/lib/Object/Archive.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: