path: root/llvm/lib/Object/ObjectFile.cpp
author: Guray Ozen <guray.ozen@gmail.com> 2023-10-04 09:50:48 +0200
committer: GitHub <noreply@github.com> 2023-10-04 09:50:48 +0200
commit: afe400620f9d22c9d877bd92d437d5dd761708be
tree: 893079871e381cf94871442c594a98dbdeeef2f5 /llvm/lib/Object/ObjectFile.cpp
parent: 20fc2ffb15001e3fc706e6adda3cfe601041a8b0
[MLIR] Use `test-lower-to-nvvm` for sm_90 Integration Tests on GitHub (#68184)
This PR enables the `test-lower-to-nvvm` pass pipeline for the integration tests for the NVIDIA sm_90 architecture.

This PR adjusts the `test-lower-to-nvvm` pass in two ways:

1) Calls `createConvertNVGPUToNVVMPass` before the outlining process. This particular pass is responsible for generating both device and host code. On the host, it calls the CUDA driver to build the TMA descriptor (`cuTensorMap`).

2) Integrates the `createConvertNVVMToLLVMPass` to generate PTXs for NVVM Ops.
Diffstat (limited to 'llvm/lib/Object/ObjectFile.cpp')
0 files changed, 0 insertions, 0 deletions