author | Guray Ozen <guray.ozen@gmail.com> | 2023-10-04 09:50:48 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-10-04 09:50:48 +0200 |
commit | afe400620f9d22c9d877bd92d437d5dd761708be (patch) | |
tree | 893079871e381cf94871442c594a98dbdeeef2f5 /llvm/lib/Object/ObjectFile.cpp | |
parent | 20fc2ffb15001e3fc706e6adda3cfe601041a8b0 (diff) | |
[MLIR] Use `test-lower-to-nvvm` for sm_90 Integration Tests on GitHub (#68184)
This PR enables the `test-lower-to-nvvm` pass pipeline for the integration
tests for the NVIDIA sm_90 architecture.

This PR adjusts the `test-lower-to-nvvm` pass pipeline in two ways:
1) Calls `createConvertNVGPUToNVVMPass` before the outlining process.
This particular pass is responsible for generating both device and host
code. On the host, it calls the CUDA driver to build the TMA descriptor
(`cuTensorMap`).
2) Adds `createConvertNVVMToLLVMPass` to the pipeline to generate PTX for
NVVM ops.