diff options
author | Durgadoss R <durgadossr@nvidia.com> | 2024-10-14 19:44:13 +0530 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-10-14 19:44:13 +0530 |
commit | a8b51154417c7ead2e1bfffee0b70812270e5730 (patch) | |
tree | 063102e10d4a0d7f437af61ee33082fea4ebf381 /flang/lib/Frontend/CompilerInvocation.cpp | |
parent | ddb64e62bd1fb4c26cf3b7ba7e04f5f2e469a063 (diff) | |
download | llvm-a8b51154417c7ead2e1bfffee0b70812270e5730.zip llvm-a8b51154417c7ead2e1bfffee0b70812270e5730.tar.gz llvm-a8b51154417c7ead2e1bfffee0b70812270e5730.tar.bz2 |
[MLIR][NVGPU] Fix the cga_cluster.mlir test (#112191)
This patch fixes the sm90 cluster test by:
* Fixing a typo in LowerGpuOpsToNVVMOps where one of the ClusterDim Op
conversion pattern should actually be for the
ClusterDimBlocks Op. This addresses the compilation error for this test.
* The grid-size should be (4,4,1) instead of (2,2,1). This passes the
scf-if check against the threshold of 3 below and actually
generates the required prints from the GPU.
Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions