path: root/flang/lib/Frontend/CompilerInvocation.cpp
author Durgadoss R <durgadossr@nvidia.com> 2024-10-14 19:44:13 +0530
committer GitHub <noreply@github.com> 2024-10-14 19:44:13 +0530
commit a8b51154417c7ead2e1bfffee0b70812270e5730
tree 063102e10d4a0d7f437af61ee33082fea4ebf381 /flang/lib/Frontend/CompilerInvocation.cpp
parent ddb64e62bd1fb4c26cf3b7ba7e04f5f2e469a063
[MLIR][NVGPU] Fix the cga_cluster.mlir test (#112191)
This patch fixes the sm90 cluster test by:

* Fixing a typo in LowerGpuOpsToNVVMOps, where one of the ClusterDim Op conversion patterns should actually be for the ClusterDimBlocks Op. This addresses the compilation error for this test.
* Changing the grid size from (2,2,1) to (4,4,1). With this size, the run passes the scf-if check against the threshold of 3 in the test and actually generates the required prints from the GPU.

Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions