rocket-tools/riscv-gnu-toolchain/llvm.git

author	Durgadoss R <durgadossr@nvidia.com>	2024-10-14 19:44:13 +0530
committer	GitHub <noreply@github.com>	2024-10-14 19:44:13 +0530
commit	a8b51154417c7ead2e1bfffee0b70812270e5730 (patch)
tree	063102e10d4a0d7f437af61ee33082fea4ebf381 /flang/lib/Frontend/CompilerInvocation.cpp
parent	ddb64e62bd1fb4c26cf3b7ba7e04f5f2e469a063 (diff)
download	llvm-a8b51154417c7ead2e1bfffee0b70812270e5730.zip llvm-a8b51154417c7ead2e1bfffee0b70812270e5730.tar.gz llvm-a8b51154417c7ead2e1bfffee0b70812270e5730.tar.bz2

[MLIR][NVGPU] Fix the cga_cluster.mlir test (#112191)

This patch fixes the sm90 cluster test by:

* Fixing a typo in LowerGpuOpsToNVVMOps, where one of the ClusterDim Op conversion patterns should actually be for the ClusterDimBlocks Op. This addresses the compilation error for this test.
* Changing the grid size to (4,4,1) instead of (2,2,1). This passes the scf-if check against the threshold of 3 below and actually generates the required prints from the GPU.

Signed-off-by: Durgadoss R <durgadossr@nvidia.com>

Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')

0 files changed, 0 insertions, 0 deletions

