diff options
author | Alan Li <me@alanli.org> | 2025-07-24 06:23:15 -0700 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-07-24 09:23:15 -0400 |
commit | 1c3e4e994b4affc7f0314bb3dbecf467fbfcab3e (patch) | |
tree | 0f80ea944b91ce80ef48e09f2dda8bdbd2ef2d9b /lldb/unittests/ScriptInterpreter/Python | |
parent | baa19c05a3565c930f47ecdbc7fc6693ec1a7c2e (diff) | |
download | llvm-1c3e4e994b4affc7f0314bb3dbecf467fbfcab3e.zip llvm-1c3e4e994b4affc7f0314bb3dbecf467fbfcab3e.tar.gz llvm-1c3e4e994b4affc7f0314bb3dbecf467fbfcab3e.tar.bz2 |
Reapply "[AMDGPU] fold `memref.subview/expand_shape/collapse_shape` into `amdgpu.gather_to_lds`" (#150334)
This is a reapply of patch #149851. The reapply also fixes a CMake/Bazel
build issue, which was the reason of the revert. (Thanks @rupprecht )
Original patch (#149851) message:
-----
This PR adds a new optimization pass to fold
`memref.subview/expand_shape/collapse_shape` ops into consumer
`amdgpu.gather_to_lds` operations.
* Implements a new pass `AmdgpuFoldMemRefOpsPass` with pattern
`FoldMemRefOpsIntoGatherToLDSOp`
* Adds corresponding folding tests
Diffstat (limited to 'lldb/unittests/ScriptInterpreter/Python')
0 files changed, 0 insertions, 0 deletions