riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Nicolas Vasilache <nicolasvasilache@users.noreply.github.com>	2023-07-25 01:22:15 +0200
committer	Nicolas Vasilache <nicolasvasilache@users.noreply.github.com>	2023-07-26 00:09:08 +0200
commit	44e6318ceacdc00d4f9b0fbb2814d6dc03e27f7d (patch)
tree	ca76bf7f57cc83f2676e07dba9a473995452bf95 /llvm/lib/CodeGen/TargetLoweringObjectFileImpl.cpp
parent	2ee4d0386c783f58abe708298228de648239b435 (diff)
download	llvm-44e6318ceacdc00d4f9b0fbb2814d6dc03e27f7d.zip llvm-44e6318ceacdc00d4f9b0fbb2814d6dc03e27f7d.tar.gz llvm-44e6318ceacdc00d4f9b0fbb2814d6dc03e27f7d.tar.bz2

[mlir][transforms] Revamp the implementation of mapping loops to GPUs

This revision significantly simplifies the specification and implementation of mapping loops to GPU ids. Each type of mapping (block, warpgroup, warp, thread) now comes with 2 mapping modes: 1. a 3-D "grid-like" mode, subject to alignment considerations on threadIdx.x, on which predication may occur on a per-dimension 3-D sub-rectangle basis. 2. a n-D linearized mode, on which predication may only occur on a linear basis. In the process, better size and alignment requirement inference are introduced along with improved runtime verification messages. The `warp_dims` attribute was deemed confusing and is removed from the transform in favor of better size inference. Differential Revision: https://reviews.llvm.org/D155941

Diffstat (limited to 'llvm/lib/CodeGen/TargetLoweringObjectFileImpl.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: