aboutsummaryrefslogtreecommitdiff
path: root/clang/lib/Frontend/CompilerInvocation.cpp
diff options
context:
space:
mode:
authorSrinivasa Ravi <srinivasar@nvidia.com>2025-10-23 10:27:54 +0530
committerGitHub <noreply@github.com>2025-10-23 10:27:54 +0530
commitd8563e51b56684e7c5491647424aff55d60017fd (patch)
treecb02fe8b6bb6c4dacc183a80f80c21be5f7404b4 /clang/lib/Frontend/CompilerInvocation.cpp
parentf3df058b03867e64af0195001d1e455257a81603 (diff)
downloadllvm-d8563e51b56684e7c5491647424aff55d60017fd.zip
llvm-d8563e51b56684e7c5491647424aff55d60017fd.tar.gz
llvm-d8563e51b56684e7c5491647424aff55d60017fd.tar.bz2
[MLIR][NVVM] Add support for converting fp4/6/8 to fp16x2 (#162439)
This change adds the following NVVM dialect Ops for converting fp4/6/8 to fp16x2: - `convert.f4x2.to.f16x2` - `convert.f6x2.to.f16x2` - `convert.f8x2.to.f16x2` - `convert.f8x2.to.bf16x2` Tests are added in `convert_fp4x2.mlir`, `convert_fp6x2.mlir`, and `convert_fp8x2.mlir`. PTX Reference: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cvt
Diffstat (limited to 'clang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions