diff options
| author | Srinivasa Ravi <srinivasar@nvidia.com> | 2025-10-23 10:27:54 +0530 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2025-10-23 10:27:54 +0530 |
| commit | d8563e51b56684e7c5491647424aff55d60017fd (patch) | |
| tree | cb02fe8b6bb6c4dacc183a80f80c21be5f7404b4 /llvm/lib/Bitcode/Reader/BitcodeReader.cpp | |
| parent | f3df058b03867e64af0195001d1e455257a81603 (diff) | |
| download | llvm-d8563e51b56684e7c5491647424aff55d60017fd.zip llvm-d8563e51b56684e7c5491647424aff55d60017fd.tar.gz llvm-d8563e51b56684e7c5491647424aff55d60017fd.tar.bz2 | |
[MLIR][NVVM] Add support for converting fp4/6/8 to fp16x2 (#162439)
This change adds the following NVVM dialect Ops for converting fp4/6/8
to fp16x2:
- `convert.f4x2.to.f16x2`
- `convert.f6x2.to.f16x2`
- `convert.f8x2.to.f16x2`
- `convert.f8x2.to.bf16x2`
Tests are added in `convert_fp4x2.mlir`, `convert_fp6x2.mlir`, and
`convert_fp8x2.mlir`.
PTX Reference:
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cvt
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')
0 files changed, 0 insertions, 0 deletions
