author | Haochen Jiang <haochen.jiang@intel.com> | 2024-08-26 10:53:56 +0800 |
---|---|---|
committer | Haochen Jiang <haochen.jiang@intel.com> | 2024-08-26 13:22:04 +0800 |
commit | b4ac2c23d8745d98984954e88f02aa73f1c3594b (patch) | |
tree | f08c8af5b6463bfb9df1149aa32ca292c3ddd289 /gcc/tree-if-conv.cc | |
parent | 576bd309ded9dfe258023f26924c064a7bf12875 (diff) | |
i386: Add bf8 -> fp16 intrin
Since BF8 and FP16 have the same number of exponent bits, the type
conversion between them is just a cast of the fraction part. We will use
a sequence of instructions rather than new instructions to do that. For
convenience, intrinsics are also provided.
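The bit-level reasoning above can be illustrated with a scalar sketch. It
assumes BF8 is the E5M2 layout (1 sign, 5 exponent, 2 fraction bits) and
FP16 is IEEE binary16 (1 sign, 5 exponent, 10 fraction bits), so widening
only appends eight zero fraction bits. The helper names are hypothetical
and this is not the instruction sequence GCC emits for the intrinsics.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper, not part of GCC: widen one BF8 (assumed E5M2)
   bit pattern to the FP16 bit pattern of the same value.  Sign and
   exponent keep their positions at the top of the word; the 2-bit
   fraction is padded with 8 zero bits on the right.  */
static inline uint16_t
bf8_to_fp16_bits (uint8_t bf8)
{
  return (uint16_t) ((uint16_t) bf8 << 8);
}

/* Hypothetical self-check on a few well-known encodings.  */
static inline void
bf8_to_fp16_selfcheck (void)
{
  assert (bf8_to_fp16_bits (0x3C) == 0x3C00); /* 1.0 */
  assert (bf8_to_fp16_bits (0xC0) == 0xC000); /* -2.0 */
  assert (bf8_to_fp16_bits (0x7C) == 0x7C00); /* +infinity */
}
```

Under these assumptions the conversion is exact for every input, including
zeros, subnormals, infinities and NaNs, because no exponent rebiasing or
rounding is involved.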
gcc/ChangeLog:
* config/i386/avx10_2-512convertintrin.h
(_mm512_cvtpbf8_ph): New.
(_mm512_mask_cvtpbf8_ph): Ditto.
(_mm512_maskz_cvtpbf8_ph): Ditto.
* config/i386/avx10_2convertintrin.h
(_mm_cvtpbf8_ph): Ditto.
(_mm_mask_cvtpbf8_ph): Ditto.
(_mm_maskz_cvtpbf8_ph): Ditto.
(_mm256_cvtpbf8_ph): Ditto.
(_mm256_mask_cvtpbf8_ph): Ditto.
(_mm256_maskz_cvtpbf8_ph): Ditto.
gcc/testsuite/ChangeLog:
* gcc.target/i386/avx10_2-512-convert-1.c: Add tests for new
intrin.
* gcc.target/i386/avx10_2-convert-1.c: Ditto.