aboutsummaryrefslogtreecommitdiff
path: root/libvtv
diff options
context:
space:
mode:
authorTom de Vries <tdevries@suse.de>2022-02-07 14:12:34 +0100
committerTom de Vries <tdevries@suse.de>2022-02-10 09:50:49 +0100
commit5b2d679bbbcc2b976c6e228ba63afdf67c33164e (patch)
tree022ca4396c16c841b3975beaf40beac7819f4778 /libvtv
parent9694f6121982668285a21020b55b44c3099f7042 (diff)
downloadgcc-5b2d679bbbcc2b976c6e228ba63afdf67c33164e.zip
gcc-5b2d679bbbcc2b976c6e228ba63afdf67c33164e.tar.gz
gcc-5b2d679bbbcc2b976c6e228ba63afdf67c33164e.tar.bz2
[nvptx] Workaround sub.u16 driver JIT bug
There's a nvidia driver JIT bug that mishandles this code (minimized from builtin-arith-overflow-15.c): ... int main (void) { signed char r; unsigned char y = (unsigned char) 0x80; if (__builtin_sub_overflow ((unsigned char)0, (unsigned char)y, &r)) __builtin_abort (); return 0; } ... which at ptx level minimizes to: ... mov.u16 r22, 0x0080; st.local.u16 [frame_var],r22; ld.local.u16 r32,[frame_var]; sub.u16 r33,0x0000,r32; cvt.u32.u16 r35,r33; ... where we expect r35 == 0x0000ff80 but get instead 0xffffff80, and where using nvptx-none-run -O0 fixes the problem. [ See also https://github.com/vries/nvidia-bugs/tree/master/builtin-arith-overflow-15 . ] Try to workaround the bug by using sub.s16 instead of sub.u16. Tested on nvptx. gcc/ChangeLog: 2022-02-07 Tom de Vries <tdevries@suse.de> PR target/97005 * config/nvptx/nvptx.md (define_insn "sub<mode>3"): Workaround driver JIT bug by using sub.s16 instead of sub.u16.
Diffstat (limited to 'libvtv')
0 files changed, 0 insertions, 0 deletions