aboutsummaryrefslogtreecommitdiff
path: root/llvm/test/CodeGen/NVPTX
AgeCommit message (Expand)AuthorFilesLines
2024-03-29Reland "[NVPTX] Use .common linkage for common globals" (#86824)Alex MacLean2-3/+35
2024-03-19[NVPTX] Use PTX prmt for llvm.bswap (#85545)Alex MacLean1-0/+77
2024-03-19[llvm][NVPTX] Add missing feature guard.Adrian Kuegel1-2/+2
2024-03-15Revert "[NVPTX] Use .common linkage for common globals (#84416)"Sterling Augustine2-27/+1
2024-03-15[NVPTX] support dynamic allocas with PTX alloca instruction (#84585)Alex MacLean1-7/+41
2024-03-15Reland "[NVPTX] Add support for atomic add for f16 type" (#85197)Adrian Kuegel2-0/+149
2024-03-14[NVPTX] Use .common linkage for common globals (#84416)Alex MacLean2-1/+27
2024-03-13[Tests] Drop inrange attribute from some tests (NFC)Nikita Popov1-1/+1
2024-03-12Revert "[NVPTX] Add support for atomic add for f16 type" (#84918)Danial Klimkin2-128/+0
2024-03-12[NVPTX] Add support for atomic add for f16 type (#84295)Adrian Kuegel2-0/+128
2024-03-05[NVPTX] Remove sub.s16x2 instructionBenjamin Kramer1-6/+5
2024-02-22[NVPTX] fixup support for unaligned parameters and returns (#82562)Alex MacLean2-19/+459
2024-02-21Correctly round FP -> BF16 when SDAG expands such nodes (#82399)David Majnemer1-1/+1
2024-02-13[LLVM] Add `__builtin_readsteadycounter` intrinsic and builtin for realtime c...Joseph Huber1-0/+12
2024-02-12[NVPTX] pass correct GPU arch to ptxas test (#81535)Artem Belevich1-1/+1
2024-02-12[NVPTX] Fix the error in a pattern match in v4i8 comparisons. (#81308)Artem Belevich1-175/+191
2024-02-12[NVPTX] Implement `__builtin_readcyclecounter` on NVPTX (#81344)Joseph Huber1-0/+12
2024-02-12Do not use PerformEXTRACTCombine for v8i8 types (#81242)Adrian Kuegel1-45/+47
2024-02-09[NVPTX] Add clang builtin for `__nvvm_reflect` intrinsic (#81277)Joseph Huber2-4/+4
2024-02-09[NVPTX][Fix] Update minimum CPU for NVPTX intrinsics testJoseph Huber1-4/+4
2024-02-09[NVVMReflect] Improve folding inside of the NVVMReflect pass (#81253)Joseph Huber1-14/+64
2024-02-08[NVVMReflect][Reland] Force dead branch elimination in NVVMReflect (#81189)Joseph Huber2-1/+175
2024-02-08Revert "[NVVMReflect] Force dead branch elimination in NVVMReflect (#81189)"Joseph Huber2-141/+1
2024-02-08[NVVMReflect] Force dead branch elimination in NVVMReflect (#81189)Joseph Huber2-1/+141
2024-02-08[NVPTX] Add support for calling aliases (#81170)Alex MacLean1-13/+39
2024-02-05[CodeGen] Convert tests to opaque pointers (NFC)Nikita Popov9-123/+123
2024-01-31[NVPTX] improve Boolean ISel (#80166)Alex MacLean1-0/+33
2024-01-29Revert "Disable incorrect peephole optimizations" (#79916)Justin Fargnoli1-291/+171
2024-01-29Disable incorrect peephole optimizationsJustin Fargnoli1-171/+291
2024-01-29[NVPTX] Add builtin support for 'globaltimer' (#79765)Joseph Huber1-0/+12
2024-01-29[NVPTX] Add builtin for 'exit' handling (#79777)Joseph Huber1-0/+8
2024-01-29[NVPTX] Add builtin support for 'nanosleep' PTX instrunction (#79888)Joseph Huber1-0/+20
2024-01-29[NVPTX] Add 'activemask' builtin and intrinsic support (#79768)Joseph Huber1-0/+38
2024-01-26[NVPTX] improve identifier renaming for PTX (#79459)Alex MacLean1-1/+4
2024-01-26[SeperateConstOffsetFromGEP] Handle `or disjoint` flags (#76997)Krzysztof Drewniak1-4/+4
2024-01-24[NVPTX] use incomplete aggregate initializers (#79062)Alex MacLean3-2/+24
2024-01-19[LLVM][NVPTX] Add cp.async.bulk.commit/wait intrinsics (#78698)Durgadoss R1-0/+28
2024-01-17[NVPTX] extend type support for nvvm.{min,max,mulhi,sad} (#78385)Alex MacLean2-0/+214
2024-01-17[NVPTX] Add tex.grad.cube{array} intrinsics (#77693)Alex MacLean1-11/+30
2024-01-16[NVPTX] Fix generating permute bytes from register pair when the initial valu...mmoadeli1-0/+18
2024-01-13[LLVM][NVPTX]: Add aligned versions of cluster barriers (#77940)Durgadoss R1-0/+13
2024-01-09[LLVM][NVPTX]: Add intrinsic for setmaxnreg (#77289)Durgadoss R1-0/+16
2024-01-08Set MaxAtomicSizeInBitsSupported for remaining targets. (#75703)James Y Knight1-22/+26
2023-12-31[FuncAttrs] Deduce `noundef` attributes for return values (#76553)Yingwei Zheng2-2/+2
2023-12-14[llvm][NVPTX] Inform that 'DYNAMIC_STACKALLOC' is unsupported (#74684)Youngsuk Kim1-0/+10
2023-12-11[NVPTX] Custom lower integer<->bf16 conversions for sm_80 (#74827)Benjamin Kramer1-0/+103
2023-12-08[NVPTX] Fix a typo that makes the output invalid PTXBenjamin Kramer1-0/+10
2023-12-06Revert "[NVPTX] Lower 16xi8 and 8xi8 stores efficiently (#73646)" (#74518)Artem Belevich2-19/+4
2023-12-01[NVPTX] Lower 16xi8 and 8xi8 stores efficiently (#73646)Uday Bondhugula2-4/+19
2023-11-28[NFC][NVPTX] Add a simpler test case for 0b80288e9e0b (#73379)Uday Bondhugula1-4/+12