aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Target/NVPTX
AgeCommit message (Expand)AuthorFilesLines
2026-02-10Reland "[NVPTX] Validate user-specified PTX version against SM version" (#180...Justin Fargnoli3-61/+113
2026-02-06[NVPTX] Remove `NoSignedZerosFPMath` uses (#180086)paperchalice1-1/+1
2026-02-05Revert "Reland "[NVPTX] Validate user-specified PTX version against SM versio...Justin Fargnoli3-113/+61
2026-02-05Reland "[NVPTX] Validate user-specified PTX version against SM version" (#179...Justin Fargnoli3-61/+113
2026-02-04Reland "[NVPTX][AtomicExpandPass] Complete support for AtomicRMW in NVPTX (#1...Akshay Deodhar2-95/+163
2026-02-04[LowerMemIntrinsics] Optimize memset lowering (#169040)Fabian Ritter1-1/+1
2026-02-03[NVPTX] Print PM Event Mask value as unsigned integer. (#178891)Kirill Vedernikov3-1/+20
2026-02-02[CodeGen] Refactor targets to override the new getTgtMemIntrinsic overload (N...Nicolai Hähnle2-62/+115
2026-02-01[DebugInfo][NVPTX] Adding support for `inlined_at` debug directive in NVPTX b...Laxman Sole5-0/+232
2026-01-30[NFC] modify comment for NVPTXLowerAlloca (#178583)woruyu1-1/+1
2026-01-28Revert "[NVPTX][AtomicExpandPass] Complete support for AtomicRMW in NVPTX (#1...Akshay Deodhar2-163/+95
2026-01-27[NVPTX][AtomicExpandPass] Complete support for AtomicRMW in NVPTX (#176015)Akshay Deodhar2-95/+163
2026-01-27[TTI] Add VectorInstrContext for context-aware insert/extract costs. (#175982)Florian Hahn1-4/+7
2026-01-26Revert "Reland "[NVPTX] Validate user-specified PTX version against SM versio...Justin Fargnoli3-113/+61
2026-01-26Reland "[NVPTX] Validate user-specified PTX version against SM version" (#177...Justin Fargnoli3-61/+113
2026-01-22[NVPTX] fix illegal name for .extern .shared global variables (#173018)Kjetil Kjeka1-2/+7
2026-01-22[NVPTX] Update the default SM to 7.5 (#176021)Justin Fargnoli1-2/+2
2026-01-21[NFC][NVVM][NVPTX] Moved common code for tcgen05.mma to the base class (#176327)Kirill Vedernikov1-280/+215
2026-01-20[clang][NVPTX] Add missing half-precision add/mul/fma intrinsics (#170079)Srinivasa Ravi3-22/+117
2026-01-19Use AllocaInst::getAllocationSize instead of manual size calculations (#176486)Jameson Nash1-6/+5
2026-01-19[LLVM][NVPTX] Add support for tcgen05.ld.red Instruction (#175919)Pradeep Kumar3-8/+363
2026-01-15[NFC][TargetLowering] Make shouldExpandAtomicRMWInIR and shouldExpandAtomicCm...Akshay Deodhar2-2/+2
2026-01-13[LLVM] Clean up code using [not_]equal_to (NFC) (#175824)Ramkumar Ramachandra2-3/+3
2026-01-13Revert "[NVPTX] Validate user-specified PTX version against SM version" (#175...Mehdi Amini3-108/+61
2026-01-13[CodeGen][InlineSpiller] Add SubReg argument to loadRegFromStackSlot for subr...Christudasan Devadasan1-1/+1
2026-01-12[NVPTX] Validate user-specified PTX version against SM version (#174834)Justin Fargnoli3-61/+108
2026-01-08[NVPTX] Use correct `mul.wide` operand type when matching on `shl` in `combin...Justin Fargnoli1-2/+2
2026-01-08[LLVM][NVPTX] Enable family specific support for a few intrinsics (#173268)Pradeep Kumar3-65/+108
2026-01-01[NVPTX] Add intrinsics and codegen for tensormap.replace (#172458)Srinivasa Ravi3-0/+177
2025-12-28[CodeGen] Fix EVT::changeVectorElementType assertion on simple-to-extended fa...Islam Imad1-5/+3
2025-12-21[MLIR][NVPTX] Add intrinsics and Ops to read smem-sizes (#173089)Durgadoss R1-0/+8
2025-12-19[NVPTX] Add missing type suffixes for barrier.cta.red (#172945)Alex MacLean1-13/+13
2025-12-18[NVPTX] Add support for barrier.cta.red.* instructions (#172541)Alex MacLean1-54/+58
2025-12-17[NVPTX][DagCombiner] Eliminate guards on shift amount because PTX shifts auto...Yonah Goldberg2-7/+75
2025-12-15[clang][NVPTX] Add support for mixed-precision FP arithmetic (#168359)Srinivasa Ravi1-0/+111
2025-12-12[NVPTX] Fixup and refactor brx.idx support (#171933)Alex MacLean6-57/+44
2025-12-10[NFC][TTI] Introduce getInstructionUniformity API for uniformity analysis (#1...Pankaj Dwivedi2-2/+14
2025-12-09[NVPTX] Add IR pass for FMA transformation in the llc pipeline (#154735)Rajat Bajpai5-0/+185
2025-12-09[llvm] Use ConstantInt::getAllOnesValue()Nikita Popov1-4/+5
2025-12-08[NVPTX] Fix lit test issue from used_bytes_mask (#171220)Drew Kersnar2-4/+14
2025-12-02[NFC] Refactor TargetLowering::getTgtMemIntrinsic to take CallBase parameter ...Robert Imschweiler2-4/+5
2025-11-26CodeGen: Make all targets override pseudos with pointers (#159881)Matt Arsenault1-0/+10
2025-11-25[NVPTX] Fix maybe unused variable in 17852ded (#169542)Walter Lee1-1/+1
2025-11-25[NVPTX] Lower LLVM masked vector loads and stores to PTX (#159387)Drew Kersnar14-41/+352
2025-11-25[DAG][X86] Improve custom i256/i512 AVX512 CTLZ/CTTZ Handling with MVT::i256/...Simon Pilgrim1-1/+2
2025-11-24[NVPTX] Use PRMT instruction to lower i16 bswap (#168968)Chengjun2-35/+44
2025-11-21[NVPTX] Support for dense and sparse MMA intrinsics with block scaling. (#163...Kirill Vedernikov1-4/+136
2025-11-21[NVPTX] Fix PTX and SM conditions for narrow FP conversions (#168680)Srinivasa Ravi2-28/+51
2025-11-19CodeGen: Add subtarget to TargetLoweringBase constructor (#168620)Matt Arsenault1-1/+1
2025-11-19[clang][NVPTX] Add remaining float to fp16 conversions (#167641)Srinivasa Ravi2-1/+43