aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Target/AMDGPU
AgeCommit message (Expand)AuthorFilesLines
2026-01-13[AMDGPU] Put back ProperlyAlighedRC helper functionsusers/cdevadas/putback-ProperlyAlighedRCChristudasan Devadasan2-0/+27
2026-01-13suggestions incorporated.users/cdevadas/make-AMDGPURewriteAGPRCopyMFMA-subreg-reload-awareChristudasan Devadasan1-2/+2
2026-01-13[AMDGPU] Make AMDGPURewriteAGPRCopyMFMA aware of subreg reloadChristudasan Devadasan1-1/+41
2026-01-13fixed a comment.users/cdevadas/make-getNumSubRegsForSpillOp-member-functionChristudasan Devadasan1-1/+1
2026-01-13moved the implementation to SIInstrInfo.Christudasan Devadasan4-151/+150
2026-01-13[AMDGPU] Make getNumSubRegsForSpillOp externally available (NFC).Christudasan Devadasan2-3/+5
2026-01-13incorporated review comments.users/cdevadas/add-spill-offset-to-sgpr-spill-pseudosChristudasan Devadasan1-3/+3
2026-01-13[AMDGPU] Introduce Offset field in SGPR spill PseudosChristudasan Devadasan3-6/+8
2026-01-13[CodeGen][InlineSpiller] Add SubReg argument to loadRegFromStackSlot for subr...Christudasan Devadasan2-1/+2
2026-01-13[AMDGPU]Add specific instruction feature for multicast load (#175503)Shoreshen3-3/+16
2026-01-12[AMDGPU] Add DS loop waitcnt optimization for GFX12+ (#172728)hidekisaito1-61/+164
2026-01-12[AMDGPU]: Rewrite mbcnt_lo/mbcnt_hi to work item ID where applicable (#160496)Teja Alaghari1-0/+116
2026-01-12[AMDGPU] Fix crash in SIInsertWaitcnts debug output (#175518)Jay Foad1-32/+28
2026-01-12[AMDGPU] Inline two helpers in SIInsertWaitcnts. NFC. (#174557)Jay Foad1-22/+9
2026-01-12[AMDGPU][SIInsertWaitcnt] Implement Waitcnt Expansion for Profiling (#169345)Pankaj Dwivedi3-92/+242
2026-01-11[TargetLowering] Change the `softPromoteHalfType` default to `true` (#175149)Trevor Gross1-0/+2
2026-01-09[AMDGPU] Fix AMDGPULowerKernelAttributes not invalidating analysis (#175256)Joseph Huber1-2/+4
2026-01-09[AMDGPU] Handle `s_setreg_imm32_b32` targeting `MODE` register (#174681)Shilei Tian3-6/+121
2026-01-09[AMDGPU][GlobalISel] Add RegBankLegalize support for G_FLDEXP (#175072)vangthao951-0/+8
2026-01-09[AMDGPU][GlobalISel] Add RegBankLegalize support for G_PTRMASK (#174889)vangthao951-0/+8
2026-01-09[AMDGPU][GlobalISel] Add RegBankLegalize support for G_FCANONICALIZE (#174662)vangthao951-0/+12
2026-01-09[AMDGPU] Add liverange split instructions into BB Prolog (#117544)Christudasan Devadasan2-7/+29
2026-01-09[AMDGPU] Add support for GFX12 expert scheduling mode 2 (#170319)Jay Foad9-39/+373
2026-01-08[AMDGPU] Optimize block count calculations to the new ABI (#174112)Joseph Huber1-0/+46
2026-01-08[AMDGPU][GlobalISel] Add RegBankLegalize support for G_UMULH, G_SMULH (#174555)vangthao951-0/+6
2026-01-08[NFC][AMDGPU] Reorganize features for gfx125x (#175054)Shilei Tian1-2/+3
2026-01-08[AMDGPU] Fix a potential use-after-erase in `AMDGPUPromoteAlloca` pass (#174529)Shilei Tian1-3/+12
2026-01-08[AMDGPU] Simplify class VINTERPe. NFC. (#175006)Jay Foad1-9/+9
2026-01-08AMDGPU: Teach lowering that sqrt never returns subnormal (#174838)Matt Arsenault2-0/+5
2026-01-07[AMDGPU] Improve llvm.amdgcn.wave.shuffle handling for pre-GFX8 (#174845)saxlungs2-0/+7
2026-01-07[AMDGPU] Reimplement VOP3 CMPX encoding fixup using PostEncoderMethod. NFCI. ...Jay Foad3-17/+24
2026-01-07[AMDGPU] Remove some redundant SubtargetPredicate settings. NFC. (#174788)Jay Foad1-4/+0
2026-01-07[NFCI][AMDGPU] Update Mode register mask for gfx1250 (#174771)Shilei Tian1-5/+5
2026-01-07Reapply "[AMDGPU] Rework the clamp support for WMMA instructions" (#174674) (...Shilei Tian2-13/+15
2026-01-07Revert "[AMDGPU] Rework the clamp support for WMMA instructions" (#174674)dyung2-16/+14
2026-01-06[AMDGPU] Rework the clamp support for WMMA instructions (#174310)Shilei Tian2-14/+16
2026-01-06Reland "[AMDGPU] Add new llvm.amdgcn.wave.shuffle intrinsic (#167372)" (#174614)saxlungs5-1/+223
2026-01-06Revert "AMDGPU: Do not infer implicit inputs for !nocallback intrinsics" (#17...theRonShark1-23/+4
2026-01-06AMDGPU: clang-format AMDGPULowerKernelAttributes (#174567)Matt Arsenault1-23/+22
2026-01-05Revert "[AMDGPU] Add new llvm.amdgcn.wave.shuffle intrinsic" (#174501)Joe Nash5-223/+1
2026-01-05[AMDGPU] Add new llvm.amdgcn.wave.shuffle intrinsic (#167372)saxlungs5-1/+223
2026-01-05[NFC][AMDGPU] Declare variables initialized with getDebugLoc as const ref (#1...LU-JOHN10-21/+21
2026-01-05[AMDGPU] Generate more efficient code to avoid shift64 hazard (#171871)LU-JOHN1-11/+25
2026-01-05[AMDGPU] Make WaitcntBrackets::simplifyWaitcnt const again (#173390)Jay Foad1-47/+61
2026-01-05[AMDGPU] Simplify and document waitcnt handling on call and return (#172453)Jay Foad1-25/+5
2026-01-05Revert "[AMDGPU] Fix excessive stack usage in SIInsertWaitcnts::run (#134835)...Jay Foad1-8/+3
2026-01-04Revert "[AMDGPU] add clamp immediate operand to WMMA iu8 intrinsic (#171069)"...Shilei Tian1-3/+1
2026-01-02[LLVM][ADT] Migrate users of `make_scope_exit` to CTAD (#174030)Victor Chernyakin1-1/+1
2026-01-02[NFC][AMDGPU] Add comments for literal arguments of WMMA profiles (#174175)Shilei Tian1-56/+77
2026-01-01[AMDGPU][NPM] Disable few non useful passes (#172796)Vikram Hegde1-2/+2