riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author	Files	Lines
2025-09-10	[ExpandVectorPredication] Keep the original value name when expanding ↵	Craig Topper	1	-16/+17
	predicated instructions. (#157943) This makes it easier to follow a value through the pass. If we pass the original name to the create function, a number will be added as a suffix since the original name is still used until it is replaced.
2025-09-10	[ExpandVectorPredication] Expand vp_merge and vp_select in ↵	Craig Topper	1	-0/+8
	expandPredication. (#157777)
2025-09-09	[ExpandVectorPredication] Add vp.select to foldEVLIntoMask. (#157720)	Craig Topper	1	-2/+4

2025-09-05	[ExpandVectorPredication] Change function returns from Value* to bool. NFC ↵	Craig Topper	1	-55/+46
	(#157199) None of the callers do anything with the pointer other than check for null or equality. We can use bool to accomplish the same thing.
2025-09-05	[ExpandVectorPredication] Support vp.merge in foldEVLIntoMask. (#157195)	Craig Topper	1	-1/+10
	Partial fix for #157184. It still crashes later in SelectionDAG.
2025-08-19	[ExpandVectorPredication] Use IRBuilder::CreateNUWMul instead of passing ↵	Craig Topper	1	-2/+1
	flags to CreateMul. NFC
2025-06-10	[NFC][LLVM] Refactor IRBuilder::Create{VScale,ElementCount,TypeSize}. (#142803)	Paul Walker	1	-2/+1
	CreateVScale took a scaling parameter that had a single use outside of IRBuilder with all other callers having to create a redundant ConstantInt. To work round this some code perferred to use CreateIntrinsic directly. This patch simplifies CreateVScale to return a call to the llvm.vscale() intrinsic and nothing more. As well as simplifying the existing call sites I've also migrated the uses of CreateIntrinsic. Whilst IRBuilder used CreateVScale's scaling parameter as part of the implementations of CreateElementCount and CreateTypeSize, I have follow-on work to switch them to the NUW varaiety and thus they would stop using CreateVScale's scaling as well. To prepare for this I have moved the multiplication and constant folding into the implementations of CreateElementCount and CreateTypeSize. As a final step I have replaced some callers of CreateVScale with CreateElementCount where it's clear from the code they wanted the latter.
2025-03-20	[Intrinsics] Add Intrinsic::getFnAttributes() (NFC) (#132029)	Nikita Popov	1	-2/+2
	Most places that call Intrinsic::getAttributes() are only interested in the function attributes, so add a separate function for that. The motivation for this is that I'd like to add the ability to specify range attributes on intrinsics, which requires knowing the function type. This avoids needing to know the type for most attribute queries.
2025-01-16	[VP] Remove createStepVector implementation and use ↵	LiqinWeng	1	-23/+1
	IRBuilderBase::CreateStepVector instead. NFC (#122868)
2025-01-16	[VP] IR expansion to Int Func Call (#122867)	LiqinWeng	1	-0/+9
	Add basic handling for VP ops that can expand to Int intrinsics, which includes: ctpop/cttz/ctlz/sadd.sat/uadd.sat/ssub.sat/usub.sat/fshl/fshr
2024-11-25	[VP] Refactoring some functions in ExpandVectorPredication.NFC (#115840)	LiqinWeng	1	-90/+26
	Building vp intrinsic functions using a unified interface for expandPredicationToIntCall/expandPredicationToFPCall/expandPredicationToCastIntrinsic functions.
2024-11-12	[CodeGen] Remove unused includes (NFC) (#115996)	Kazu Hirata	1	-1/+0
	Identified with misc-include-cleaner.
2024-10-17	[LLVM] Make more use of IRBuilder::CreateIntrinsic. NFC. (#112706)	Jay Foad	1	-20/+17
	Convert many instances of: Fn = Intrinsic::getOrInsertDeclaration(...); CreateCall(Fn, ...) to the equivalent CreateIntrinsic call.
2024-10-11	[NFC] Rename `Intrinsic::getDeclaration` to `getOrInsertDeclaration` (#111752)	Rahul Joshi	1	-7/+7
	Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).
2024-09-04	Consolidate all IR logic for getting the identity value of a reduction [nfc]	Philip Reames	1	-46/+5
	This change merges the three different places (at the IR layer) for finding the identity value of a reduction into a single copy. This depends on several prior commits which fix ommissions and bugs in the distinct copies, but this patch itself should be fully non-functional. As the new comments and naming try to make clear, the identity value is a property of the @llvm.vector.reduce.* intrinsic, not of e.g. the recurrence descriptor. (We still provide an interface for clients using recurrence descriptors, but the implementation simply translates to the intrinsic which each corresponds to.) As a note, the getIntrinsicIdentity API does not support fminnum/fmaxnum or fminimum/fmaximum which is why we still need manual logic (but at least only one copy of manual logic) for those cases.
2024-09-03	Prefer use of 0.0 over -0.0 for fadd reductions w/nsz (in IR) (#106770)	Philip Reames	1	-1/+3
	This is a follow up to 924907bc6, and is mostly motivated by consistency but does include one additional optimization. In general, we prefer 0.0 over -0.0 as the identity value for an fadd. We use that value in several places, but don't in others. So, let's be consistent and use the same identity (when nsz allows) everywhere. This creates a bunch of test churn, but due to 924907bc6, most of that churn doesn't actually indicate a change in codegen. The exception is that this change enables the use of 0.0 for nsz, but not reasoc, fadd reductions. Or said differently, it allows the neutral value of an ordered fadd reduction to be 0.0.
2024-08-30	[VP] Reduce duplicate code in vp.reduce expansions	Philip Reames	1	-50/+15
	Primary goal is having one way of doing this, to ensure that we don't end up with accidental divergence.
2024-08-09	[ExpandVectorPredication] Be more precise reporting changes (#102313)	Roger Ferrer Ibáñez	1	-19/+32
	This is used by PreISelIntrinsicLowering. With this change, PreISelIntrinsicLowering does not have to assume that there were changes just because we encountered a VP intrinsic.
2024-08-06	[VP] Merge ExpandVP pass into PreISelIntrinsicLowering (#101652)	Alexis Engelke	1	-103/+35
	Similar to #97727; avoid an extra pass over the entire IR by performing the lowering as part of the pre-isel-intrinsic-lowering pass.
2024-06-28	[IR] Add getDataLayout() helpers to Function and GlobalValue (#96919)	Nikita Popov	1	-1/+1
	Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, replacing the current `getParent()->getDataLayout()` pattern.
2024-06-24	Revert "[IR][NFC] Update IRBuilder to use InsertPosition (#96497)"	Stephen Tozer	1	-1/+1
	Reverts the above commit, as it updates a common header function and did not update all callsites: https://lab.llvm.org/buildbot/#/builders/29/builds/382 This reverts commit 6481dc57612671ebe77fe9c34214fba94e1b3b27.
2024-06-24	[IR][NFC] Update IRBuilder to use InsertPosition (#96497)	Stephen Tozer	1	-1/+1
	Uses the new InsertPosition class (added in #94226) to simplify some of the IRBuilder interface, and removes the need to pass a BasicBlock alongside a BasicBlock::iterator, using the fact that we can now get the parent basic block from the iterator even if it points to the sentinel. This patch removes the BasicBlock argument from each constructor or call to setInsertPoint. This has no functional effect, but later on as we look to remove the `Instruction *InsertBefore` argument from instruction-creation (discussed [here](https://discourse.llvm.org/t/psa-instruction-constructors-changing-to-iterator-only-insertion/77845)), this will simplify the process by allowing us to deprecate the InsertPosition constructor directly and catch all the cases where we use instructions rather than iterators.
2024-06-21	Revert "Intrinsic: introduce minimumnum and maximumnum (#93841)"	Nikita Popov	1	-17/+1
	As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 89881480030f48f83af668175b70a9798edca2fb. This reverts commit 225d8fc8eb24fb797154c1ef6dcbe5ba033142da.
2024-06-21	Intrinsic: introduce minimumnum and maximumnum (#93841)	YunQiang Su	1	-1/+17
	Currently, on different platform, the behaivor of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AArch64/PowerPC: follow the IEEE754-2008's minNUM: return qNaN. RISC-V/Hexagon follow the IEEE754-2019's minimumNumber: return NUM. X86: Returns NUM but not same with IEEE754-2019's minimumNumber as +0.0 is not always greater than -0.0. MIPS/LoongArch/Generic: return NUM. LIBCALL: returns qNaN. So, let's introduce llvm.minmumnum/llvm.maximumnum, which always follow IEEE754-2019's minimumNumber/maximumNumber. Half-fix: #93033
2024-05-10	[VP][RISCV] Add vp.reduce.fmaximum/fminimum and its RISC-V codegen (#91782)	Min-Yih Hsu	1	-3/+21
	`vp.reduce.fmaximum/fminimum` are the VP version of `vector.reduce.fmaximum/fminimum`.
2024-04-17	[VP] Correct lowering of predicated fma and faddmul to avoid strictfp. (#85272)	Kevin P. Neal	1	-2/+10
	Correct missing cases in a switch that result in @llvm.vp.fma.v4f32 getting lowered to a constrained fma intrinsic. Vector predicated lowering to contrained intrinsics is not supported currently, and there's no consensus on the path forward. We certainly shouldn't be introducing constrained intrinsics into a function that isn't strictfp. Problem found with D146845.
2024-01-22	[VP][RISCV] Introduce llvm.vp.minimum/maximum intrinsics (#74840)	Simeon K	1	-0/+2
	Although there are predicated versions of minnum/maxnum, the ones for minimum/maximum are currently missing. This patch introduces these intrinsics and implements their lowering to RISC-V.
2023-10-26	[VP] Check if VP ops with functional intrinsics are speculatable (#69504)	Luke Lau	1	-3/+6
	Noticed whilst working on #69494. VP intrinsics whose functional equivalent is an intrinsic were being marked as their lanes being non-speculatable, even if the underlying intrinsic was speculatable. This meant that ```llvm %1 = call <4 x i32> @llvm.vp.umax(<4 x i32> %x, <4 x i32> %y, <4 x i1> %mask, i32 %evl) ``` would be expanded out to ```llvm %.splatinsert = insertelement <4 x i32> poison, i32 %evl, i64 0 %.splat = shufflevector <4 x i32> %.splatinsert, <4 x i32> poison, <4 x i32> zeroinitializer %1 = icmp ult <4 x i32> <i32 0, i32 1, i32 2, i32 3>, %.splat %2 = and <4 x i1> %1, %mask %3 = call <4 x i32> @llvm.umax.v4i32(<4 x i32> %x, <4 x i32> %y) ``` instead of ```llvm %1 = call <4 x i32> @llvm.umax.v4i32(<4 x i32> %x, <4 x i32> %y) ``` The cause of this was isSafeToSpeculativelyExecuteWithOpcode checking the function attributes for the VP instruction itself, not the functional intrinsic. Since isSafeToSpeculativelyExecuteWithOpcode expects an already materialized instruction, we can't use it directly for the intrinsic case. So this fixes it by manually checking the function attributes on the intrinsic.
2023-10-09	[VP] IR expansion for bitreverse/bswap (#68504)	LiqinWeng	1	-1/+12

2023-10-08	[VP] Use the interface of 'getFunctionalIntrinsicID' to get the non-p… ↵	LiqinWeng	1	-10/+5
	(#68508) …redicated Intrinsic ID
2023-09-12	[VP] IR expansion for maxnum/minnum	liqin.weng	1	-0/+14
	Add basic handling for VP ops that can expand to non-predicate ops Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D159494
2023-09-11	[VP] IR expansion for zext/sext/trunc/fptosi/fptosi/sitofp/uitofp/fptrunc/fpext	liqin.weng	1	-11/+46
	Add basic handling for VP ops that can expand to Cast intrinsics Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D159491
2023-09-11	[VP] IR expansion for abs/smax/smin/umax/umin	liqin.weng	1	-0/+34
	Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D159495
2023-09-09	[VP] IR expansion for inttoptr/ptrtoint	liqin.weng	1	-0/+29
	Add basic handling for VP ops that can expand to cast intrinsics Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D159478
2023-04-24	[VP] Add IR expansion for fneg	Simon Pilgrim	1	-0/+5
	Followup to D149052, it wasn't worthwhile to add general support for unary opcodes
2023-04-24	[VP] IR expansion for fabs/fsqrt/fma/fmadd	Simon Pilgrim	1	-0/+44
	Add basic handling for VP ops that can expand to FP intrinsics Fixes #60464 Differential Revision: https://reviews.llvm.org/D149052
2022-12-16	[Transforms,CodeGen] std::optional::value => operator*/operator->	Fangrui Song	1	-2/+2
	value() has undesired exception checking semantics and calls __throw_bad_optional_access in libc++. Moreover, the API is unavailable without _LIBCPP_NO_EXCEPTIONS on older Mach-O platforms (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS).
2022-12-04	Instructions: convert Optional to std::optional	Krzysztof Parzyszek	1	-1/+2

2022-10-05	[VP] Fix unused variable in release configurations	Fraser Cormack	1	-2/+2

2022-10-05	[VP][NFC] Remove \brief commands from doxygen comments	Fraser Cormack	1	-10/+8
	Following a precedent set in D46861.
2022-10-05	[VP] Add IR expansion for vp.icmp and vp.fcmp	Fraser Cormack	1	-0/+25
	These intrinsics are simply expanded to regular icmp/fcmp instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121594
2022-08-08	[llvm] LLVM_FALLTHROUGH => [[fallthrough]]. NFC	Fangrui Song	1	-1/+1
	With C++17 there is no Clang pedantic warning or MSVC C5051.
2022-07-19	Use value instead of getValue (NFC)	Kazu Hirata	1	-2/+2

2022-07-19	Use has_value instead of hasValue (NFC)	Kazu Hirata	1	-2/+2

2022-07-19	[CodeGen] Use value_or instead of getValueOr (NFC)	Kazu Hirata	1	-3/+3

2022-07-18	[VP] IR expansion pass for VP gather and scatter	Lorenzo Albano	1	-0/+20
	Add vp_gather and vp_scatter expansion to unpredicated intrinsics. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D120664
2022-07-18	[VP] Implementing expansion pass for VP load and store.	Lorenzo Albano	1	-2/+65
	Added function to the ExpandVectorPredication pass to handle VP loads and stores. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D109584
2022-07-06	[ValueTracking] Accept Instruction in isSafeToSpeculativelyExecute() (NFC)	Nikita Popov	1	-2/+1
	As constant expressions can no longer trap, it only makes sense to call isSafeToSpeculativelyExecute on Instructions, so limit the API to accept only them, rather than general Operators or Values.
2022-06-18	[llvm] Use value_or instead of getValueOr (NFC)	Kazu Hirata	1	-1/+1

2022-05-30	Re-land "[VP] vp intrinsics are not speculatable" with test fix	Simon Moll	1	-5/+5
	Update the llvmir-intrinsics.mlir test to account for the modified attribute sets. This reverts commit 2e2a8a2d9082250e4aad312c6008a526f2b007c7.