riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author	Files	Lines
2024-07-16	[ValueTracking][X86] Compute KnownBits for phadd/phsub (#92429)	mskamp	1	-0/+64
	Add KnownBits computations to ValueTracking and X86 DAG lowering. These instructions add/subtract adjacent vector elements in their operands. Example: phadd [X1, X2] [Y1, Y2] = [X1 + X2, Y1 + Y2]. This means that, in this example, we can compute the KnownBits of the operation by computing the KnownBits of [X1, X2] + [X1, X2] and [Y1, Y2] + [Y1, Y2] and intersecting the results. This approach also generalizes to all x86 vector types. There are also the operations phadd.sw and phsub.sw, which perform saturating addition/subtraction. Use sadd_sat and ssub_sat to compute the KnownBits of these operations. Also adjust the existing test case pr53247.ll because it can be transformed to a constant using the new KnownBits computation. Fixes #82516.
2024-07-16	Revert "[SLP]Correctly detect minnum/maxnum patterns for select/cmp ↵	Alexey Bataev	1	-5/+4
	operations on floats." This reverts commit c7aac38c29f564bc48f7cfb71d3b3b8b482c873b to fix crashes reavealed by the buildbot in https://lab.llvm.org/buildbot/#/builders/168/builds/1104.
2024-07-16	[SLP]Correctly detect minnum/maxnum patterns for select/cmp operations on ↵	Alexey Bataev	1	-4/+5
	floats. The patch enables detection of minnum/maxnum patterns for float point instruction, represented as select/cmp. Also, enables better cost estimation for integer min/max patterns since the compiler starts to estimate the scalars separately. Reviewers: nikic, RKSimon Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/98570
2024-07-05	[IR] Add Constant::toConstantRange() (NFC)	Nikita Popov	1	-40/+2
	The logic in llvm::getVectorConstantRange() can be a bit inconvenient to use in some cases because of the need to handle the scalar case separately. Generalize it to handle all constants, and move it to live directly on Constant.
2024-07-04	[ValueTracking][X86] computeKnownBitsFromOperator - add PMULH/PMULHU ↵	Simon Pilgrim	1	-0/+14
	intrinsics mulhs/mulhu known bits handling. These map directly to the KnownBits implementations.
2024-07-04	[IR] Don't strip through pointer to vector of pointer bitcasts	Nikita Popov	1	-2/+3
	When using stripPointerCasts() and getUnderlyingObject(), don't strip through a bitcast from ptr to <1 x ptr>, which is not a no-op pointer cast. Calling code is generally not prepared to handle that situation, resulting in incorrect alias analysis results for example. Fixes https://github.com/llvm/llvm-project/issues/97600.
2024-07-03	[ValueTracking][LVI] Consolidate vector constant range calculation	Nikita Popov	1	-11/+40
	Add a common helper used for computeConstantRange() and LVI. The implementation is a mix of both, with the efficient handling for ConstantDataVector taken from computeConstantRange(), and the general handling (including non-splat poison) from LVI.
2024-07-03	[ValueTracking] Extend LHS/RHS with matching operand to work without constants.	Noah Goldstein	1	-17/+35
	Previously we only handled the `L0 == R0` case if both `L1` and `R1` where constant. We can get more out of the analysis using general constant ranges instead. For example, `X u> Y` implies `X != 0`. In general, any strict comparison on `X` implies that `X` is not equal to the boundary value for the sign and constant ranges with/without sign bits can be useful in deducing implications. Closes #85557
2024-07-01	[InstCombine] Sync KnownBits logic for select arms	Nikita Popov	1	-30/+36
	Extract an adjustKnownBitsForSelectArm() helper for the ValueTracking logic and make use of it in SimplifyDemandedBits(). This fixes a consistency violation under instcombine-verify-known-bits.
2024-07-01	[InstCombine] Simplify select using KnownBits of condition (#95923)	Nikita Popov	1	-0/+4
	Simplify the arms of a select based on the KnownBits implied by its condition. For now this only handles the case where the select arm folds to a constant, but this can be generalized to handle other patterns by using SimplifyDemandedBits instead (in that case we would also have to limit to non-undef conditions). This is implemented by adding a new member to SimplifyQuery that can be used to inject an additional condition. The affected values are pre-computed and we don't call computeKnownBits() if the select arms don't contain affected values. This reduces the cost in some pathological cases.
2024-06-30	ValueTracking: Simplify intrinsic ID asserts	Matt Arsenault	1	-2/+2

2024-06-28	[IR] Add getDataLayout() helpers to Function and GlobalValue (#96919)	Nikita Popov	1	-1/+1
	Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, replacing the current `getParent()->getDataLayout()` pattern.
2024-06-27	[SLPVectorizer] Support SLPVectorizer cases of tan across all backends (#95517)	Farzon Lotfi	1	-0/+4
	This PR is intended to address the limited SLPVectorizer support of tan raised in the comments of this PR: https://github.com/llvm/llvm-project/pull/94559. Right now emitting the tan intrinsisic allows you to vectorize tan, but emitting the libfunc does not. to address this the libcall needs to be mapped to the intrinsic. and the libcall and function name need to be marked approriately so they can be optimized or defined as a call lowering.
2024-06-27	[IR] Add getDataLayout() helpers to BasicBlock and Instruction (#96902)	Nikita Popov	1	-2/+2
	This is a helper to avoid writing `getModule()->getDataLayout()`. I regularly try to use this method only to remember it doesn't exist... `getModule()->getDataLayout()` is also a common (the most common?) reason why code has to include the Module.h header.
2024-06-26	[ValueTracking][RISCV] Use ConstantRange::getUnsignedMax instead of getUpper ↵	Craig Topper	1	-3/+1
	to simplify some code. (#96816) This avoids the need to subtract 1 and explain why.
2024-06-24	[InstSimplify] Provide information about the range of possible values that ↵	Poseydon42	1	-0/+4
	`ucmp`/`scmp` can return (#96410) This makes it possible to fold dumb comparisons like `ucmp(x, y) == 7`.
2024-06-20	[ValueTracking] Support gep nuw in isKnownNonZero()	Nikita Popov	1	-2/+5
	gep nuw can be null if and only if both the base pointer and offset are null. Unlike the inbounds case this does not depend on whether the null pointer is valid. Proofs: https://alive2.llvm.org/ce/z/PLoqK5
2024-06-13	[ValueTracking][NFC] move isKnownInversion to ValueTracking (#95321)	Zain Jaffal	1	-0/+22
	I am using `isKnownInversion` in the following pr https://github.com/llvm/llvm-project/pull/94915 it is useful to have the method in a shared class so I can reuse it. I am not sure if `ValueTracking` is the correct place but it looks like most of the methods with the pattern `isKnownX` belong there.
2024-06-07	[KnownBits] Remove `hasConflict()` assertions (#94568)	c8ef	1	-3/+0
	Allow KnownBits to represent "always poison" values via conflict. close: #94436
2024-06-06	[ValueTracking] Make undef element check more precise	Nikita Popov	1	-4/+7
	If we're only checking for undef, then also only look for undef elements in the vector (rather than undef and poison).
2024-05-28	[IR][AArch64][PAC] Add "ptrauth(...)" Constant to represent signed pointers. ↵	Ahmed Bougacha	1	-0/+4
	(#85738) This defines a new kind of IR Constant that represents a ptrauth signed pointer, as used in AArch64 PAuth. It allows representing most kinds of signed pointer constants used thus far in the llvm ptrauth implementations, notably those used in the Darwin and ELF ABIs being implemented for c/c++. These signed pointer constants are then lowered to ELF/MachO relocations. These can be simply thought of as a constant `llvm.ptrauth.sign`, with the interesting addition of discriminator computation: the `ptrauth` constant can also represent a combined blend, when both address and integer discriminator operands are used. Both operands are otherwise optional, with default values 0/null.
2024-05-20	[ValueTracking] Recognize `X op (X != 0)` as non-zero	Noah Goldstein	1	-0/+29
	The ops supported are: `add`, `sub`, `xor`, `or`, `umax`, `uadd.sat` Proofs: https://alive2.llvm.org/ce/z/8ZMSRg The `add` case actually comes up in SPECInt, the rest are here mostly for completeness. Closes #88579
2024-05-20	[ValueTracking] Fix incorrect inferrence about the signbit of sqrt (#92510)	Yingwei Zheng	1	-4/+1
	According to IEEE Std 754-2019, `sqrt` returns nan when the input is negative (except for -0). In this case, we cannot make assumptions about sign bit of the result. Fixes https://github.com/llvm/llvm-project/issues/92217
2024-05-19	ValueTracking: Correct undef handling for constant FP vectors (#92557)	Matt Arsenault	1	-1/+1
	Treat undef as unknown, and poison as ignorable.
2024-05-16	[ValueTracking] Compute knownbits from `(icmp upred (add/sub nuw X, Y), C)`	Noah Goldstein	1	-18/+29
	`(icmp ule/ult (add nuw X, Y), C)` implies both `(icmp ule/ult X, C)` and `(icmp ule/ult Y, C)`. We can use this to deduce leading zeros in `X`/`Y`. `(icmp uge/ugt (sub nuw X, Y), C)` implies `(icmp uge/uge X, C)` . We can use this to deduce leading ones in `X`. Proofs: https://alive2.llvm.org/ce/z/sc5k22 Closes #87180
2024-05-16	[ValueTracking] Fix assertion failure when `computeKnownFPClass` returns ↵	Yingwei Zheng	1	-0/+4
	fcNone (#92355) Fixes https://github.com/llvm/llvm-project/pull/92084#issuecomment-2114083188.
2024-05-14	Reland "[ValueTracking] Compute knownbits from known fp classes" (#92084)	Yingwei Zheng	1	-0/+35
	This patch relands https://github.com/llvm/llvm-project/pull/86409. I mistakenly thought that `Known.makeNegative()` clears the sign bit of `Known.Zero`. This patch fixes the assertion failure by explicitly clearing the sign bit.
2024-05-14	Revert "[ValueTracking] Compute knownbits from known fp classes (#86409)"	Martin Storsjö	1	-36/+0
	This reverts commit d03a1a6e5838c7c2c0836d71507dfdf7840ade49. This change caused failed assertions, see https://github.com/llvm/llvm-project/pull/86409#issuecomment-2109469845 for details.
2024-05-14	[ValueTracking] Compute knownbits from known fp classes (#86409)	Yingwei Zheng	1	-0/+36
	This patch calculates knownbits from fp instructions/dominating fcmp conditions. It will enable more optimizations with signbit idioms.
2024-05-07	[ValueTracking] Recognize `LShr(UINT_MAX, Y) + 1` as a power-of-two (#91171)	Monad	1	-0/+5
	There is a missed optimization in ``` llvm define i8 @known_power_of_two_rust_next_power_of_two(i8 %x, i8 %y) { %2 = add i8 %x, -1 %3 = tail call i8 @llvm.ctlz.i8(i8 %2, i1 true) %4 = lshr i8 -1, %3 %5 = add i8 %4, 1 %6 = icmp ugt i8 %x, 1 %p = select i1 %6, i8 %5, i8 1 %r = urem i8 %y, %p ret i8 %r } ``` which is extracted from the Rust code ``` rust fn func(x: usize, y: usize) -> usize { let z = x.next_power_of_two(); y % z } ``` Here `%p` (a.k.a `z`) is semantically a power-of-two, so `y urem p` can be optimized to `y & (p - 1)`. (Alive2 proof: https://alive2.llvm.org/ce/z/H3zooY) --- It could be generalized to recognizing `LShr(UINT_MAX, Y) + 1` as a power-of-two, which is what this PR does. Alive2 proof: https://alive2.llvm.org/ce/z/zUPTbc
2024-05-03	[AggressiveInstCombine] Inline strcmp/strncmp (#89371)	Franklin Zhang	1	-0/+7
	Inline calls to strcmp(s1, s2) and strncmp(s1, s2, N), where N and exactly one of s1 and s2 are constant. For example: ```c int res = strcmp(s, "ab"); ``` is converted to ```c int res = (int)s[0] - (int)'a'; if (res != 0) goto END; res = (int)s[1] - (int)'b'; if (res != 0) goto END; res = (int)s[2] - (int)'\0'; END: ``` Ported from a similar gcc feature [Inline strcmp with small constant strings](https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78809).
2024-05-01	[UndefOrPoison] [CompileTime] Avoid IDom walk unless required. NFC (#90092)	annamthomas	1	-21/+25
	If the value is not boolean and we are checking for `Undef` or `UndefOrPoison`, we can avoid the potentially expensive IDom walk. This should improve compile time for isGuaranteedNotToBeUndefOrPoison and isGuaranteedNotToBeUndef.
2024-04-29	[InstCombine] Infer nuw on mul nsw with non-negative operands (#90170)	Nikita Popov	1	-1/+7
	If a mul nsw has non-negative operands, it's also nuw. Proof: https://alive2.llvm.org/ce/z/2Dz9Uu Fixes https://github.com/llvm/llvm-project/issues/90020.
2024-04-24	[ValueTracking] Add support for `trunc nuw/nsw` in isKnowNonZero	Noah Goldstein	1	-0/+7
	With `nsw`/`nuw`, the `trunc` is non-zero if its operand is non-zero. Proofs: https://alive2.llvm.org/ce/z/iujmk6 Closes #89643
2024-04-24	[InstCombine] Fix miscompile in negation of select (#89698)	Nikita Popov	1	-7/+17
	Swapping the operands of a select is not valid if one hand is more poisonous that the other, because the negation zero contains poison elements. Fix this by adding an extra parameter to isKnownNegation() to forbid poison elements. I've implemented this using manual checks to avoid needing four variants for the NeedsNSW/AllowPoison combinations. Maybe there is a better way to do this... Fixes https://github.com/llvm/llvm-project/issues/89669.
2024-04-21	[ValueTracking] Combine variable declaration with its only assignment. NFC ↵	Craig Topper	1	-2/+1
	(#89526)
2024-04-19	GlobalsModRef, ValueTracking: Look through threadlocal.address intrinsic ↵	Matthias Braun	1	-0/+4
	(#88418) This improves handling of `threadlocal.address` intrinsic in analyses: The thread-id cannot change within a function with the exception of suspend points of pre-split coroutines. This changes `llvm::getUnderlyingObject` to look through `threadlocal.address` in these cases. `GlobalsAAResult::AnalyzeUsesOfPointer` checks whether an address can be traced to simple loads/stores or escapes to other places. Starting the analysis from a thread-local `GlobalValue` the `threadlocal.address` intrinsic is safe to skip here. This improves issue #87437
2024-04-18	[IR][PatternMatch] Only accept poison in getSplatValue() (#89159)	Nikita Popov	1	-2/+2
	In #88217 a large set of matchers was changed to only accept poison values in splats, but not undef values. This is because we now use poison for non-demanded vector elements, and allowing undef can cause correctness issues. This patch covers the remaining matchers by changing the AllowUndef parameter of getSplatValue() to AllowPoison instead. We also carry out corresponding renames in matchers. As a followup, we may want to change the default for things like m_APInt to m_APIntAllowPoison (as this is much less risky when only allowing poison), but this change doesn't do that. There is one caveat here: We have a single place (X86FixupVectorConstants) which does require handling of vector splats with undefs. This is because this works on backend constant pool entries, which currently still use undef instead of poison for non-demanded elements (because SDAG as a whole does not have an explicit poison representation). As it's just the single use, I've open-coded a getSplatValueAllowUndef() helper there, to discourage use in any other places.
2024-04-18	[IR] Drop poison-generating return attributes when necessary (#89138)	Andreas Jonson	1	-1/+1
	Rename has/dropPoisonGeneratingFlagsOrMetadata to has/dropPoisonGeneratingAnnotations and make it also handle nonnull, align and range return attributes on calls, similar to the existing handling for !nonnull, !align and !range metadata.
2024-04-16	[ValueTracking] Implement `computeKnownFPClass` for ↵	Noah Goldstein	1	-0/+13
	`llvm.vector.reduce.{fmin,fmax,fmaximum,fminimum}` Closes #88408
2024-04-16	[ValueTracking] Restore isKnownNonZero parameter order. (#88873)	Harald van Dijk	1	-53/+53
	Prior to #85863, the required parameters of llvm::isKnownNonZero were Value and DataLayout. After, they are Value, Depth, and SimplifyQuery, where SimplifyQuery is implicitly constructible from DataLayout. The change to move Depth before SimplifyQuery needed callers to be updated unnecessarily, and as commented in #85863, we actually want Depth to be after SimplifyQuery anyway so that it can be defaulted and the caller does not need to specify it.
2024-04-15	ValueTracking: Treat poison more aggressively in computeKnownFPClass (#87990)	Matt Arsenault	1	-0/+6
	Assume no valid values, and the sign bit is 0.
2024-04-15	[ValueTracking] Don't accept undef in isKnownNonZero()	Nikita Popov	1	-2/+2
	As the undef can be replaced with a zero value, this is not legal in the general case. We can only allow poison values. This matches what the other ValueTracking helpers like computeKnownBits() do.
2024-04-14	[ValueTracking] Implement `isKnownNonZero` for `llvm.vector.reduce.or`	Noah Goldstein	1	-1/+2
	Closes #88320
2024-04-14	[ValueTracking] Implement `computeKnownBits` for `llvm.vector.reduce.xor`	Noah Goldstein	1	-0/+15

2024-04-14	[ValueTracking] Implement `computeKnownBits` for `llvm.vector.reduce.{or,and}`	Noah Goldstein	1	-2/+4

2024-04-12	[ValueTracking] Convert `isKnownNonZero` to use SimplifyQuery (#85863)	Yingwei Zheng	1	-13/+4
	This patch converts `isKnownNonZero` to use SimplifyQuery. Then we can use the context information from `DomCondCache`. Fixes https://github.com/llvm/llvm-project/issues/85823. Alive2: https://alive2.llvm.org/ce/z/QUvHVj
2024-04-12	[NFC] Replace m_Sub(m_Zero(), X) with m_Neg(X) (#88461)	AtariDreams	1	-2/+2

2024-04-11	[ValueTracking] compute knownbits from `(icmp upred X (and/or X, Y))`; NFC	Noah Goldstein	1	-12/+36
	`(icmp uge/ugt (and X, Y), C)` implies both `(icmp uge/ugt X, C)` and `(icmp uge/ugt Y, C)`. We can use this to deduce leading ones in `X`. `(icmp ule/ult (or X, Y), C)` implies both `(icmp ule/ult X, C)` and `(icmp ule/ult Y, C)`. We can use this to deduce leading zeros in `X`. Closes #86059
2024-04-10	[ValueTracking] Add support for `xor`/`disjoint or` in `isKnownNonZero`	Noah Goldstein	1	-13/+27
	Handles cases like `X ^ Y == X` / `X disjoint\| Y == X`. Both of these cases have identical logic to the existing `add` case, so just converting the `add` code to a more general helper. Proofs: https://alive2.llvm.org/ce/z/Htm7pe Closes #87706