rocket-tools/riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author	Files	Lines
2023-12-11	[GlobalISel] Add G_PREFETCH (#74863)	Jay Foad	1	-0/+40

2023-11-30	MachineVerifier: Reject extra non-register operands on instructions (#73758)	Matt Arsenault	1	-0/+30
	We were allowing extra immediate arguments, and only bothering to check if registers were implicit or not. Also consolidate extra operand checks in verifier, to make this testable. We had 3 different places checking if you were trying to build an instruction with more operands than allowed by the definition. We had an assertion in addOperand, a direct check in the MIRParser to avoid the assertion, and the machine verifier checks. Remove the assert and parser check so the verifier can provide a consistent verification experience, which will also handle instructions modified in place.
2023-11-07	[MachineVerifier] Fix COPY check in MachineVerifier for scalable vectors	Michael Maitland	1	-2/+41
	This change fixes #71518, which compared the KnownMinValue of the scalable virtual register with the FixedSize of the physical register in the wrong direction. It turns out that we cannot include this check at all since it will lead to a false failures. Test cases are added to show that the false failures no longer occur after this fix.
2023-11-07	[CodeGen][MachineVerifier] Use TypeSize instead of unsigned for getRe… ↵	Michael Maitland	1	-0/+23
	(#70881) …gSizeInBits This patch changes getRegSizeInBits to return a TypeSize instead of an unsigned in the case that a virtual register has a scalable LLT. In the case that register is physical, a Fixed TypeSize is returned. The MachineVerifier pass is updated to allow copies between fixed and scalable operands as long as the Src size will fit into the Dest size. This is a precommit which will be stacked on by a change to GISel to generate COPYs with a scalable destination but a fixed size source. This patch is stacked on https://github.com/llvm/llvm-project/pull/70893 for the ability to use scalable vector types in MIR tests.
2023-10-30	[GISel] Restrict G_BSWAP to multiples of 16 bits. (#70245)	Craig Topper	1	-0/+19
	This is consistent with the IR verifier and SelectionDAG's getNode. Update tests accordingly. I tried to keep some coverage of non-pow2 when possible. X86 didn't like a G_UNMERGE_VALUES from s48 to 3 s16 that got created when I tried s48.
2023-09-29	[test] -march -> -mtriple (#67741)	Visoiu Mistrih Francis	2	-2/+2
	Similar to 806761a
2023-09-29	[test] Change llc -march=aarch64\|arm64 to -mtriple=aarch64\|arm64	Fangrui Song	41	-41/+41
	Similar to commit 806761a7629df268c8aed49657aeccffa6bca449 to avoid issues due to object file format differences. These tests are currently benign.
2023-09-01	MachineVerifier: Add tests which are incorrectly accepted	Matt Arsenault	3	-0/+64
	In the process of splitting out the liveness tracking, I ran into these cases which should have been caught. There are still missing errors for some cases in the entry block. https://reviews.llvm.org/D127104
2023-08-21	[AArch64] Update generic sched model to A510	Harvin Iriawan	1	-1/+1
	Refresh of the generic scheduling model to use A510 instead of A55. Main benefits are to the little core, and introducing SVE scheduling information. Changes tested on various OoO cores, no performance degradation is seen. Differential Revision: https://reviews.llvm.org/D156799
2023-04-18	[X86] Fix checks for illegal physreg COPY instructions	Jay Foad	1	-3/+11
	D105263 changed this test to not expect a MachineVerifier error on this instruction: ; FP16 reg is sub_reg of xmm %0:_(s16) = COPY $xmm0 D107082 changed the behaviour back again so that this instruction did cause an error, but the test was not updated to expect the error. Differential Revision: https://reviews.llvm.org/D148534
2023-02-14	[PowerPC][GISel] add support for fpconstant	Chen Zheng	1	-0/+40
	Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D133340
2022-12-07	[GlobalISel] Add a new G_INVOKE_REGION_START instruction to fix an EH bug.	Amara Emerson	1	-0/+22
	We currently have a bug where the legalizer, when dealing with phi operands, may create instructions in the phi's incoming blocks at points which are effectively dead due to a possible exception throw. Say we have: throwbb: EH_LABEL x0 = %callarg1 BL @may_throw_call EH_LABEL B returnbb bb: %v = phi i1 %true, throwbb, %false.... When legalizing we may need to widen the i1 %true value, and to do that we need to create new extension instructions in the incoming block. Our insertion point currently is the MBB::getFirstTerminator() which puts the IP before the unconditional branch terminator in throwbb. These extensions may never be executed if the call throws, and therefore we need to emit them before the call (but not too early, since our new instruction may need values defined within throwbb as well). throwbb: EH_LABEL x0 = %callarg1 BL @may_throw_call EH_LABEL %true = G_CONSTANT i32 1 ; <<<-- ruh'roh, this never executes if may_throw_call() throws! B returnbb bb: %v = phi i32 %true, throwbb, %false.... To fix this, I've added two new instructions. The main idea is that G_INVOKE_REGION_START is a terminator, which tries to model the fact that in the IR, the original invoke inst is actually a terminator as well. By using that as the new insertion point, we make sure to place new instructions on always executing paths. Unfortunately we still need to make the legalizer use a new insertion point API that I've added, since the existing `getFirstTerminator()` method does a reverse walk up the block, and any non-terminator instructions cause it to bail out. To avoid impacting compile time for all `getFirstTerminator()` uses, I've added a new method that does a forward walk instead. Differential Revision: https://reviews.llvm.org/D137905
2022-11-17	[GlobalISel] Better verification of G_UNMERGE_VALUES	Jay Foad	1	-0/+33
	Verify three cases of G_UNMERGE_VALUES separately: 1. Splitting a vector into subvectors (the converse of G_CONCAT_VECTORS). 2. Splitting a vector into its elements (the converse of G_BUILD_VECTOR). 3. Splitting a scalar into smaller scalars (the converse of G_MERGE_VALUES). Previously #1 allowed strange combinations like this: %1:_(<2 x s16>),%2:_(<2 x s16>) = G_UNMERGE_VALUES %0(<2 x s32>) This has been tightened up to check that the source and destination element types match, and some MIR test cases updated accordingly. Differential Revision: https://reviews.llvm.org/D111132
2022-11-15	[GlobalISel] Remove semantic operand of G_IS_FPCLASS	Serge Pavlov	1	-10/+7
	Instruction G_IS_FPCLASS had an operand that represented floating-point semantics of its first operand. It allowed types that have the same length, like `bfloat16` and `half`, to be distinguished. Unfortunately, it is not sufficient, as other operation still cannot distinguish such types. Solution of this problem must be more general, so now this operand is removed. Differential Revision: https://reviews.llvm.org/D138004
2022-09-22	[GlobalISel] Enforce G_ASSERT_ALIGN to have a valid alignment > 0.	Amara Emerson	1	-0/+17

2022-09-22	MachineVerifier: Verify REG_SEQUENCE	Matt Arsenault	1	-0/+59
	Somehow there was no verification of this, other than an ad-hoc assertion in TwoAddressInstructions.
2022-09-05	[MachineVerifier] Fix crash on early clobbered subreg operands.	Daniil Fukalov	1	-0/+31
	MachineVerifier tried to checkLivenessAtDef() ignoring it is actually a subreg. The issue was with processing two subregs of the same reg are used in the same instruction (e.g. inline asm): "def early-clobber" and other just "def". Reviewed By: foad Differential Revision: https://reviews.llvm.org/D126661
2022-08-19	[MachineVerifier] add checks for INLINEASM_BR	Nick Desaulniers	1	-0/+178
	Test for a case we observed after the initial implementation of D129997 landed, in which case we observed a crash while building the ppc64le Linux kernel. In that case, we had one block with two exits, both to the same successor. Removing one of the exits corrupted the successor/predecessor lists. So when we have an INLINEASM_BR, check a few things for each indirect target: 1. that it exists. 2. that it is listed in our successors. 3. that its predecessor list contains the parent MBB of INLINEASM_BR. This would have caught the regression discovered after D129997 landed, after the pass that was problematic (early-tailduplication) rather than getting a stack trace in a later pass (regalloc) that doesn't understand the anomaly and crashes. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D130290
2022-05-27	[GlobalISel] Add G_IS_FPCLASS	Serge Pavlov	1	-0/+40
	Add a generic opcode to represent `llvm.is_fpclass` intrinsic. Differential Revision: https://reviews.llvm.org/D121454
2022-04-22	GlobalISel: Relax handling of G_ASSERT_* with source register classes	Matt Arsenault	2	-16/+18
	The most common situation where G_ASSERT_ZEXT appears for AMDGPU is a copy from a physical register, which happens to use set the actual register class on the virtual register. After copy coalescing, the assert's source operand had a vreg with a set class. The verifier was strictly rejecting cases where the set class/bank weren't an exact match. Additionally, RegBankSelect was also expecting a register bank to be set on the register, not a class. This is much stricter than regular copies so relax this behavior. This now allows these 2 cases: 1. Source register has either class or bank, and the result does not 2. Source register has a register class, and the result is a register with a matching bank. This should avoid needing some kind of special handling to avoid violating this constraint when folding copies.
2022-04-11	GlobalISel: Verify atomic load/store ordering restriction	Matt Arsenault	2	-0/+12
	Reject acquire stores and release loads. This matches the restriction imposed by the LLParser and IR verifier.
2022-04-05	MachineVerifier: Diagnose undef set on full register defs	Matt Arsenault	1	-0/+16
	An undef def of a full register would assert in LiveIntervalCalc.
2021-12-21	[GlobalISel] Verify operand types for G_SHL, G_LSHR, G_ASHR	Jay Foad	2	-1/+22
	Differential Revision: https://reviews.llvm.org/D115868
2021-12-05	[GlobalISel] Allow DBG_VALUE to use undefined vregs before LiveDebugValues.	Jack Andersen	2	-0/+78
	Expanding on D109750. Since `DBG_VALUE` instructions have final register validity determined in `LDVImpl::handleDebugValue`, there is no apparent reason to immediately prune unused register operands as their defs are erased. Consequently, this renders `MachineInstr::eraseFromParentAndMarkDBGValuesForRemoval` moot; gaining a substantial performance improvement. The only necessary changes involve making relevant passes consider invalid DBG_VALUE vregs uses as valid. Reviewed By: MatzeB Differential Revision: https://reviews.llvm.org/D112852
2021-09-02	Revert @llvm.isnan intrinsic patchset.	Roman Lebedev	1	-33/+0
	Please refer to https://lists.llvm.org/pipermail/llvm-dev/2021-September/152440.html (and that whole thread.) TLDR: the original patch had no prior RFC, yet it had some changes that really need a proper RFC discussion. It won't be productive to discuss such an RFC, once it's actually posted, while said patch is already committed, because that introduces bias towards already-committed stuff, and the tree is potentially in broken state meanwhile. While the end result of discussion may lead back to the current design, it may also not lead to the current design. Therefore i take it upon myself to revert the tree back to last known good state. This reverts commit 4c4093e6e39fe6601f9c95a95a6bc242ef648cd5. This reverts commit 0a2b1ba33ae6dcaedb81417f7c4cc714f72a5968. This reverts commit d9873711cb03ac7aedcaadcba42f82c66e962e6e. This reverts commit 791006fb8c6fff4f33c33cb513a96b1d3f94c767. This reverts commit c22b64ef66f7518abb6f022fcdfd86d16c764caf. This reverts commit 72ebcd3198327da12804305bda13d9b7088772a8. This reverts commit 5fa6039a5fc1b6392a3c9a3326a76604e0cb1001. This reverts commit 9efda541bfbd145de90f7db38d935db6246dc45a. This reverts commit 94d3ff09cfa8d7aecf480e54da9a5334e262e76b.
2021-08-20	[GlobalISel] Add G_LLROUND	Jessica Paquette	2	-2/+25
	Basically the same as G_LROUND. Handles the llvm.llround family of intrinsics. Also add a helper function to the MachineVerifier for checking if all of the (virtual register) operands of an instruction are scalars. Seems like a useful thing to have. Differential Revision: https://reviews.llvm.org/D108429
2021-08-19	[GlobalISel] Add a G_LROUND instruction	Jessica Paquette	1	-0/+23
	Meant to represent the `@llvm.lround.*` family. Add the opcode, docs, and verification. Differential Revision: https://reviews.llvm.org/D108417
2021-08-19	[AArch64][GlobalISel] Add G_VECREDUCE fewerElements support for full ↵	Amara Emerson	1	-2/+0
	scalarization. For some reductions like G_VECREDUCE_OR on AArch64, we need to scalarize completely if the source is <= 64b. This change adds support for that in the legalizer. If the source has a pow-2 num elements, then we can do a tree reduction using the scalar operation in the individual elements. Otherwise, we just create a sequential chain of operations. For AArch64, we only need to scalarize if the input is <64b. If it's great than 64b then we can first do a fewElements step to 64b, taking advantage of vector instructions until we reach the point of scalarization. I also had to relax the verifier checks for reductions because the intrinsics support <1 x EltTy> types, which we lower to scalars for GlobalISel. Differential Revision: https://reviews.llvm.org/D108276
2021-08-18	[GlobalISel] Add G_ISNAN	Jessica Paquette	1	-0/+33
	Add a generic opcode equivalent to the `llvm.isnan` intrinsic + MachineVerifier support for it. We need an opcode here because we may want target-specific lowering later on. Differential Revision: https://reviews.llvm.org/D108222
2021-08-10	[X86] AVX512FP16 instructions enabling 1/6	Wang, Pengfei	1	-2/+2
	1. Enable FP16 type support and basic declarations used by following patches. 2. Enable new instructions VMOVW and VMOVSH. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105263
2021-07-21	[MachineVerifier] Make INSERT_SUBREG diagnostic respect operand 2 subregs	Jon Roelofs	1	-1/+6
	This came out of post-commit review: https://reviews.llvm.org/D105953#inline-1012919 Thanks uabelho!
2021-07-20	[MachineVerifier] Diagnose invalid INSERT_SUBREGs	Jon Roelofs	1	-0/+31
	Differential revision: https://reviews.llvm.org/D105953
2021-07-16	Revert "[MachineVerifier] Diagnose invalid INSERT_SUBREGs"	Jon Roelofs	1	-29/+0
	This reverts commit dd57ba1a17b93dbe211d04cb2d4de5f6dc898d60. It broke some tests: http://45.33.8.238/linux/51314/step_12.txt
2021-07-16	[MachineVerifier] Diagnose invalid INSERT_SUBREGs	Jon Roelofs	1	-0/+29
	Differential revision: https://reviews.llvm.org/D105953
2021-06-30	CodeGen: Print/parse LLTs in MachineMemOperands	Matt Arsenault	9	-34/+34
	This will currently accept the old number of bytes syntax, and convert it to a scalar. This should be removed in the near future (I think I converted all of the tests already, but likely missed a few). Not sure what the exact syntax and policy should be. We can continue printing the number of bytes for non-generic instructions to avoid test churn and only allow non-scalar types for generic instructions. This will currently print the LLT in parentheses, but accept parsing the existing integers and implicitly converting to scalar. The parentheses are a bit ugly, but the parser logic seems unable to deal without either parentheses or some keyword to indicate the start of a type.
2021-06-30	[GISel] Support llvm.memcpy.inline	Jon Roelofs	4	-1/+112
	Differential revision: https://reviews.llvm.org/D105072
2021-04-28	GlobalISel: Relax verification of physical register copy types	Matt Arsenault	1	-0/+54
	This was picking a concrete size for a physical register, and enforcing exact match on the virtual register's type size. Some targets add multiple types to a register class, and some are smaller than the full bit width. For example x86 adds f32 to 128-bit xmm registers, and AMDGPU adds i16/f16 to 32-bit registers. It might be better to represent these cases as a copy of the full register and an extraction of the subpart, but a lot of code assumes you can directly copy. This will help fix the current usage of the DAG calling convention infrastructure which is incompatible with how GlobalISel is now using it. The API is somewhat cumbersome here, but I just mirrored the existing functions, except now with LLTs (and allow returning null on failure, unlike the MVT version). I think the concept of selecting register classes based on type is flawed to begin with, but I'm trying to keep this compatible with the existing handling.
2021-03-25	[GlobalISel] Add G_ROTR and G_ROTL opcodes for rotates.	Amara Emerson	1	-0/+13
	Differential Revision: https://reviews.llvm.org/D99383
2021-03-25	[AArch64][GlobalISel] Emit bzero on Darwin	Jessica Paquette	1	-0/+33
	Darwin platforms for both AArch64 and X86 can provide optimized `bzero()` routines. In this case, it may be preferable to use `bzero` in place of a memset of 0. This adds a G_BZERO generic opcode, similar to G_MEMSET et al. This opcode can be generated by platforms which may want to use bzero. To emit the G_BZERO, this adds a pre-legalize combine for AArch64. The conditions for this are largely a port of the bzero case in `AArch64SelectionDAGInfo::EmitTargetCodeForMemset`. The only difference in comparison to the SelectionDAG code is that, when compiling for minsize, this will fire for all memsets of 0. The original code notes that it's not beneficial to do this for small memsets; however, using bzero here will save a mov from wzr. For minsize, I think that it's preferable to prioritise omitting the mov. This also fixes a bug in the libcall legalization code which would delete instructions which could not be legalized. It also adds a check to make sure that we actually get a libcall name. Code size improvements (Darwin): - CTMark -Os: -0.0% geomean (-0.1% on pairlocalalign) - CTMark -Oz: -0.2% geomean (-0.5% on bullet) Differential Revision: https://reviews.llvm.org/D99358
2021-03-24	Add missing -march to runline in llvm/test/MachineVerifier/test_g_ubfx_sbfx.mir	Jessica Paquette	1	-1/+1
	This can cause failures if built without a default target triple.
2021-03-19	[GlobalISel] Add G_SBFX + G_UBFX (bitfield extraction opcodes)	Jessica Paquette	1	-0/+15
	There is a bunch of similar bitfield extraction code throughout *ISelDAGToDAG. E.g, ARMISelDAGToDAG, AArch64ISelDAGToDAG, and AMDGPUISelDAGToDAG all contain code that matches a bitfield extract from an and + right shift. Rather than duplicating code in the same way, this adds two opcodes: - G_UBFX (unsigned bitfield extract) - G_SBFX (signed bitfield extract) They work like this ``` %x = G_UBFX %y, %lsb, %width ``` Where `lsb` and `width` are - The least-significant bit of the extraction - The width of the extraction This will extract `width` bits from `%y`, starting at `lsb`. G_UBFX zero-extends the result, while G_SBFX sign-extends the result. This should allow us to use the combiner to match the bitfield extraction patterns rather than duplicating pattern-matching code in each target. Differential Revision: https://reviews.llvm.org/D98464
2021-03-01	GlobalISel: Verify G_CONCAT_VECTORS has at least 2 sources	Matt Arsenault	1	-16/+12

2021-02-17	[GlobalISel] Add G_ASSERT_SEXT	Jessica Paquette	2	-0/+77
	This adds a G_ASSERT_SEXT opcode, similar to G_ASSERT_ZEXT. This instruction signifies that an operation was already sign extended from a smaller type. This is useful for functions with sign-extended parameters. E.g. ``` define void @foo(i16 signext %x) { ... } ``` This adds verifier, regbankselect, and instruction selection support for G_ASSERT_SEXT equivalent to G_ASSERT_ZEXT. Differential Revision: https://reviews.llvm.org/D96890
2021-02-12	[GlobalISel] Simpler verification of G_SEXT_INREG and G_ASSERT_ZEXT	Jay Foad	2	-6/+2
	There's no need to call verifyVectorElementMatch since we already know that the source and destination types are identical. Differential Revision: https://reviews.llvm.org/D96589
2021-01-28	[GlobalISel] Add G_ASSERT_ZEXT	Jessica Paquette	2	-0/+79
	This adds a generic opcode which communicates that a type has already been zero-extended from a narrower type. This is intended to be similar to AssertZext in SelectionDAG. For example, ``` %x_was_extended:_(s64) = G_ASSERT_ZEXT %x, 16 ``` Signifies that the top 48 bits of %x are known to be 0. This is useful in cases like this: ``` define i1 @zeroext_param(i8 zeroext %x) { %cmp = icmp ult i8 %x, -20 ret i1 %cmp } ``` In AArch64, `%x` must use a 32-bit register, which is then truncated to a 8-bit value. If we know that `%x` is already zero-ed out in the relevant high bits, we can avoid the truncate. Currently, in GISel, this looks like this: ``` _zeroext_param: and w8, w0, #0xff ; We don't actually need this! cmp w8, #236 cset w0, lo ret ``` While SDAG does not produce the truncation, since it knows that it's unnecessary: ``` _zeroext_param: cmp w0, #236 cset w0, lo ret ``` This patch - Adds G_ASSERT_ZEXT - Adds MIRBuilder support for it - Adds MachineVerifier support for it - Documents it It also puts G_ASSERT_ZEXT into its own class of "hint instruction." (There should be a G_ASSERT_SEXT in the future, maybe a G_ASSERT_ALIGN as well.) This allows us to skip over hints in the legalizer etc. These can then later be selected like COPY instructions or removed. Differential Revision: https://reviews.llvm.org/D95564
2021-01-13	[Verifier] Add tied-ness verification to statepoint intsruction	Serguei Katkov	1	-0/+30
	Reviewers: reames, dantrushin Reviewed By: reames, dantrushin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D94483
2020-10-15	[GlobalISel] Remove scalar src from non-sequential fadd/fmul reductions.	Amara Emerson	1	-2/+2
	It's probably better to split these into separate G_FADD/G_FMUL + G_VECREDUCE operations in the translator rather than carrying the scalar around. The majority of the time it'll get simplified away as the scalars are probably identity values. Differential Revision: https://reviews.llvm.org/D89150
2020-10-08	[GlobalISel] Add G_VECREDUCE_* opcodes for vector reductions.	Amara Emerson	1	-0/+35
	These mirror the IR and SelectionDAG intrinsics & nodes. Opcodes added: G_VECREDUCE_SEQ_FADD G_VECREDUCE_SEQ_FMUL G_VECREDUCE_FADD G_VECREDUCE_FMUL G_VECREDUCE_FMAX G_VECREDUCE_FMIN G_VECREDUCE_ADD G_VECREDUCE_MUL G_VECREDUCE_AND G_VECREDUCE_OR G_VECREDUCE_XOR G_VECREDUCE_SMAX G_VECREDUCE_SMIN G_VECREDUCE_UMAX G_VECREDUCE_UMIN Differential Revision: https://reviews.llvm.org/D88750
2020-08-27	[Test] Tidy up loose ends from LLVM_HAS_GLOBAL_ISEL	Russell Gallop	25	-25/+25
	This hasn't been allowed as a build option since r309990 Remove leftover REQUIRES: global-isel Differential Revision: https://reviews.llvm.org/D86714
2020-08-27	Fix for PS4 bots after 0b7f6cc71a72a85f8a0cbee836a7a8e31876951a	Russell Gallop	2	-0/+2