riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author	Files	Lines
2024-10-26	[rtsan][llvm][NFC] Rename sanitize_realtime_unsafe attr to ↵	davidtrevelyan	1	-2/+2
	sanitize_realtime_blocking (#113155) # What This PR renames the newly-introduced llvm attribute `sanitize_realtime_unsafe` to `sanitize_realtime_blocking`. Likewise, sibling variables such as `SanitizeRealtimeUnsafe` are renamed to `SanitizeRealtimeBlocking` respectively. There are no other functional changes. # Why? - There are a number of problems that can cause a function to be real-time "unsafe", - we wish to communicate what problems rtsan detects and why they're unsafe, and - a generic "unsafe" attribute is, in our opinion, too broad a net - which may lead to future implementations that need extra contextual information passed through them in order to communicate meaningful reasons to users. - We want to avoid this situation and make the runtime library boundary API/ABI as simple as possible, and - we believe that restricting the scope of attributes to names like `sanitize_realtime_blocking` is an effective means of doing so. We also feel that the symmetry between `[[clang::blocking]]` and `sanitize_realtime_blocking` is easier to follow as a developer. # Concerns - I'm aware that the LLVM attribute `sanitize_realtime_unsafe` has been part of the tree for a few weeks now (introduced here: https://github.com/llvm/llvm-project/pull/106754). Given that it hasn't been released in version 20 yet, am I correct in considering this to not be a breaking change?
2024-10-25	[IR] Fix undiagnosed cases of structs containing scalable vectors (#113455)	Jay Foad	1	-2/+1
	Type::isScalableTy and StructType::containsScalableVectorType failed to detect some cases of structs containing scalable vectors because containsScalableVectorType did not call back into isScalableTy to check the element types. Fix this, which requires sharing the same Visited set in both functions. Also change the external API so that callers are never required to pass in a Visited set, and normalize the naming to isScalableTy.
2024-10-17	[SimplifyLibCall][Attribute] Fix bug where we may keep `range` attr with ↵	goldsteinn	1	-1/+1
	incompatible type (#112649) In a variety of places we change the bitwidth of a parameter but don't update the attributes. The issue in this case is from the `range` attribute when inlining `__memset_chk`. `optimizeMemSetChk` will replace an `i32` with an `i8`, and if the `i32` had a `range` attr assosiated it will cause an error. Fixes #112633
2024-10-15	[Verifier] Verify attribute `denormal-fp-math[-f32]` (#112310)	Yingwei Zheng	1	-0/+13
	Some typos are also fixed. Address https://github.com/llvm/llvm-project/pull/112067#pullrequestreview-2363722447.
2024-10-07	[RFC] IR: Define noalias.addrspace metadata (#102461)	Matt Arsenault	1	-10/+43
	This is intended to solve a problem with lowering atomics in OpenMP and C++ common to AMDGPU and NVPTX. In OpenCL and CUDA, it is undefined behavior for an atomic instruction to modify an object in thread private memory. In OpenMP, it is defined. Correspondingly, the hardware does not handle this correctly. For AMDGPU, 32-bit atomics work and 64-bit atomics are silently dropped. We therefore need to codegen this by inserting a runtime address space check, performing the private case without atomics, and fallback to issuing the real atomic otherwise. This metadata allows us to avoid this extra check and branch. Handle this by introducing metadata intended to be applied to atomicrmw, indicating they cannot access the forbidden address space.
2024-09-19	[LLVM][rtsan] Add `sanitize_realtime_unsafe` attribute (#106754)	davidtrevelyan	1	-0/+6

2024-09-13	[AMDGPU] Error on non-global pointer with s_prefetch_data (#107624)	Stanislav Mekhanoshin	1	-0/+8

2024-08-30	Revert "[LLVM][rtsan] Add LLVM nosanitize_realtime attribute (#105447)" ↵	Chris Apple	1	-6/+0
	(#106743) This reverts commit 178fc4779ece31392a2cd01472b0279e50b3a199. This attribute was not needed now that we are using the lsan style ScopedDisabler for disabling this sanitizer See #106736 #106125 For more discussion
2024-08-29	[ExtendLifetimes] Implement llvm.fake.use to extend variable lifetimes (#86149)	Stephen Tozer	1	-0/+1
	This patch is part of a set of patches that add an `-fextend-lifetimes` flag to clang, which extends the lifetimes of local variables and parameters for improved debuggability. In addition to that flag, the patch series adds a pragma to selectively disable `-fextend-lifetimes`, and an `-fextend-this-ptr` flag which functions as `-fextend-lifetimes` for this pointers only. All changes and tests in these patches were written by Wolfgang Pieb (@wolfy1961), while Stephen Tozer (@SLTozer) has handled review and merging. The extend lifetimes flag is intended to eventually be set on by `-Og`, as discussed in the RFC here: https://discourse.llvm.org/t/rfc-redefine-og-o1-and-add-a-new-level-of-og/72850 This patch implements a new intrinsic instruction in LLVM, `llvm.fake.use` in IR and `FAKE_USE` in MIR, that takes a single operand and has no effect other than "using" its operand, to ensure that its operand remains live until after the fake use. This patch does not emit fake uses anywhere; the next patch in this sequence causes them to be emitted from the clang frontend, such that for each variable (or this) a fake.use operand is inserted at the end of that variable's scope, using that variable's value. This patch covers everything post-frontend, which is largely just the basic plumbing for a new intrinsic/instruction, along with a few steps to preserve the fake uses through optimizations (such as moving them ahead of a tail call or translating them through SROA). Co-authored-by: Stephen Tozer <stephen.tozer@sony.com>
2024-08-28	Move stepvector intrinsic out of experimental namespace (#98043)	Maciej Gabka	1	-2/+2
	This patch is moving out stepvector intrinsic from the experimental namespace. This intrinsic exists in LLVM for several years now, and is widely used.
2024-08-26	[LLVM][rtsan] Add LLVM nosanitize_realtime attribute (#105447)	Chris Apple	1	-0/+6

2024-08-22	[Verifier] Make lrint and lround intrinsic cases concise. NFC (#105676)	Sumanth Gundapaneni	1	-30/+9

2024-08-20	llvm.lround: Update verifier to validate support of vector types. (#98950)	Sumanth Gundapaneni	1	-2/+16
	Both IRVerifier and Machine Verifier are updated
2024-08-20	[IR] Check that arguments of naked function are not used (#104757)	Nikita Popov	1	-0/+4
	Verify that the arguments of a naked function are not used. They can only be referenced via registers/stack in inline asm, not as IR values. Doing so will result in assertion failures in the backend. There's probably more that we should verify, though I'm not completely sure what the constraints are (would it be correct to require that naked functions are exactly an inline asm call + unreachable, or is more allowed?) Fixes https://github.com/llvm/llvm-project/issues/104718.
2024-08-12	[verifier] Get rid of getResolverFunctionType. NFC (#102631)	Bjorn Pettersson	1	-3/+1
	With opaque pointers we can just get the pointer type for the resolver function by using PointerType::get, making the GlobalIFunc::getResolverFunctionType function obsolete.
2024-08-08	Verifier: Reword range metadata error message (#102441)	Matt Arsenault	1	-2/+4

2024-08-07	[LLVM][NVPTX] Add NVPTX codegen support for fence.proxy.tensormap (#100748)	Pradeep Kumar	1	-0/+8
	This commit adds LLVM Intrinsics and NVPTX codegen support for `fence.proxy.tensormap` with lit tests under fence-proxy-tensormap.ll. Also, added Intrinsics documentation in NVPTXUsage.rst --------- Co-authored-by: gonzalobg <65027571+gonzalobg@users.noreply.github.com>
2024-08-04	[llvm] Use llvm::is_contained (NFC) (#101855)	Kazu Hirata	1	-4/+1

2024-07-18	[IR] Unify max alignment for arguments with generic max align. (#99257)	Eli Friedman	1	-15/+23
	The 2^14 limit was completely arbitrary; the generic limit is still arbitrary, but at least it's the same arbitrary limit as everything else. While I'm here, also add a verifier check for the ByValOrByRefSize.
2024-07-10	[MemProf] Optionally print or record the profiled sizes of allocations (#98248)	Teresa Johnson	1	-3/+7
	This is the first step in being able to track the total profiled sizes of allocations successfully marked as cold. Under a new option -memprof-report-hinted-sizes: - For unambiguous (non-context-sensitive) allocations, print the profiled size and the allocation coldness, along with a hash of the allocation's location (to allow for deduplication across modules or inline instances). - For context sensitive allocations, add the size as a 3rd operand on the MIB metadata. A follow on patch will propagate this through to the thin link where the sizes will be reported for each context after cloning.
2024-07-10	[Clang][ARM][AArch64] Alway emit protection attributes for functions. (#82819)	Daniel Kiss	1	-1/+19
	So far branch protection, sign return address, guarded control stack attributes are only emitted as module flags to indicate the functions need to be generated with those features. The problem is in case of an LTO build the module flags are merged with the `min` rule which means if one of the module is not build with sign return address then the features will be turned off for all functions. Due to the functions take the branch-protection and sign-return-address features from the module flags. The sign-return-address is function level option therefore it is expected functions from files that is compiled with -mbranch-protection=pac-ret to be protected. The inliner might inline functions with different set of flags as it doesn't consider the module flags. This patch adds the attributes to all functions and drops the checking of the module flags for the code generation. Module flag is still used for generating the ELF markers. Also drops the "true"/"false" values from the branch-protection-enforcement, branch-protection-pauth-lr, guarded-control-stack attributes as presence of the attribute means it is on absence means off and no other option. Releand with test fixes.
2024-07-10	Revert "[Clang][ARM][AArch64] Alway emit protection attributes for ↵	Daniel Kiss	1	-19/+1
	functions." (#98284) Reverts llvm/llvm-project#82819
2024-07-10	[Clang][ARM][AArch64] Alway emit protection attributes for functions. (#82819)	Daniel Kiss	1	-1/+19
	So far branch protection, sign return address, guarded control stack attributes are only emitted as module flags to indicate the functions need to be generated with those features. The problem is in case of an LTO build the module flags are merged with the `min` rule which means if one of the module is not build with sign return address then the features will be turned off for all functions. Due to the functions take the branch-protection and sign-return-address features from the module flags. The sign-return-address is function level option therefore it is expected functions from files that is compiled with -mbranch-protection=pac-ret to be protected. The inliner might inline functions with different set of flags as it doesn't consider the module flags. This patch adds the attributes to all functions and drops the checking of the module flags for the code generation. Module flag is still used for generating the ELF markers. Also drops the "true"/"false" values from the branch-protection-enforcement, branch-protection-pauth-lr, guarded-control-stack attributes as presence of the attribute means it is on absence means off and no other option.
2024-07-09	[DebugInfo] Allow DISubrange/DIGenericSubrange without count/upperBound. ↵	Abid Qadeer	1	-11/+0
	(#96474) Due to the current order of metadata in DISubprgram, `Type` is processed before `Unit` by the Verifier. This can cause a race and use of garbage data. Consider the following code: ``` int test(int a[][5]) { return a[0][2]; } ``` when compiled with clang, the control reaches `Verifier::visitDISubrange` first with `CurrentSourceLang` still equal to dwarf::DW_LANG_lo_user (32768). The `Verifier::visitDICompileUnit` which sets the value of `CurrentSourceLang` is reached later. So `Verifier::visitDISubrange` ends up using a wrong value of `CurrentSourceLang`. This behavior does not effect C like language much but is a problem for Fortran. There is special processing in `Verifier::visitDISubrange` when `CurrentSourceLang` is Fortran. With this problem, that special handling is missed and verifier fails for any code that has Fortran's assumed size array in a global subroutine. Various solutions were tried to solve this problem before it was decided that best course of action is to remove these checks from Verifier.
2024-07-04	[IR][LangRef] Add partial reduction add intrinsic (#94499)	Nicholas Guy	1	-0/+14
	Adds the llvm.experimental.partial.reduce.add.* overloaded intrinsic, this intrinsic represents add reductions that result in a narrower vector.
2024-06-24	[llvm][ProfDataUtils] Provide getNumBranchWeights API (#90146)	Paul Kirth	1	-6/+6
	As suggested in https://github.com/llvm/llvm-project/pull/86609/files#r1556689262 an API for getting the number of branch weights directly from the MD node would be useful in a variety of checks, and keeps the logic within ProfDataUtils.
2024-06-21	Add the 'initializes' attribute langref and support (#84803)	Haopeng Liu	1	-0/+9
	We propose adding a new LLVM attribute, `initializes((Lo1,Hi1),(Lo2,Hi2),...)`, which expresses the notion of memory space (i.e., intervals, in bytes) that the argument pointing to is initialized in the function. Will commit the attribute inferring in the follow-up PRs. https://discourse.llvm.org/t/rfc-llvm-new-initialized-parameter-attribute-for-improved-interprocedural-dse/77337
2024-06-12	Reapply "[llvm][IR] Extend BranchWeightMetadata to track provenance o… ↵	Paul Kirth	1	-3/+6
	(#95281) …f weights" #95136 Reverts #95060, and relands #86609, with the unintended code generation changes addressed. This patch implements the changes to LLVM IR discussed in https://discourse.llvm.org/t/rfc-update-branch-weights-metadata-to-allow-tracking-branch-weight-origins/75032 In this patch, we add an optional field to MD_prof meatdata nodes for branch weights, which can be used to distinguish weights added from llvm.expect* intrinsics from those added via other methods, e.g. from profiles or inserted by the compiler. One of the major motivations, is for use with MisExpect diagnostics, which need to know if branch_weight metadata originates from an llvm.expect intrinsic. Without that information, we end up checking branch weights multiple times in the case if ThinLTO + SampleProfiling, leading to some inaccuracy in how we report MisExpect related diagnostics to users. Since we change the format of MD_prof metadata in a fundamental way, we need to update code handling branch weights in a number of places. We also update the lang ref for branch weights to reflect the change.
2024-06-11	Revert "[llvm][IR] Extend BranchWeightMetadata to track provenance of ↵	Paul Kirth	1	-6/+3
	weights" (#95060) Reverts llvm/llvm-project#86609 This change causes compile-time regressions for stage2 builds (https://llvm-compile-time-tracker.com/compare.php?from=3254f31a66263ea9647c9547f1531c3123444fcd&to=c5978f1eb5eeca8610b9dfce1fcbf1f473911cd8&stat=instructions:u). It also introduced unintended changes to `.text` which should be addressed before relanding.
2024-06-10	[llvm][IR] Extend BranchWeightMetadata to track provenance of weights (#86609)	Paul Kirth	1	-3/+6
	This patch implements the changes to LLVM IR discussed in https://discourse.llvm.org/t/rfc-update-branch-weights-metadata-to-allow-tracking-branch-weight-origins/75032 In this patch, we add an optional field to MD_prof metadata nodes for branch weights, which can be used to distinguish weights added from `llvm.expect*` intrinsics from those added via other methods, e.g. from profiles or inserted by the compiler. One of the major motivations, is for use with MisExpect diagnostics, which need to know if branch_weight metadata originates from an llvm.expect intrinsic. Without that information, we end up checking branch weights multiple times in the case if ThinLTO + SampleProfiling, leading to some inaccuracy in how we report MisExpect related diagnostics to users. Since we change the format of MD_prof metadata in a fundamental way, we need to update code handling branch weights in a number of places. We also update the lang ref for branch weights to reflect the change.
2024-06-07	[ARM] r11 is reserved when using -mframe-chain=aapcs (#86951)	Oliver Stannard	1	-1/+1
	When using the -mframe-chain=aapcs or -mframe-chain=aapcs-leaf options, we cannot use r11 as an allocatable register, even if -fomit-frame-pointer is also used. This is so that r11 will always point to a valid frame record, even if we don't create one in every function.
2024-05-28	[IR][AArch64][PAC] Add "ptrauth(...)" Constant to represent signed pointers. ↵	Ahmed Bougacha	1	-0/+23
	(#85738) This defines a new kind of IR Constant that represents a ptrauth signed pointer, as used in AArch64 PAuth. It allows representing most kinds of signed pointer constants used thus far in the llvm ptrauth implementations, notably those used in the Darwin and ELF ABIs being implemented for c/c++. These signed pointer constants are then lowered to ELF/MachO relocations. These can be simply thought of as a constant `llvm.ptrauth.sign`, with the interesting addition of discriminator computation: the `ptrauth` constant can also represent a combined blend, when both address and integer discriminator operands are used. Both operands are otherwise optional, with default values 0/null.
2024-05-08	[DXIL] Set DXIL Version in DXIL target triple based on shader model version ↵	S. Bharadwaj Yadavalli	1	-2/+2
	(#91407) This change set restores commit 080978dd2067d0c9ea7e229aa7696c2480d89ef1 that was reverted to address ASAN failures and includes a fix for the ASAN failures. Following is the description of the change: An earlier commit provided a way to decouple DXIL version from Shader Model version by representing the DXIL version as `SubArch` in the DXIL Target Triple and adding corresponding valid DXIL Arch types. This change constructs DXIL target triple with DXIL version that is deduced from Shader Model version specified in the following scenarios: 1. When compilation target profile is specified: For e.g., DXIL target triple `dxilv1.8-unknown-shader6.8-library` is constructed when `-T lib_6_8` is specified. 2. When DXIL target triple without DXIL version is specified: For e.g., DXIL target triple `dxilv1.8-pc-shadermodel6.8-library` is constructed when `-mtriple=dxil-pc-shadermodel6.8-library` is specified. Updated relevant HLSL tests that check for target triple.
2024-05-07	[NFC][LLVM] Refactor rounding mode detection of constrained fp intrinsic IDs ↵	Paul Walker	1	-16/+13
	(#90854) I've refactored the code to genericise the implementation to better allow for target specific constrained fp intrinsics.
2024-05-06	Revert "[DirectX][DXIL] Set DXIL Version in DXIL target triple based on ↵	S. Bharadwaj Yadavalli	1	-2/+2
	shader model version" (#91290) Reverts llvm/llvm-project#90809 Need to investigate ASAN failures.
2024-05-06	[DirectX][DXIL] Set DXIL Version in DXIL target triple based on shader model ↵	S. Bharadwaj Yadavalli	1	-2/+2
	version (#90809) An earlier commit provided a way to decouple DXIL version from Shader Model version by representing the DXIL version as `SubArch` in the DXIL Target Triple and adding corresponding valid DXIL Arch types. This change constructs DXIL target triple with DXIL version that is deduced from Shader Model version specified in the following scenarios: 1. When compilation target profile is specified: For e.g., DXIL target triple `dxilv1.8-unknown-shader6.8-library` is constructed when `-T lib_6_8` is specified. 2. When DXIL target triple without DXIL version is specified: For e.g., DXIL target triple `dxilv1.8-pc-shadermodel6.8-library` is constructed when `-mtriple=dxil-pc-shadermodel6.8-library` is specified. Updated relevant HLSL tests that check for target triple. Validated that Clang (`check-clang`) and LLVM (`check-llvm`) regression tests pass.
2024-05-02	[NFC] Reduce copies created of ConstantRange when getting ↵	Andreas Jonson	1	-1/+2
	ConstantRangeAttributes (#90335) Think that it can be good to reduce the number of copies created when working with ConstantRangeAttributes.
2024-04-30	[IR] Use StringRef::operator== instead of StringRef::equals (NFC) (#90550)	Kazu Hirata	1	-3/+3
	I'm planning to remove StringRef::equals in favor of StringRef::operator==. - StringRef::operator== outnumbers StringRef::equals by a factor of 22 under llvm/ in terms of their usage. - The elimination of StringRef::equals brings StringRef closer to std::string_view, which has operator== but not equals. - S == "foo" is more readable than S.equals("foo"), especially for !Long.Expression.equals("str") vs Long.Expression != "str".
2024-04-29	Move several vector intrinsics out of experimental namespace (#88748)	Maciej Gabka	1	-1/+1
	This patch is moving out following intrinsics: * vector.interleave2/deinterleave2 * vector.reverse * vector.splice from the experimental namespace. All these intrinsics exist in LLVM for more than a year now, and are widely used, so should not be considered as experimental.
2024-04-25	Remove unneeded LLVM_FALLTHROUGH. NFC	Fangrui Song	1	-1/+0

2024-04-24	[IR] Remove unused variable in Verifier.cpp (NFC)	Jie Fu	1	-7/+0
	llvm-project/llvm/lib/IR/Verifier.cpp:4854:14: error: unused variable 'IsLeaf' [-Werror,-Wunused-variable] const auto IsLeaf = [](const Metadata *CurMD) { ^ 1 error generated.
2024-04-24	[IR] Memory Model Relaxation Annotations (#78569)	Pierre van Houtryve	1	-0/+31
	Implements the core/target-agnostic components of Memory Model Relaxation Annotations. RFC: https://discourse.llvm.org/t/rfc-mmras-memory-model-relaxation-annotations/76361/5
2024-04-22	Revert "[TBAA] Add verifier for tbaa.struct metadata (#86709)"	Florian Hahn	1	-32/+0
	This reverts commit 7dbba39e583a3fd64e7e6b947251c035e483f054. Revert as there are reports this triggers during ThinLTO in some configurations.
2024-04-18	[DWARF] Add support for DW_TAG_template_alias for template aliases (#88943)	Orlando Cazalet-Hyams	1	-1/+2
	Part 1 of fix for issue https://github.com/llvm/llvm-project/issues/54624 Split from PR #87623. Clang front end changes to follow. Use DICompositeType to represent the template alias, using its extraData field as a tuple of DITemplateParameter to describe the template parameters. Added template-alias.ll - Check DWARF emission. Modified frame-types.s - Check llvm-symbolizer understands the DIE.
2024-04-16	Reapply "[Verifier] Reject va_start in non-variadic function (#88809)"	Jon Chesterfield	1	-0/+5
	This reverts commit f4960da6023b8034ae68925c3223d51624621b37. Includes a fix for the MLIR test case.
2024-04-16	Revert "[Verifier] Reject va_start in non-variadic function (#88809)"	Jon Chesterfield	1	-5/+0
	This reverts commit 61717c1aa1f08eb57839a21fb2d9004739022e0d. Failed a MLIR test
2024-04-16	[Verifier] Reject va_start in non-variadic function (#88809)	Jon Chesterfield	1	-0/+5
	A va_start intrinsic lowers to something derived from the variadic parameter to the function. If there is no such parameter, it can't lower meaningfully. Clang sema rejects the same with `error: 'va_start' used in function with fixed args`. Moves the existing lint warning into a verifier error. Updates the one lit test that had a va_start in a non-variadic function.
2024-04-15	Reapply "[TBAA] Add verifier for tbaa.struct metadata (#86709)"	Julian Nagele	1	-0/+32
	This reverts commit b9cd48f96acdd07c627ccafbf4386a1f3dcd6c51. ------------------------------------------------------------- Original commit message: Adds logic to the IR verifier that checks whether !tbaa.struct nodes are well-formed. That is, it checks that the operands of !tbaa.struct nodes are in groups of three, that each group of three operands consists of two integers and a valid tbaa node, and that the regions described by the offset and size operands are non-overlapping. PR: https://github.com/llvm/llvm-project/pull/86709
2024-04-12	IRVerifier: Allow GlobalValue as llvm.threadlocal.address operand (#88321)	Matthias Braun	1	-4/+4
	Loosen `llvm.threadlocal.address` verifier checks to allow any `GlobalValue` with `isThreadLocal()` set to true.
2024-04-08	Verify threadlocal_address constraints (#87841)	Matthias Braun	1	-0/+8
	Check invariants for `llvm.threadlocal.address` intrinsic in IR Verifier.