rocket-tools/riscv-gnu-toolchain/llvm.git

Age	Commit message	Author	Files	Lines
2024-06-27	[libc] inline fast path of callonce (#96226)	Schrodinger ZHU Yifan	4	-17/+58
	Split from #91572 --------- Co-authored-by: Nick Desaulniers (paternity leave) <nickdesaulniers@users.noreply.github.com>
2024-06-27	[libc][math] Implement double precision sincos correctly rounded to all rounding modes. (#96719)	lntue	13	-1/+459
	Sharing the same algorithm as double precision sin: https://github.com/llvm/llvm-project/pull/95736 and cos: https://github.com/llvm/llvm-project/pull/96591
2024-06-26	[libc] added newhdrgen python script and class file (#96671)	aaryanshukla	2	-0/+211
	The Python script uses YAML and classes to generate C headers. header.py is only the main class file; the remaining files will be added in follow-up PRs.
2024-06-26	[libc] Remove atomic alignment diagnostics globally (#96803)	Joseph Huber	2	-12/+0
	Summary: These warnings mean that it will lower to a libcall. Previously we just disabled it locally, which didn't work with GCC. This patch does it globally in the compiler options if the compiler is clang.
2024-06-26	[libc] NVPTX Profiling (#92009)	jameshu15869	17	-11/+644
	PR for adding microbenchmarking infrastructure for NVPTX. `nvlink` cannot perform LTO, so we cannot inline `libc` functions and this function call overhead is not adjusted for during microbenchmarking.
2024-06-26	[libc] added newhdrgen class implementation (#96710)	RoseZhang03	6	-0/+123
	Added a class representation of a libc header file, allowing for easier conversion from YAML to .h file output. Classes include:
	- Function (representing function headers)
	- Include (representing various include statements found in a header file)
	- Macro (representing macro definitions)
	- Enumeration (representing enum definitions)
	- Type (representing include statements for NamedTypes)
	- Object (representing ObjectSpec definitions)
2024-06-26	[libc] add proxy header for struct_sigaction (#96224)	Schrodinger ZHU Yifan	8	-15/+41

2024-06-26	[libc][fcntl] Simplify the handling of the return value from syscall … (#96325)	Xu Zhang	3	-7/+24
	Fixes #95570
2024-06-26	[libc][docs] List `rand` and `srand` as supported on the GPU (#96757)	Joseph Huber	2	-1/+3
	Summary: I initially didn't report these as supported because they didn't provide expected behavior and were very wasteful. The recent patch moved them to a lock-free atomic implementation so they can now actually be used.
2024-06-26	[libc] Fix Fuscia builder failing on atomic warnings (#96791)	Joseph Huber	2	-0/+12
	Summary: This function uses atomics now, which emit warnings on some platforms that don't support full lock-free atomics. These aren't specifically wrong, and in the future we could investigate a libc configuration specialized for single-threaded microprocessors, but for now we should get the bot running again.
2024-06-26	[libc] Make 'rand()' thread-safe using atomics instead of TLS (#96692)	Joseph Huber	6	-42/+25
	Summary: Currently, we implement the `rand` function using thread-local storage. This is somewhat problematic because not every target supports TLS, and even more do not support non-zero initializers on TLS. The C standard states that the `rand()` function need not be thread-safe. However, many implementations provide thread-safety anyway. There's some confusing language in the 'rationale' section of https://pubs.opengroup.org/onlinepubs/9699919799/functions/rand.html, but given that `glibc` uses a lock, I think we should make this thread-safe as well. It mentions that threaded behavior is desirable and can be done in two ways:
	1. A single per-process sequence of pseudo-random numbers that is shared by all threads that call rand()
	2. A different sequence of pseudo-random numbers for each thread that calls rand()
	The current implementation is (2.) and this patch moves it to (1.). This is beneficial for the GPU case and more generic support. The downside is that it's slightly slower to do these atomic operations; the fast path will be two atomic reads and an atomic write.
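To illustrate option (1.), a single per-process sequence shared by all threads, here is a minimal sketch of a lock-free generator built on one atomic state word. It is not the code from the patch: the namespace `sketch`, the names `rand_state`, `seed` and `next_rand`, and the LCG constants are all assumptions made for illustration, and the compare-exchange loop is just one possible way to update shared state without a lock.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

namespace sketch {

// Hypothetical shared generator state: one sequence for the whole process.
static std::atomic<uint64_t> rand_state{1};

// Reset the shared sequence (plays the role of srand in this sketch).
inline void seed(unsigned value) {
  rand_state.store(value, std::memory_order_relaxed);
}

// Advance the shared state without a lock and return 31 pseudo-random bits.
inline int next_rand() {
  uint64_t old_state = rand_state.load(std::memory_order_relaxed);
  uint64_t new_state;
  do {
    // One 64-bit LCG step; the constants are illustrative, not libc's.
    new_state = old_state * 6364136223846793005ULL + 1442695040888963407ULL;
    // On failure, compare_exchange_weak reloads old_state and we retry.
  } while (!rand_state.compare_exchange_weak(old_state, new_state,
                                             std::memory_order_relaxed));
  return static_cast<int>(new_state >> 33);
}

} // namespace sketch
```

Each call consumes exactly one state transition, so concurrent callers draw distinct elements of the same sequence, at the cost of a retry when two threads race.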
2024-06-25	[libc][fixedvector] Add const_iterator begin/end (#96714)	PiJoules	2	-0/+15

2024-06-25	[libc][arm] move setjmp+longjmp to fullbuild-only entrypoints (#96708)	Nick Desaulniers (paternity leave)	1	-4/+8
	The opaque type jmp_buf should only be tested in fullbuild mode.
2024-06-25	[libc][math] Implement double precision cos correctly rounded to all rounding modes. (#96591)	lntue	15	-174/+505
	Sharing the same algorithm as double precision sin: https://github.com/llvm/llvm-project/pull/95736
2024-06-25	[libc][math][c23] Add MPFR exhaustive test for fmodf16 (#94656)	OverMighty	4	-16/+174

2024-06-25	[libc][thumb] support syscalls from thumb mode (#96558)	Nick Desaulniers (paternity leave)	1	-7/+19
	r7 is reserved in thumb2 (typically for the frame pointer, as opposed to r11 in ARM mode), so assigning to a variable with explicit register storage in r7 will produce an error. But r7 is where the Linux kernel expects the syscall number to be placed. We can use a temporary variable so that the register allocator picks a scratch register, in which we save and later restore the previous value of r7. Fixes: #93738
2024-06-25	[libc][arm32] define argc type and stack alignment (#96367)	Nick Desaulniers (paternity leave)	3	-23/+8
	https://github.com/ARM-software/abi-aa/blob/main/aapcs32/aapcs32.rst#6212stack-constraints-at-a-public-interface mentions that the stack on ARM32 is double word aligned. Remove confused comments around ArgcType. argc is always an int, passed on the stack, so we need to store a pointer to it (regardless of ILP32 or LP64).
2024-06-25	[libc][math][c23] Add f16divf C23 math function (#96131)	OverMighty	20	-45/+613
	Part of #93566.
2024-06-25	[libc] Configure CMAKE_REQUIRED_FLAGS so the GPU can use flag checks (#95424)	Joseph Huber	1	-32/+13
	Summary: This patch adds `CMAKE_REQUIRED_FLAGS` for the GPU build so checks like `check_cxx_compiler_flags` work as expected. This is required because we need to hack around the potential lack of `nvlink` and `ptxas` for NVPTX targets and the fact that the AMDGPU target needs `-nogpulib` to avoid errors on lack of ROCm. This makes a few of the checks pass and also allows us to just check `-mcpu=native` for architecture detection instead of finding the tools manually.
2024-06-24	[libc] Disable freelist test on NVPTX temporarily	Joseph Huber	1	-11/+13
	Summary: This test fails due to alignment issues, it's likely that it's misaligned on other targets too and they just don't crash on it. @PiJoules maybe we should run this with ubsan?
2024-06-24	[libc][math] Implement double precision sin correctly rounded to all rounding modes. (#95736)	lntue	19	-59/+1790
	- Algorithm:
	  - Step 1 - Range reduction: for a double precision input `x`, return `k` and `u` such that
	    - k is an integer
	    - u = x - k * pi / 128, and |u| < pi/256
	  - Step 2 - Calculate `sin(u)` and `cos(u)` in double-double using Taylor polynomials with errors < 2^-70 with FMA or < 2^-66 w/o FMA.
	  - Step 3 - Calculate `sin(x) = sin(k * pi/128) * cos(u) + cos(k * pi/128) * sin(u)` using look-up table for `sin(k * pi/128)` and `cos(k * pi/128)`.
	  - Step 4 - Use Ziv's rounding test to decide if the result is correctly rounded.
	  - Step 4' - If the Ziv's rounding test failed, redo step 1-3 using 128-bit precision.
	- Currently, without FMA instructions, the large range reduction only works correctly for the default rounding mode (FE_TONEAREST).
	- Provide `LIBC_MATH` flag so that users can set `LIBC_MATH = LIBC_MATH_SKIP_ACCURATE_PASS` to build the `sin` function without step 4 and 4'.
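Steps 1 to 3 can be sketched in plain double precision as follows. This is only a rough illustration under several assumptions: it skips the double-double arithmetic, the Ziv rounding test and the 128-bit fallback, it is adequate only for small |x|, and it calls `std::sin`/`std::cos` as a stand-in for the look-up table. The names `Reduced`, `reduce`, `sin_poly`, `cos_poly` and `sin_sketch` are hypothetical and do not come from the patch.

```cpp
#include <cassert>
#include <cmath>

namespace sketch {

constexpr double PI = 3.14159265358979323846;

// Result of step 1: x = k * pi/128 + u with |u| <= pi/256.
struct Reduced {
  long k;
  double u;
};

// Step 1: range reduction in plain double arithmetic (small |x| only).
inline Reduced reduce(double x) {
  long k = std::lround(x * (128.0 / PI));
  double u = x - static_cast<double>(k) * (PI / 128.0);
  return {k, u};
}

// Step 2: short Taylor polynomials, accurate because |u| is tiny.
inline double sin_poly(double u) {
  double u2 = u * u;
  return u * (1.0 + u2 * (-1.0 / 6 + u2 * (1.0 / 120 + u2 * (-1.0 / 5040))));
}

inline double cos_poly(double u) {
  double u2 = u * u;
  return 1.0 + u2 * (-0.5 + u2 * (1.0 / 24 + u2 * (-1.0 / 720)));
}

// Step 3: angle-addition formula. The std::sin/std::cos calls stand in for
// the look-up table of sin(k * pi/128) and cos(k * pi/128).
inline double sin_sketch(double x) {
  Reduced r = reduce(x);
  double angle = static_cast<double>(r.k) * (PI / 128.0);
  return std::sin(angle) * cos_poly(r.u) + std::cos(angle) * sin_poly(r.u);
}

} // namespace sketch
```

For example, `sketch::sin_sketch(1.0)` reduces with k = 41 and a remainder of roughly -0.0063, and should agree with `std::sin(1.0)` to well within 1e-12.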
2024-06-24	[libc][startup] create header for ElfW and use in startup (#96510)	Nick Desaulniers (paternity leave)	4	-4/+25
	This is necessary for 32b platforms such as ARM and i386. Link: #94128
2024-06-24	[libc][arm] add malloc/free/aligned_alloc to entrypoints (#96516)	Nick Desaulniers (paternity leave)	1	-0/+5
	Necessary for arm32 cross full build.
2024-06-21	[libc][stdlib] Bring all GPU's alloc/free entrypoints under the same conditional. (#96373)	lntue	1	-104/+104

2024-06-21	[libc][stdlib] Fix skipped libc.src.stdlib.freelist_malloc target for baremetal. (#96372)	lntue	3	-1/+4
	Downstream build issue reported: https://ci.chromium.org/ui/p/fuchsia/builders/toolchain.ci/clang-linux-x64/b8744479865106393873/overview
2024-06-21	[libc][startup] check that we're cross compiling and using LLD (#96357)	Nick Desaulniers (paternity leave)	1	-1/+5
	We only need to set `--target=` for LLD when cross compiling. This should fix the host build using BFD or targeting the host. Fixes: #96342
2024-06-21	[libc][stdlib] Only use freelist_malloc for baremetal targets. (#96355)	lntue	2	-10/+16

2024-06-21	[libc][startup] set --target= for linker when cross compiling (#96342)	Nick Desaulniers (paternity leave)	1	-1/+1
	Otherwise the startup objects will fail to link since they were cross compiled, but the linker is not informed of the intent to cross compile, which results in linker errors when the host architecture does not match the target architecture.
2024-06-21	Revert "combined string and time functions"	Nick Desaulniers	3	-207/+0
	This reverts commit f333fc5c9732a5b64ae0bca09ade6f0036e80c40. Accidentally pushed, sorry!
2024-06-21	combined string and time functions	Rose Zhang	3	-0/+207

2024-06-21	[libc] Added const modifier to SigSetPtrType (#96252)	RoseZhang03	1	-2/+2
	Function header files for epoll_pwait and epoll_pwait2 have a const modifier on SigSetPtrType in the function signature, but the linux.td file does not reflect that. The .td file is located at libc/spec/linux.td; the epoll functions are located in libc/src/sys/epoll.
2024-06-21	[libc][math][c23] Add {ldexp,scalbn,scalbln}f16 C23 math functions (#94797)	OverMighty	26	-32/+322
	Part of #93566.
2024-06-20	[libc] Control freelist malloc buffer size with a config (#96248)	PiJoules	5	-5/+17
	Rather than propagating a compile define, add an explicit cmake flag for controlling the size. The default for baremetal is 100KB and the default for others is 1GB.
2024-06-20	[libc][config] Add malloc as baremetal arm entrypoint (#95827)	PiJoules	2	-0/+8

2024-06-20	[libc] Move freelist + block to __support (#96231)	PiJoules	12	-96/+98

2024-06-20	[libc][math][c23] Add {getpayload,setpayload,setpayloadsig}f16 C23 math functions (#95159)	OverMighty	21	-2/+519
	Part of #93566.
2024-06-20	[libc][arm] implement a basic setjmp/longjmp (#93220)	Nick Desaulniers (paternity leave)	9	-7/+178
	Note: our baremetal arm configuration compiles this as `--target=arm-none-eabi`, so this code is built in -marm mode. It could be smaller with `--target=armv7-none-eabi -mthumb`. The assembler is valid ARMv5, or THUMB2, but not THUMB(1).
2024-06-19	[libc] Fix scheduler test incorrectly guessing user privileges (#95562)	OverMighty	1	-21/+27
	Non-root users may be able to set real-time scheduling policies. Don't expect failure to set real-time scheduling policies based on UID. Instead, check that if it fails, it is either due to missing privileges, or unsupported parameters if the scheduling policy is not mandated by POSIX. Fixes #95564.
2024-06-18	[libc][fcntl] fix -Wshorten-64-to-32 for 32b ARM (#95945)	Nick Desaulniers (paternity leave)	1	-2/+2
	Fixes:
	llvm-project/libc/src/__support/OSUtil/linux/fcntl.cpp:63:26: error: implicit conversion loses integer precision: '__off64_t' (aka 'long long') to '__off_t' (aka 'long') [-Werror,-Wshorten-64-to-32]
	    flk->l_start = flk64.l_start;
	                 ~ ~~~~~~^~~~~~~
	llvm-project/libc/src/__support/OSUtil/linux/fcntl.cpp:64:24: error: implicit conversion loses integer precision: '__off64_t' (aka 'long long') to '__off_t' (aka 'long') [-Werror,-Wshorten-64-to-32]
	    flk->l_len = flk64.l_len;
	               ~ ~~~~~~^~~~~
	We already have an overflow check, just need the cast to be explicit. This warning was observed on the 32b ARM build in overlay mode.
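The pattern described here, a range check followed by an explicit cast, can be sketched as below. The helper `narrow_offset` and the fixed-width types are hypothetical stand-ins chosen so the example builds on any host; the actual code in fcntl.cpp works with `__off64_t` and `__off_t` and may be structured differently.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

namespace sketch {

// Hypothetical helper: copy a 64-bit offset into a 32-bit one.
// Returns false and leaves `narrow` untouched if the value does not fit.
inline bool narrow_offset(int64_t wide, int32_t &narrow) {
  // Overflow check first, so the cast below can never lose information.
  if (wide < std::numeric_limits<int32_t>::min() ||
      wide > std::numeric_limits<int32_t>::max())
    return false;
  // The explicit cast states the intent and avoids -Wshorten-64-to-32.
  narrow = static_cast<int32_t>(wide);
  return true;
}

} // namespace sketch
```

The explicit `static_cast` changes no behavior once the check has passed; it only tells the compiler that the narrowing is intentional.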
2024-06-18	[libc] Remove unnecessary check in printf floats (#95841)	Michael Jones	1	-19/+16
	Fixes https://github.com/llvm/llvm-project/issues/95638 The check was `if(unsigned_num >= 0)` which will always be true. The intent was to check for zero, but the `for` loop inside the `if` was already doing that.
2024-06-17	[libc] Fix getauxval being defined in a namespace	Joseph Huber	2	-4/+8

2024-06-17	[libc][stdlib] Run freelist_heap_test only in full build mode. (#95850)	lntue	1	-14/+16

2024-06-17	[libc][stdlib] Only add internal malloc in full build mode. Use the system malloc in overlay mode. (#95845)	lntue	1	-9/+17
	This causes an issue in overlay mode: https://github.com/llvm/llvm-project/pull/95736#issuecomment-2172739705
2024-06-17	[libc] Only include getauxval on AARCH64 targets (#95844)	Joseph Huber	2	-2/+4
	Summary: Not all platforms support this function or header, but it was being included by every test. Move it inside of the `ifdef` for the only user, which is aarch64.
2024-06-14	[libc][__support][bit] Switch popcount to Brian Kernighan’s Algorithm (#95625)	Ryan Beltran	1	-3/+4
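For reference, Brian Kernighan's algorithm counts set bits by repeatedly clearing the lowest set bit, so the loop runs once per set bit rather than once per bit position. A minimal sketch follows; the name `popcount_kernighan` is hypothetical and this is not necessarily how the libc helper is written.

```cpp
#include <cassert>
#include <type_traits>

namespace sketch {

// Count set bits: `value & (value - 1)` clears the lowest set bit,
// so the loop body executes exactly once per set bit.
template <typename T>
constexpr int popcount_kernighan(T value) {
  static_assert(std::is_unsigned<T>::value, "unsigned integer types only");
  int count = 0;
  for (; value != 0; value &= value - 1)
    ++count;
  return count;
}

} // namespace sketch
```

For instance, `sketch::popcount_kernighan(0b1011u)` takes three iterations (1011, 1010, 1000) and returns 3.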

2024-06-14	[libc] Add the implementation of the fdopen function (#94186)	Xu Zhang	18	-77/+367
	Fixes #93711. This patch implements the ``fdopen`` function. Given that ``fdopen`` internally calls ``fcntl``, the implementation of ``fcntl`` has been moved to ``__support/OSUtil``, where it serves as an internal public function.
2024-06-14	Reapply "[libc] printf, putchar and vprintf in bareemetal entrypoints… (#95619)	Haowei	2	-0/+6
	This reverts commit eca988aa4420f33810f9830c80ff9f149b7928ff. The underlying libc issue was fixed by PR #95576. The original PR is #95436, which adds printf, putchar and vprintf to the baremetal entrypoints.
2024-06-14	[libc] fix build errors (#95613)	Schrodinger ZHU Yifan	5	-3/+6

2024-06-14	[libc] fix build errors (#95600)	Schrodinger ZHU Yifan	2	-3/+3
	Bitfield conversion problem tested at: https://godbolt.org/z/dxjhs5Ghr
2024-06-14	[libc] fix preferred_type attribute detection (#95599)	Schrodinger ZHU Yifan	1	-1/+1