diff options
author | Alexandre Ganea <alexandre.ganea@ubisoft.com> | 2020-03-27 10:20:39 -0400 |
---|---|---|
committer | Alexandre Ganea <alexandre.ganea@ubisoft.com> | 2020-03-27 10:20:58 -0400 |
commit | 09158252f777c2e2f06a86b154c44abcbcf9bb74 (patch) | |
tree | d439364df9e912e5b6fbe9fe10e10e0da5b663b6 /llvm/lib/Support/Threading.cpp | |
parent | 3f2f7f895a849a3247625482b15491ed2103ccf4 (diff) | |
download | llvm-09158252f777c2e2f06a86b154c44abcbcf9bb74.zip llvm-09158252f777c2e2f06a86b154c44abcbcf9bb74.tar.gz llvm-09158252f777c2e2f06a86b154c44abcbcf9bb74.tar.bz2 |
[ThinLTO] Allow usage of all hardware threads in the system
Before this patch, it wasn't possible to extend the ThinLTO threads to all SMT/CMT threads in the system. Only one thread per core was allowed, instructed by usage of llvm::heavyweight_hardware_concurrency() in the ThinLTO code. Any number passed to the LLD flag /opt:lldltojobs=..., or any other ThinLTO-specific flag, was previously interpreted in the context of llvm::heavyweight_hardware_concurrency(), which means SMT disabled.
One can now say in LLD:
/opt:lldltojobs=0 -- Use one std::thread / hardware core in the system (no SMT). Default value if flag not specified.
/opt:lldltojobs=N -- Limit usage to N threads, regardless of usage of heavyweight_hardware_concurrency().
/opt:lldltojobs=all -- Use all hardware threads in the system. Equivalent to /opt:lldltojobs=$(nproc) on Linux and /opt:lldltojobs=%NUMBER_OF_PROCESSORS% on Windows. When an affinity mask is set for the process, threads will be created only for the cores selected by the mask.
When N > number-of-hardware-threads-in-the-system, the threads in the thread pool will be dispatched equally on all CPU sockets (tested only on Windows).
When N <= number-of-hardware-threads-on-a-CPU-socket, the threads will remain on the CPU socket where the process started (only on Windows).
Differential Revision: https://reviews.llvm.org/D75153
Diffstat (limited to 'llvm/lib/Support/Threading.cpp')
-rw-r--r-- | llvm/lib/Support/Threading.cpp | 28 |
1 files changed, 23 insertions, 5 deletions
diff --git a/llvm/lib/Support/Threading.cpp b/llvm/lib/Support/Threading.cpp index de5adad..39de058 100644 --- a/llvm/lib/Support/Threading.cpp +++ b/llvm/lib/Support/Threading.cpp @@ -84,16 +84,34 @@ void llvm::llvm_execute_on_thread_async( int computeHostNumHardwareThreads(); unsigned llvm::ThreadPoolStrategy::compute_thread_count() const { + if (ThreadsRequested > 0) + return ThreadsRequested; + int MaxThreadCount = UseHyperThreads ? computeHostNumHardwareThreads() : sys::getHostNumPhysicalCores(); if (MaxThreadCount <= 0) MaxThreadCount = 1; + return MaxThreadCount; +} - // No need to create more threads than there are hardware threads, it would - // uselessly induce more context-switching and cache eviction. - if (!ThreadsRequested || ThreadsRequested > (unsigned)MaxThreadCount) - return MaxThreadCount; - return ThreadsRequested; +Optional<ThreadPoolStrategy> +llvm::get_threadpool_strategy(StringRef Num, ThreadPoolStrategy Default) { + if (Num == "all") + return llvm::hardware_concurrency(); + if (Num.empty()) + return Default; + unsigned V; + if (Num.getAsInteger(10, V)) + return None; // malformed 'Num' value + if (V == 0) + return Default; + + // Do not take the Default into account. This effectively disables + // heavyweight_hardware_concurrency() if the user asks for any number of + // threads on the cmd-line. + ThreadPoolStrategy S = llvm::hardware_concurrency(); + S.ThreadsRequested = V; + return S; } namespace { |