path: root/clang/lib/CodeGen/CodeGenModule.cpp
author: Joseph Huber <huberjn@outlook.com> 2024-02-01 07:19:57 -0600
committer: GitHub <noreply@github.com> 2024-02-01 07:19:57 -0600
commit: f956e7fbf161447b9236f7c4448a9d02d3564261
tree: 75efdcc8e95d0be467d7976dea21b448c0a461f1 /clang/lib/CodeGen/CodeGenModule.cpp
parent: d2565bb11308f6cf98d838e828d9bcbe2d51e0e4
[AMDGPU] Prefer `s_memtime` for `readcyclecounter` on GFX10 (#80211)
Summary: The old `s_memtime` instruction is supported up to and including the GFX10 architecture. Although it has a higher latency than the new shader counter, it is much more usable as a processor clock because it is a full 64-bit counter. The new shader counter is only 20 bits wide, which makes it difficult to use as a standard cycle counter: it overflows within a few milliseconds. This patch prefers `s_memtime` for the `readcyclecounter` intrinsic whenever the instruction is still available.
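
To make the overflow argument concrete, here is a minimal host-side sketch of the arithmetic, not code from this patch. The helper names `wrap_seconds` and `elapsed_ticks` are hypothetical, and the 100 MHz tick rate in the usage note is an assumed figure chosen only for illustration; the commit message does not state the actual counter frequency.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Seconds until a free-running counter of the given bit width wraps around,
// for a given tick rate in Hz. (Hypothetical helper, for illustration only.)
double wrap_seconds(unsigned bits, double ticks_per_second) {
    assert(ticks_per_second > 0.0);
    // ldexp(1.0, bits) computes 2^bits without integer overflow at bits == 64.
    return std::ldexp(1.0, static_cast<int>(bits)) / ticks_per_second;
}

// Elapsed ticks between two samples of a counter that is `bits` wide.
// The result is only meaningful if the counter wrapped at most once
// between the two samples.
std::uint64_t elapsed_ticks(std::uint64_t start, std::uint64_t end, unsigned bits) {
    const std::uint64_t mask =
        bits >= 64 ? ~std::uint64_t{0} : ((std::uint64_t{1} << bits) - 1);
    return (end - start) & mask;
}
```

Under the assumed 100 MHz rate, `wrap_seconds(20, 100e6)` is roughly 10 ms while `wrap_seconds(64, 100e6)` is on the order of thousands of years, so any interval longer than one wrap period cannot be measured reliably with the 20-bit counter alone.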
Diffstat (limited to 'clang/lib/CodeGen/CodeGenModule.cpp')
0 files changed, 0 insertions, 0 deletions