aboutsummaryrefslogtreecommitdiff
path: root/mlir/lib/Transforms/LoopInvariantCodeMotion.cpp
diff options
context:
space:
mode:
authorJay Foad <jay.foad@amd.com>2020-02-28 23:38:41 +0000
committerJay Foad <jay.foad@amd.com>2020-03-10 19:31:24 +0000
commitc8f0d27ef37cf9f6a808b5830c9bdc5686cc5bc4 (patch)
treed46011a76c51c861d18c8e9676e94412d1e6a27d /mlir/lib/Transforms/LoopInvariantCodeMotion.cpp
parent47ec8702cbc6f607b2e5cc25270a560eb9e02710 (diff)
downloadllvm-c8f0d27ef37cf9f6a808b5830c9bdc5686cc5bc4.zip
llvm-c8f0d27ef37cf9f6a808b5830c9bdc5686cc5bc4.tar.gz
llvm-c8f0d27ef37cf9f6a808b5830c9bdc5686cc5bc4.tar.bz2
[AMDGPU] Fix the gfx10 scheduling model for f32 conversions
Summary: As far as I can tell on gfx10 conversions to/from f32 (that are not converting f32 to/from f64) are full rate instructions, but they were marked as quarter rate instructions. I have fixed this for gfx10 only. I assume the scheduling model was correct for older architectures, though I don't have any documentation handy to confirm that. Reviewers: rampitec, arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75392
Diffstat (limited to 'mlir/lib/Transforms/LoopInvariantCodeMotion.cpp')
0 files changed, 0 insertions, 0 deletions