aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Bitcode/Reader/BitcodeReader.cpp
diff options
context:
space:
mode:
authorHal Finkel <hfinkel@anl.gov>2014-05-07 22:25:18 +0000
committerHal Finkel <hfinkel@anl.gov>2014-05-07 22:25:18 +0000
commitf6475bbc4b156ccc70213895057549b5acf1e464 (patch)
tree9cc250a2bbe750a1f23f82c2315b26756994dec5 /llvm/lib/Bitcode/Reader/BitcodeReader.cpp
parent80877c228d019e9cdca6295f3197867cc86e1720 (diff)
downloadllvm-f6475bbc4b156ccc70213895057549b5acf1e464.zip
llvm-f6475bbc4b156ccc70213895057549b5acf1e464.tar.gz
llvm-f6475bbc4b156ccc70213895057549b5acf1e464.tar.bz2
[X86TTI] Remove the unrolling branch limits
The loop stream detector (LSD) on modern Intel cores, which optimizes the execution of small loops, has limits on the number of taken branches in addition to uop-count limits (modern AMD cores have similar limits). Unfortunately, at the IR level, estimating the number of branches that will be taken is difficult. For one thing, it strongly depends on later passes (block placement, etc.). The original implementation took a conservative approach and limited the maximal BB DFS depth of the loop. However, fairly-extensive benchmarking by several of us has revealed that this is the wrong approach. In fact, there are zero known cases where the branch limit prevents a detrimental unrolling (but plenty of cases where it does prevent beneficial unrolling). While we could improve the current branch counting logic by incorporating branch probabilities, this further complication seems unjustified without a motivating regression. Instead, unless and until a regression appears, the branch counting will be removed. llvm-svn: 208255
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')
0 files changed, 0 insertions, 0 deletions