path: root/llvm/lib/Bitcode/Reader/BitcodeReader.cpp
author: Roman Gareev <gareevroman@gmail.com> 2016-12-21 11:18:42 +0000
committer: Roman Gareev <gareevroman@gmail.com> 2016-12-21 11:18:42 +0000
commit: 92c446016a72d452e481aadc28bfb1ccc32ddeb7
tree: 2602b596d980514059782117144eeddf4dd73f3c /llvm/lib/Bitcode/Reader/BitcodeReader.cpp
parent: 7c7bf1b432c0deb69a7cd11e41ec915fd5500e7e

[Polly] Use three-dimensional arrays to store packed operands of the matrix multiplication

Previously, we used two-dimensional accesses to store the packed operands
of the matrix multiplication, to keep the packed arrays simple. However,
adding a third dimension simplifies the corresponding memory access,
reduces the execution time of the isl operations applied to it, and
consequently reduces the compile time of Polly.

For example, on an Intel Core i7-3820 (Sandy Bridge) with the following
options,

    clang -O3 gemm.c -I utilities/ utilities/polybench.c -DPOLYBENCH_TIME -march=native -mllvm -polly -mllvm -polly-pattern-matching-based-opts=true -DPOLYBENCH_USE_SCALAR_LB -mllvm -polly-target-cache-level-associativity=8,8 -mllvm -polly-target-cache-level-sizes=32768,262144 -mllvm -polly-target-latency-vector-fma=7

it reduces the compile time from about 361.456 seconds to about 0.816
seconds.

Reviewed-by: Michael Kruse <llvm@meinersbur.de>,
             Tobias Grosser <tobias@grosser.es>

Differential Revision: https://reviews.llvm.org/D27878

llvm-svn: 290251

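The difference between the two layouts described above can be illustrated with a minimal sketch. This is not code from the patch: the tile sizes `Mr`, `Kc` and `Panels`, the function names, and the exact shape of the earlier two-dimensional subscript are all assumptions made for illustration. The sketch only shows that a two-dimensional view needs an inner subscript combining two loop indices, whereas a three-dimensional view uses one plain index per dimension while addressing the same memory.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical tile sizes, chosen only for this illustration.
constexpr std::size_t Mr = 4;      // rows in one micro-panel
constexpr std::size_t Kc = 8;      // depth of one packed panel
constexpr std::size_t Panels = 3;  // number of micro-panels

// Assumed two-dimensional view: Packed[Panels][Kc * Mr].
// The inner subscript folds two loop indices into one expression.
std::size_t offset2D(std::size_t panel, std::size_t k, std::size_t r) {
  std::size_t inner = k * Mr + r;  // combined subscript
  return panel * (Kc * Mr) + inner;
}

// Three-dimensional view: Packed[Panels][Kc][Mr].
// Every subscript is a single loop index.
std::size_t offset3D(std::size_t panel, std::size_t k, std::size_t r) {
  return (panel * Kc + k) * Mr + r;
}

// Returns true when both views address the same element
// for every valid index triple.
bool layoutsAgree() {
  for (std::size_t panel = 0; panel < Panels; ++panel)
    for (std::size_t k = 0; k < Kc; ++k)
      for (std::size_t r = 0; r < Mr; ++r)
        if (offset2D(panel, k, r) != offset3D(panel, k, r))
          return false;
  return true;
}
```

Under these assumptions the memory layout is unchanged; only the way the access is written differs, which is consistent with the commit's claim that the simpler access is cheaper for isl to process.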
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')
0 files changed, 0 insertions, 0 deletions