aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Bitcode/Reader/BitcodeReader.cpp
diff options
context:
space:
mode:
authorpeter klausler <pklausler@nvidia.com>2021-05-17 14:06:44 -0700
committerpeter klausler <pklausler@nvidia.com>2021-05-18 10:59:52 -0700
commit5e1421b22f642a6b34690d0d724e691ba3984836 (patch)
tree68247f38710d572e644bac13a2d53ffd69404d57 /llvm/lib/Bitcode/Reader/BitcodeReader.cpp
parent2919222d8017f2425a85765b95e4b7c6f8e70ca4 (diff)
downloadllvm-5e1421b22f642a6b34690d0d724e691ba3984836.zip
llvm-5e1421b22f642a6b34690d0d724e691ba3984836.tar.gz
llvm-5e1421b22f642a6b34690d0d724e691ba3984836.tar.bz2
[flang] Implement MATMUL in the runtime
Define an API for the transformational intrinsic function MATMUL, implement it, and add some basic unit tests. The large number of possible argument type combinations are covered by a set of generalized templates that are instantiated for each valid pair of possible argument types. Places where BLAS-2/3 routines could be called for acceleration are marked with TODOs. Handling for other special cases (e.g., known-shape 3x3 matrices and vectors) are deferred. Some minor tweaks were made to the recent related implementation of DOT_PRODUCT to reflect lessons learned. Differential Revision: https://reviews.llvm.org/D102652
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')
0 files changed, 0 insertions, 0 deletions