riscv-gnu-toolchain/gcc.git

author	Peter Bergner <bergner@linux.ibm.com>	2022-05-17 21:09:29 -0500
committer	Peter Bergner <bergner@linux.ibm.com>	2022-05-17 21:10:27 -0500
commit	c6e36f05fbb081abb068958d8900ad34b303a70b (patch)
tree	fbb1fbe9de9e09c20aa02d853a0e89c27337f7d1 /gcc/ada/sem_ch3.adb
parent	3d9439b1bb76c186958d5b86f0076f8b3017b8a2 (diff)

rs6000: Prefer assigning the MMA vector operands to altivec registers [PR105556]

When optimizing the DGEMM kernel in OpenBLAS to use MMA, the MMA code uses
all 8 accumulators, which overlap all vs0-vs31 vector registers.  Current
trunk assigns one of the normal vector inputs to one of the MMA instructions,
which forces us to spill one of the accumulators to memory, leading to poor
performance.  The solution here is to replace the "wa" constraints for the
vector input operands in the MMA instruction patterns with "v,?wa" so that
we prefer using the altivec registers vs32-vs63 over the vs0-vs31 registers.

2022-05-17  Peter Bergner  <bergner@linux.ibm.com>
	    Segher Boessenkool  <segher@kernel.crashing.org>

gcc/
	PR target/105556
	* config/rs6000/mma.md (mma_<vv>, mma_<avv>, mma_<pv>, mma_<apv>,
	mma_<vvi4i4i8>, mma_<avvi4i4i8>, mma_<vvi4i4i2>, mma_<avvi4i4i2>,
	mma_<vvi4i4>, mma_<avvi4i4>, mma_<pvi4i2>, mma_<apvi4i2>,
	mma_<vvi4i4i4>, mma_<avvi4i4i4>): Replace "wa" constraints with
	"v,?wa".  Update other operands accordingly.
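
The register pressure described in the message above can be illustrated with a small model. This is only a sketch and not GCC code: the function and constant names are hypothetical, and it assumes each accumulator overlaps four consecutive VSX registers (accumulator n covering vs(4n) through vs(4n+3)), which is consistent with 8 accumulators covering vs0-vs31.

```python
# Illustrative model only; all names here are hypothetical, not GCC internals.
NUM_ACCUMULATORS = 8
VSRS_PER_ACCUMULATOR = 4  # assumption: accumulator n overlaps vs(4n)..vs(4n+3)


def accumulator_vsrs(acc):
    """Return the set of VSX register numbers that accumulator `acc` overlaps."""
    first = acc * VSRS_PER_ACCUMULATOR
    return set(range(first, first + VSRS_PER_ACCUMULATOR))


def free_input_registers(live_accumulators, candidates):
    """Return the candidate registers that do not overlap any live accumulator."""
    busy = set()
    for acc in live_accumulators:
        busy |= accumulator_vsrs(acc)
    return sorted(set(candidates) - busy)


ALL_ACCUMULATORS = range(NUM_ACCUMULATORS)
LOW_VSRS = range(0, 32)       # vs0-vs31, shared with the accumulators
ALTIVEC_VSRS = range(32, 64)  # vs32-vs63, the registers matched by "v"

# With all 8 accumulators live, no register in vs0-vs31 is free for a vector
# input, so choosing one there forces an accumulator spill.
low_free = free_input_registers(ALL_ACCUMULATORS, LOW_VSRS)

# The altivec half never overlaps an accumulator, so inputs placed there
# leave every accumulator in registers.
altivec_free = free_input_registers(ALL_ACCUMULATORS, ALTIVEC_VSRS)
```

In this model `low_free` is empty while `altivec_free` contains all of vs32-vs63, which is the reason for listing "v" as the first alternative and discouraging "wa" with "?".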

Diffstat (limited to 'gcc/ada/sem_ch3.adb')

0 files changed, 0 insertions, 0 deletions

