aboutsummaryrefslogtreecommitdiff
path: root/gcc/combine.c
diff options
context:
space:
mode:
authorTom de Vries <tdevries@suse.de>2018-07-30 08:17:26 +0000
committerTom de Vries <vries@gcc.gnu.org>2018-07-30 08:17:26 +0000
commit4cdfee3f206d784f8a502af4f34180a0762df4fe (patch)
treed46c15b15b238c3095358b7e75a968184b1b05d1 /gcc/combine.c
parent0b210c43bbb6eddac8ba550d9c45bf679d4328c5 (diff)
downloadgcc-4cdfee3f206d784f8a502af4f34180a0762df4fe.zip
gcc-4cdfee3f206d784f8a502af4f34180a0762df4fe.tar.gz
gcc-4cdfee3f206d784f8a502af4f34180a0762df4fe.tar.bz2
[libgomp, nvptx] Handle per-function max-threads-per-block in default dims
Currently parallel-loop-1.c fails at -O0 on a Quadro M1200, because one of the kernel launch configurations exceeds the resources available in the device, due to the default dimensions chosen by the runtime. This patch fixes that by taking the per-function max_threads_per_block into account when using the default dimensions. 2018-07-30 Tom de Vries <tdevries@suse.de> * plugin/plugin-nvptx.c (MIN, MAX): Redefine. (nvptx_exec): Ensure worker and vector default dims don't exceed targ_fn->max_threads_per_block. From-SVN: r263062
Diffstat (limited to 'gcc/combine.c')
0 files changed, 0 insertions, 0 deletions