llvm / clang / c6d149249952dca405a25fd043da864ddf71301f / test / OpenMP / nvptx_teams_reduction_codegen.cpp
88700af
[OPENMP]Use different addresses for zeroed thread_id/bound_id.
by Alexey Bataev
· 5 years ago
635dc81
IR: print value numbers for unnamed function arguments
by Tim Northover
· 5 years ago
1353fe8
[OPENMP][NVPTX]Mark more functions as always_inline for better performance.
by Alexey Bataev
· 6 years ago
d6626f4
[OPENMP][NVPTX]Fix PR40893: Size doesn't match for
by Alexey Bataev
· 6 years ago
349d010
[OPENMP][NVPTX]Use faster teams reduction algorithm.
by Alexey Bataev
· 6 years ago
90bf190
[OPENMP][NVPTX]Reduce number of barriers in reductions.
by Alexey Bataev
· 6 years ago
1188822
[OPENMP][NVPTX]Use new functions from the runtime library.
by Alexey Bataev
· 6 years ago
2801c2a
[OPENMP][NVPTX]Emit shared memory buffer for reduction as 128 bytes buffer.
by Alexey Bataev
· 6 years ago
e98c891
[OPENMP][NVPTX]Improved interwarp copy function.
by Alexey Bataev
· 6 years ago
e1fea8e
[OpenMP] Add a new version of the SPMD deinit kernel function
by Gheorghe-Teodor Bercea
· 6 years ago
260cb96
[OPENMP][NVPTX]Basic support for reductions across the teams.
by Alexey Bataev
· 6 years ago
f7b7bb1
[OPENMP][NVPTX]Emit correct reduction code for teams/parallel reductions.
by Alexey Bataev
· 6 years ago
255eaa6
[OPENMP][NVPTX] Support memory coalescing for globalized variables.
by Alexey Bataev
· 6 years ago
ccd9acb
[OPENMP, NVPTX] Fix reduction of the big data types/structures.
by Alexey Bataev
· 7 years ago
c541223
[OPENMP] General code improvements.
by Alexey Bataev
· 7 years ago
b7d1036
[OPENMP, NVPTX] Fix codegen for the teams reduction.
by Alexey Bataev
· 7 years ago
27fbdd8
[OPENMP] Fix casting in NVPTX support library.
by Alexey Bataev
· 7 years ago
51604e7
[OpenMP] Adjust arguments of nvptx runtime functions
by Jonas Hahnfeld
· 7 years ago
08de7bb
[OpenMP] Teams reduction on the NVPTX device.
by Arpith Chacko Jacob
· 8 years ago