6f068682cc9fd653404eaff8a3275dda92837ed9 - llvm-project/parallel-libs

commit	6f068682cc9fd653404eaff8a3275dda92837ed9	[log] [tgz]
author	Jason Henline <jhen@google.com>	Tue Sep 13 23:59:10 2016 +0000
committer	Copybara-Service <copybara-worker@google.com>	Tue Sep 01 01:05:01 2020 -0700
tree	5ce70f821db8b1335a462c56fd4e34a7ba9e6ba9
parent	9230beadc183e48d143bc63f92b66a01aebccbc7 [diff]

[SE] Pack global dev handle addresses

Summary:
We were packing global device memory handles in
`PackedKernelArgumentArray`, but as I was implementing the CUDA
platform, I realized that CUDA wants the address of the handle, not the
handle itself. So this patch switches to packing the address of the
handle.

Reviewers: jlebar

Subscribers: jprice, jlebar, parallel_libs-commits

Differential Revision: https://reviews.llvm.org/D24528

llvm-svn: 281424
GitOrigin-RevId: b38d8a3a3baabf759e819fdefd764462691f4048

streamexecutor/examples/HostSaxpy.cpp[diff]
streamexecutor/include/streamexecutor/DeviceMemory.h[diff]
streamexecutor/include/streamexecutor/PackedKernelArgumentArray.h[diff]
streamexecutor/unittests/CoreTests/PackedKernelArgumentArrayTest.cpp[diff]

4 files changed

tree: 5ce70f821db8b1335a462c56fd4e34a7ba9e6ba9

streamexecutor/
.arcconfig
.clang-format
.clang-tidy
CMakeLists.txt
README.rst