[OPENMP][NVPTX]Fixed checks for cuda versions. Summary: We used CUDART_VERSION macro to check for the installed cuda version but this macro is defined in cuda_runtime_api.h, which is not used by project. Better to use CUDA_VERSION macro, which is defined in cuda.h. Also, added the check if this macro is defined. If macro is undefined, there is something wrong with the cuda configuration and we should not continue the compilation. This also fixes problems with runtime building in cuda 10+. Reviewers: grokos Subscribers: guansong, jdoerfert, caomhin, kkwli0, openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D64648 git-svn-id: https://llvm.org/svn/llvm-project/openmp/trunk@366224 91177308-0d34-0410-b5e6-96231b3b80d8

commit: 0519a2c6b21e1cec91c1d2fe1434054f718a3137 [log] [tgz]
author: Alexey Bataev <a.bataev@hotmail.com> Tue Jul 16 16:07:10 2019 +0000
committer: Alexey Bataev <a.bataev@hotmail.com> Tue Jul 16 16:07:10 2019 +0000
tree: 8026f1e32f68e8d7f4f08dcf21261c4f7a9401bd
parent: 946967528b804d3de059b9ad6fbe7e22ebbdd9bb [diff]
diff --git a/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.h b/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.h
index 646417d..f28284d 100644
--- a/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.h
+++ b/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.h

@@ -48,7 +48,9 @@
 // Macros for Cuda intrinsics
 // In Cuda 9.0, the *_sync() version takes an extra argument 'mask'.
 // Also, __ballot(1) in Cuda 8.0 is replaced with __activemask().
-#if defined(CUDART_VERSION) && CUDART_VERSION >= 9000
+#ifndef CUDA_VERSION
+#error CUDA_VERSION macro is undefined, something wrong with cuda.
+#elif CUDA_VERSION >= 9000
 #define __SHFL_SYNC(mask, var, srcLane) __shfl_sync((mask), (var), (srcLane))
 #define __SHFL_DOWN_SYNC(mask, var, delta, width)                              \
   __shfl_down_sync((mask), (var), (delta), (width))
@@ -58,7 +60,7 @@
 #define __SHFL_DOWN_SYNC(mask, var, delta, width)                              \
   __shfl_down((var), (delta), (width))
 #define __ACTIVEMASK() __ballot(1)
-#endif
+#endif // CUDA_VERSION
 
 #define __SYNCTHREADS_N(n) asm volatile("bar.sync %0;" : : "r"(n) : "memory");
 // Use original __syncthreads if compiled by nvcc or clang >= 9.0.
commit	0519a2c6b21e1cec91c1d2fe1434054f718a3137	[log] [tgz]
author	Alexey Bataev <a.bataev@hotmail.com>	Tue Jul 16 16:07:10 2019 +0000
committer	Alexey Bataev <a.bataev@hotmail.com>	Tue Jul 16 16:07:10 2019 +0000
tree	8026f1e32f68e8d7f4f08dcf21261c4f7a9401bd
parent	946967528b804d3de059b9ad6fbe7e22ebbdd9bb [diff]