commit | a79bce23b6fdfa14fddabf4d4f5cea158d15fa29 | [log] [tgz] |
---|---|---|
author | Jack Kirk <jack.kirk@codeplay.com> | Fri Aug 05 11:41:47 2022 -0700 |
committer | Copybara-Service <copybara-worker@google.com> | Fri Aug 05 12:16:53 2022 -0700 |
tree | c4ab63acca28619e3132974670fbc6b2eda67e89 | |
parent | d3d6dcd8a61bae01d4caf8074d60e6d2b8fa192a [diff] |
[CUDA] Fixed sm version constrain for __bmma_m8n8k128_mma_and_popc_b1. As stated in https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#warp-level-matrix-instructions-wmma-mma: ".and operation in single-bit wmma requires sm_80 or higher." tra@: Fixed a bug in builtins-nvptx-mma.py test generator and regenerated the tests. Differential Revision: https://reviews.llvm.org/D131265 GitOrigin-RevId: 3e0e5568a6a8c744d26f79a1e55360fe2655867c