scudo: Use DC GZVA instruction in storeTags().

DC GZVA can operate on multiple granules at a time (corresponding to
the CPU's cache line size) so we can generally expect it to be faster
than STZG in a loop.

Differential Revision: https://reviews.llvm.org/D100910

GitOrigin-RevId: 46c59d91dc7a39cc98be7a68d6dc60f3e8a35df0
1 file changed