commit | 8fc8f86e1b9c49a64baa936fdd65f1356b58108b | [log] [tgz] |
---|---|---|
author | Jan Vesely <jan.vesely@rutgers.edu> | Tue Feb 04 19:27:19 2020 -0500 |
committer | Copybara-Service <copybara-worker@google.com> | Wed Oct 21 13:06:24 2020 -0700 |
tree | 8dcd714e35d36bd767f818707b287b758eecb329 | |
parent | ecdb8a171f3ab137843370025cce236736c4b39a [diff] |
AMDGPU/EG,CM: Implement fsqrt using recip(rsqrt(x)) instead of x * rsqrt(x) The old version might be faster on EG (RECIP_IEEE is Trans only), but it'd need extra corner case checks. This gives correct corner case behaviour and saves a register. Fixes OCL CTS sqrt test (1-thread, scalar) on Turks. Reviewer: arsenm Differential Revision: https://reviews.llvm.org/D74017 GitOrigin-RevId: e6686adf8a743564f0c455c34f04752ab08cf642