Commit
Fix performance regression of normalize
PR #2778 tried to resolve the signed-zero issue in normalize, but it unconditionally adds v_cmp and v_cndmask instructions, which causes a performance drop. This change therefore adds a check of the NSZ (no signed zeros) flag. When NSZ is specified, we keep the previous handling and skip the extra instructions.
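
The gating described above can be sketched in plain C++. This is only an illustration, not the code from this commit: `FastMathFlags`, `normalizeFast`, and `normalizeScalar` are hypothetical names, and modelling the previous handling as collapsing zero inputs to +0.0 is an assumption made for the example.

```cpp
#include <cmath>

// Hypothetical stand-in for a fast-math flag set; only NSZ is modelled.
struct FastMathFlags {
    bool noSignedZeros;
};

// Assumed model of the previous handling: a zero input yields +0.0f,
// so the sign of -0.0f is lost. (The branch here only models that
// behaviour; it is not meant to represent an extra instruction.)
inline float normalizeFast(float x) {
    float lenSq = x * x;
    if (lenSq == 0.0f) return 0.0f;
    return x / std::sqrt(lenSq);
}

// Scalar normalize gated on NSZ.
inline float normalizeScalar(float x, FastMathFlags fmf) {
    float r = normalizeFast(x);
    if (fmf.noSignedZeros) {
        // NSZ set: the sign of zero does not matter, keep the cheap path.
        return r;
    }
    // NSZ not set: one compare plus one select (the analogue of
    // v_cmp + v_cndmask) passes a zero input through with its sign intact.
    bool isZero = (x == 0.0f);
    return isZero ? x : r;
}
```

With NSZ set, `normalizeScalar(-0.0f, {true})` returns whatever the fast path produces. Without it, the compare and select return the input zero unchanged, so its sign survives at the cost of the two extra operations.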