Fix issues of tanh(x) when x=INF or -INF #2786

amdrexu · 2023-10-26T09:19:59Z

We didn't check this special case; we just computed tanh(x) as sinh(x)/cosh(x). When x=INF or -INF, both sinh(x) and cosh(x) are infinite, so the quotient is NaN. The limits of tanh(x) are defined as follows:

lim(tanh(x)) = 1.0, x -> INF
lim(tanh(x)) = -1.0, x -> -INF
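
The failure described above can be reproduced on the host with a scalar model. This is only a sketch: `naiveTanh` is a hypothetical helper written for illustration, not the LLPC lowering itself, and it assumes IEEE 754 semantics without fast-math.

```cpp
#include <cmath>

// Hypothetical scalar model of the formulation in the description:
// tanh(x) = sinh(x) / cosh(x).
// For x = +INF or -INF both sinh(x) and cosh(x) are infinite,
// so the division is INF / INF, which is NaN under IEEE 754.
inline float naiveTanh(float x) {
  return std::sinh(x) / std::cosh(x);
}
```

For comparison, `std::tanh` from the C++ standard library returns 1.0 and -1.0 for the two infinite inputs, which matches the limits listed above.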

amdvlk-admin · 2023-10-26T10:00:00Z

Test summary for commit `50d32aa`

CTS tests (Failed: 0/137949)

Built with version 1.3.5.2

Ubuntu navi3x, Srdcvk

Passed: 35353/68947 (51.3%)
Failed: 0/68947 (0.0%)
Not Supported: 33594/68947 (48.7%)
Warnings: 0/68947 (0.0%)

Ubuntu navi2x, Srdcvk

Passed: 35424/69002 (51.3%)
Failed: 0/69002 (0.0%)
Not Supported: 33578/69002 (48.7%)
Warnings: 0/69002 (0.0%)

nhaehnle

I feel like I've had a discussion with @kezhaoAMD about this recently. But I think it was in an llpcfe context. Did you discuss with him? He had a solution using copysign which might be a little better.

xazhangAMD · 2023-10-26T13:35:51Z

Yes, llpcfe uses a copysign.

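  // Constant 1.0 of the same floating-point type as the call result.
  Value *FPOne = ConstantFP::get(call->getType(), 1);
  // True when the input is +INF or -INF.
  Value *isInf = m_builder->CreateIsInf(val);
  // copysign(1.0, val) yields +1.0 or -1.0, matching the sign of the input.
  Value *infResult = m_builder->CreateBinaryIntrinsic(Intrinsic::copysign, FPOne, val);
  // Use the signed 1.0 for infinite inputs and the computed tanh otherwise.
  return m_builder->CreateSelect(isInf, infResult, tanh);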
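
A scalar C++ model of the same copysign-plus-select pattern may make the behavior easier to check outside the compiler. This is a sketch under assumptions: `tanhBySinhCosh` and `tanhWithInfFix` are hypothetical names, and the sinh/cosh quotient stands in for whatever value `tanh` holds in the snippet above.

```cpp
#include <cmath>

// Hypothetical stand-in for the existing lowering; yields NaN for +/-INF.
inline float tanhBySinhCosh(float x) {
  return std::sinh(x) / std::cosh(x);
}

// Scalar model of the snippet: one isinf test, one copysign, one select.
inline float tanhWithInfFix(float x) {
  const float result = tanhBySinhCosh(x);          // may be NaN for infinite x
  const float infResult = std::copysign(1.0f, x);  // +1.0 or -1.0, sign of x
  return std::isinf(x) ? infResult : result;       // the single select
}
```

Because copysign carries the sign of the input, one infinity test covers both +INF and -INF, and finite or NaN inputs pass through unchanged.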

amdrexu · 2023-10-26T15:31:31Z

Thank you. Yes, this is better and saves a v_cndmask. I will follow it.

xazhangAMD · 2023-10-26T15:41:04Z

I didn't add it into lgc because I was not sure whether the GLSL spec requires this. I can drop the llpcfe approach once the lgc commit is merged.

amdvlk-admin · 2023-10-26T16:26:18Z

Test summary for commit `db6e682`

CTS tests (Failed: 0/137949)

Built with version 1.3.5.2

Ubuntu navi3x, Srdcvk

Passed: 35353/68947 (51.3%)
Failed: 0/68947 (0.0%)
Not Supported: 33594/68947 (48.7%)
Warnings: 0/68947 (0.0%)

Ubuntu navi2x, Srdcvk

Passed: 35424/69002 (51.3%)
Failed: 0/69002 (0.0%)
Not Supported: 33578/69002 (48.7%)
Warnings: 0/69002 (0.0%)

amdrexu · 2023-10-27T00:46:59Z

> I didn't add it into lgc because I was not sure whether GLSL spec requires this. I can drop the llpcfe approach when the lgc commit is merged.

Thank you. Actually, the GLSL spec allows this, but we didn't encounter such issues because there is no test for it.

nhaehnle

Thanks!

amdrexu requested a review from a team as a code owner October 26, 2023 09:19

nhaehnle reviewed Oct 26, 2023

Commit db6e682: Fix issues of tanh(x) when x=INF or -INF

amdrexu force-pushed the bugfix branch from 50d32aa to db6e682 October 26, 2023 16:01

nhaehnle approved these changes Oct 27, 2023

amdrexu merged commit 1816e93 into GPUOpen-Drivers:dev Oct 27, 2023
