Relax tolerance for test_out_addbmm_cpu_float32 #86365

Flamefire · 2022-10-06T14:17:28Z

The test may fail due to slightly different values caused by different order of matrizes in SGEMM:

Mismatched elements: 1 / 50 (2.0%)
Greatest absolute difference: 1.430511474609375e-05 at index (4, 5) (up to 1e-05 allowed)
Greatest relative difference: 4.65393206065873e-06 at index (4, 5) (up to 1.3e-06 allowed)

Observed on POWER (ppc64le)

pytorch-bot · 2022-10-06T14:17:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/86365

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit c4e94d5:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2022-10-06T14:17:33Z

The committers listed above are authorized under a signed CLA.

✅ login: Flamefire / name: Alexander Grund (c4e94d5)

mruberry

Approving to unblock testing -- can we guard this decorator using active_if to only apply when running on POWER?

Flamefire · 2022-10-07T08:12:11Z

can we guard this decorator using active_if to only apply when running on POWER?

I don't think this is worth it as POWER may only be one platform where this fails and using other compilers, BLAS libs, etc may make it fail on others too.
Also the tolerance is not unreasonable:

(Existing) 'TestCommon.test_numpy_refs': tol(atol=1.3e-05, rtol=1.3e-05)
(Existing) 'TestConsistency.test_output_match': tol(atol=1e-5, rtol=1e-5)
(New) 'TestCommon.test_out':' tol(atol=1.5e-05, rtol=1e-05)

So the new tolerance is very close to that of the numpy reference test with the relative being even lower. So one could argue to make it tol(atol=1.5e-05, rtol=1.3e-05) for all 3 tests for consistency

The test may fail due to slightly different values caused by different order of matrizes in SGEMM: > Mismatched elements: 1 / 50 (2.0%) > Greatest absolute difference: 1.430511474609375e-05 at index (4, 5) (up to 1e-05 allowed) > Greatest relative difference: 4.65393206065873e-06 at index (4, 5) (up to 1.3e-06 allowed)

Flamefire · 2022-11-22T09:24:20Z

Rebased and new CLA signed

kit1980 · 2022-11-22T20:25:29Z

@pytorchbot merge

pytorchmergebot · 2022-11-22T20:27:22Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

The test may fail due to slightly different values caused by different order of matrizes in SGEMM: > Mismatched elements: 1 / 50 (2.0%) > Greatest absolute difference: 1.430511474609375e-05 at index (4, 5) (up to 1e-05 allowed) > Greatest relative difference: 4.65393206065873e-06 at index (4, 5) (up to 1.3e-06 allowed) Observed on POWER (ppc64le) Pull Request resolved: pytorch#86365 Approved by: https://github.com/mruberry, https://github.com/kit1980

Flamefire requested review from mruberry and ngimel as code owners October 6, 2022 14:17

facebook-github-bot added the cla signed label Oct 6, 2022

pytorchbot added the open source label Oct 6, 2022

mruberry approved these changes Oct 6, 2022

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 6, 2022

Flamefire force-pushed the fix-baddbmm-prec branch from f84e0f2 to c4e94d5 Compare November 22, 2022 09:24

kit1980 approved these changes Nov 22, 2022

View reviewed changes

pytorchmergebot added the Merged label Nov 22, 2022

pytorchmergebot closed this in ac30047 Nov 22, 2022

Flamefire deleted the fix-baddbmm-prec branch November 23, 2022 10:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Relax tolerance for test_out_addbmm_cpu_float32 #86365

Relax tolerance for test_out_addbmm_cpu_float32 #86365

Uh oh!

Flamefire commented Oct 6, 2022

Uh oh!

pytorch-bot bot commented Oct 6, 2022 •

edited

Loading

Uh oh!

linux-foundation-easycla bot commented Oct 6, 2022 •

edited

Loading

Uh oh!

mruberry left a comment

Uh oh!

Flamefire commented Oct 7, 2022

Uh oh!

Flamefire commented Nov 22, 2022

Uh oh!

kit1980 commented Nov 22, 2022

Uh oh!

pytorchmergebot commented Nov 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Relax tolerance for test_out_addbmm_cpu_float32 #86365

Relax tolerance for test_out_addbmm_cpu_float32 #86365

Uh oh!

Conversation

Flamefire commented Oct 6, 2022

Uh oh!

pytorch-bot bot commented Oct 6, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/86365

✅ No Failures

Uh oh!

linux-foundation-easycla bot commented Oct 6, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mruberry left a comment

Choose a reason for hiding this comment

Uh oh!

Flamefire commented Oct 7, 2022

Uh oh!

Flamefire commented Nov 22, 2022

Uh oh!

kit1980 commented Nov 22, 2022

Uh oh!

pytorchmergebot commented Nov 22, 2022

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

pytorch-bot bot commented Oct 6, 2022 •

edited

Loading

linux-foundation-easycla bot commented Oct 6, 2022 •

edited

Loading