
Conversation

greenEkatherine
Contributor

There are 3980 generated test cases in the SniHelper_TruncatedData_Fails scenario; they exceed the XUnit limits for subresults and therefore are not reported properly.
The same change was made in YARP's tests: dotnet/yarp#1432

@ghost

ghost commented Dec 7, 2021

Tagging subscribers to this area: @dotnet/ncl, @vcsjones
See info in area-owners.md if you want to be subscribed.


// moving inside one test because there are more than 3000 cases and they overflow subresults
foreach ((int id, byte[] clientHello) in InvalidClientHelloDataTruncatedBytes())
{
    InvalidClientHello(clientHello, id, shouldPass: false);
Member


So now if one of the tests fails we won't know which one, and the rest of them won't get executed either.
If we're fine with this, no problem. I'm just making sure all the side-effects are understood and agreed on.

Contributor Author


Thank you for pointing this out. I may add additional info to identify the failed test if it helps; we still have an id.
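
For illustration, a minimal sketch of how that id could stay visible once all cases run inside a single test; the try/catch wrapper here is my assumption, while InvalidClientHello and InvalidClientHelloDataTruncatedBytes are the existing helpers from this test file:

// Sketch only: surface the failing case id in the test output even though
// the cases no longer appear as separate xunit results.
foreach ((int id, byte[] clientHello) in InvalidClientHelloDataTruncatedBytes())
{
    try
    {
        InvalidClientHello(clientHello, id, shouldPass: false);
    }
    catch (Exception ex)
    {
        throw new Xunit.Sdk.XunitException($"Truncated ClientHello case {id} failed: {ex.Message}");
    }
}

As written this still stops at the first failure; collecting the failing ids into a list and asserting it is empty at the end would avoid that, at the cost of slightly noisier code.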

@stephentoub
Member

Is this actually coming from xunit itself? Do we see this locally or only in CI? Seems like it's actually coming from AzDo, misspellings and all. Is it causing problems? I'd be inclined to keep it as is and get the local benefits of better debuggability, unless there's a real issue.

@stephentoub
Member

stephentoub commented Dec 7, 2021

That said, 3980 test cases seems like a lot for this. Are they all valuable? Sometimes theories make it a little too easy to test the same conditions over and over :-)

@greenEkatherine
Contributor Author

Is this actually coming from xunit itself? Do we see this locally or only in CI? Seems like it's actually coming from AzDo, misspellings and all. Is it causing problems? I'd be inclined to keep it as is and get the local benefits of better debuggability, unless there's a real issue.

It is only in CI. The possible problem is that test results are ignored after the first 1000 cases.

@greenEkatherine
Contributor Author

That said, 3980 test cases seems like a lot for this. Are they all valuable? Sometimes theories make it a little too easy to test the same conditions over and over :-)

That was my question to @wfurt 😃 There are no duplicates, so my suggestion was to stop generating data once we reach broken bytes. That should reduce the number of cases.
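
For context, the cases come from truncating a valid ClientHello at successive lengths, so a single message fans out into thousands of theory cases. A rough, illustrative sketch of that shape of generator (the method and parameter names are mine, not the actual test code; requires using System and System.Collections.Generic):

// Illustrative sketch: one theory case per truncation point of a valid ClientHello.
// The pruning suggested above would add a stopping condition to a loop like this one.
public static IEnumerable<object[]> TruncatedVariants(byte[] validClientHello)
{
    for (int length = 0; length < validClientHello.Length; length++)
    {
        byte[] truncated = new byte[length];
        Array.Copy(validClientHello, truncated, length);
        yield return new object[] { length, truncated };
    }
}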

@stephentoub
Member

stephentoub commented Dec 7, 2021

The possible problem is that test results are ignored after the first 1000 cases.

Does it:

  • not run them in the first place
  • ignore only successes
  • ignore failures as well

?

We have many large theories throughout runtime. I'd be surprised and worried if this was actually ignoring failures in CI.

@greenEkatherine
Contributor Author

The possible problem is that test results are ignored after the first 1000 cases.

Does it:

  • not run them in the first place
  • ignore only successes
  • ignore failures as well

?

We have many large theories throughout runtime. I'd be surprised and worried if this was actually ignoring failures in CI.

I double-checked: failures should be counted in the final result; the issue is only in the reporting of the separate cases: https://developercommunity.visualstudio.com/t/VSTest-test-publication-miscounts-test-c/909375#T-ND914320

@stephentoub
Member

Thanks. That thread makes it sound like there isn't actually a problem here, and it's just grouping all cases under the parent?

@greenEkatherine
Contributor Author

Thanks. That thread makes it sound like there isn't actually a problem here, and it's just grouping all cases under the parent?

I think not; it's related to similar issues where not all test results are published. The parent should show if any test fails; without that I cannot find confirmation that CI will not ignore failures if they happen after the limit.

}

- public static IEnumerable<object[]> InvalidClientHelloDataTruncatedBytes()
+ public static IEnumerable<Tuple<int, byte[]>> InvalidClientHelloDataTruncatedBytes()
Member


Nit, as a style thing: we seem (I think?) to most often use the (int, byte[]) syntax (possibly with names) rather than the old Tuple<X, Y> syntax.
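
For illustration, the value-tuple form the nit is pointing at would look roughly like this (a sketch; the element names and the placeholder case are mine, not the PR's):

// Value tuples instead of Tuple<int, byte[]>; named elements keep call sites readable,
// and the foreach deconstruction in the new test body works with either form.
public static IEnumerable<(int id, byte[] clientHello)> InvalidClientHelloDataTruncatedBytes()
{
    yield return (0, new byte[] { 0x16, 0x03, 0x03 });   // placeholder case, not real test data
}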

@greenEkatherine
Contributor Author

I found out that it is a publishing issue only; a failing test should still be indicated correctly in CI. I don't even see the warnings in the runtime pipeline that appeared in the YARP pipeline, because the publishing steps are different. Taking this and the feedback into account, I would rather decline this change.

@stephentoub
Member

Taking this and the feedback into account, I would rather decline this change.

Sounds good. Thanks.

ghost locked as resolved and limited conversation to collaborators Jan 8, 2022
@wfurt
Member

wfurt commented Feb 15, 2022

I'm sorry for commenting on an old and closed issue, but I completely missed it for some reason until @rzikm brought it to my attention.

This was added by dotnet/corefx#28278 a very long time ago, and it feels like a simple form of fuzzing, e.g. feeding the parser variants of invalid input and checking that the parsing still fails. It is unlikely IMHO that we will ever break it, and if we do, it should be easy to figure out the details from a local test run.

There are two reasons why I wanted to fix this (since we already made the same change for YARP):

  • When running tests locally, this kills the scroll buffer, so often it is not possible to see other test failures directly.
  • It creates thousands of extra entries for each build on each platform, which puts an unnecessary burden on CPU and storage. Not huge, but we seem to be in a constant battle for resources.

I'm wondering if we could reconsider this, @stephentoub, and take it as a small improvement.

@stephentoub
Member

I'm wondering if we could reconsider this, @stephentoub, and take it as a small improvement.

If we don't believe the tests are valuable, it's fine to get rid of them or simplify them. My pushback was on removing them or changing the structure of them purely to work around the cited limitation, which didn't seem to be correct.

@wfurt
Member

wfurt commented Feb 15, 2022

I think there is some value in feeding in invalid data, since SslStream operates on untrusted inputs. But I'm not sure what a good simplification would be. I added another invalid-data test in #63184, but it seems difficult to come up with a somewhat more complete set. For that reason I thought moving this chunk into a single test may be useful. We can try to come up with better ways to fuzz or generate invalid or incomplete data.
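
As a hedged example of what other fuzzing-style input generation could look like beyond truncation (purely a sketch, nothing planned here), single-bit mutations of a valid ClientHello are cheap to produce; note that a mutated message is not guaranteed to be invalid, so such cases would need filtering or a different assertion than shouldPass: false:

// Sketch: derive variants by flipping one bit at a pseudo-random position of a valid
// ClientHello; a fixed seed keeps the generated cases reproducible across runs.
// Requires using System and System.Collections.Generic.
public static IEnumerable<byte[]> BitFlippedVariants(byte[] validClientHello, int count, int seed = 42)
{
    var random = new Random(seed);
    for (int i = 0; i < count; i++)
    {
        byte[] mutated = (byte[])validClientHello.Clone();
        int index = random.Next(mutated.Length);
        mutated[index] ^= (byte)(1 << random.Next(8));
        yield return mutated;
    }
}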

karelz added this to the 7.0.0 milestone Apr 8, 2022