Skip to content

Conversation

kunalspathak
Copy link
Contributor

#110173 is failing for more than 6 months now in multiple pipelines. Disable some tests to make CI green. We will continue to investigate #110173 and will re-enable them once fix is in.

cc: @janvorli

@Copilot Copilot AI review requested due to automatic review settings June 11, 2025 14:43
@github-actions github-actions bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Jun 11, 2025
@kunalspathak kunalspathak requested review from janvorli and removed request for Copilot June 11, 2025 14:44
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR temporarily disables additional StackOverflow tests on specific architectures and platforms to stabilize CI while issue #110173 is investigated.

  • Expanded the skip conditions in TestStackOverflowLargeFrameSecondaryThread to include X64 and Arm on Unix/macOS.
  • Extended the skip logic in TestStackOverflow3 to cover Unix ARM64.
  • Added comments referencing the disabling reason and related issue.
Comments suppressed due to low confidence (1)

src/tests/baseservices/exceptions/stackoverflow/stackoverflowtester.cs:197

  • [nitpick] Clarify this comment to explicitly state that both Unix X64 and Unix Arm are affected, e.g., 'Disabled on Unix X64 and Unix Arm due to…', to avoid ambiguity.
// Disabled on Unix X64/Arm due to https://github.com/dotnet/runtime/issues/110173 which needs investigation.

Comment on lines +188 to 190
if (((RuntimeInformation.ProcessArchitecture == Architecture.Arm64) || (RuntimeInformation.ProcessArchitecture == Architecture.X64) || (RuntimeInformation.ProcessArchitecture == Architecture.RiscV64) ||
(RuntimeInformation.ProcessArchitecture == Architecture.LoongArch64) || (RuntimeInformation.ProcessArchitecture == Architecture.Arm)) &&
((Environment.OSVersion.Platform == PlatformID.Unix) || (Environment.OSVersion.Platform == PlatformID.MacOSX)))
Copy link
Preview

Copilot AI Jun 11, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] Consider extracting the complex architecture and platform check into a well‐named helper method or boolean variable to improve readability and maintainability.

Suggested change
if (((RuntimeInformation.ProcessArchitecture == Architecture.Arm64) || (RuntimeInformation.ProcessArchitecture == Architecture.X64) || (RuntimeInformation.ProcessArchitecture == Architecture.RiscV64) ||
(RuntimeInformation.ProcessArchitecture == Architecture.LoongArch64) || (RuntimeInformation.ProcessArchitecture == Architecture.Arm)) &&
((Environment.OSVersion.Platform == PlatformID.Unix) || (Environment.OSVersion.Platform == PlatformID.MacOSX)))
if (IsUnixPlatformWithSpecificArchitecture())

Copilot uses AI. Check for mistakes.

Comment on lines +227 to 233
if ((RuntimeInformation.ProcessArchitecture == Architecture.Arm) || ((RuntimeInformation.ProcessArchitecture == Architecture.Arm64) && (Environment.OSVersion.Platform == PlatformID.Unix)))
{
// Disabled on ARM due to https://github.com/dotnet/runtime/issues/107184
// Disabled on Unix ARM64 due to https://github.com/dotnet/runtime/issues/110173 which needs investigation.
return;
}

Copy link
Preview

Copilot AI Jun 11, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] The condition in TestStackOverflow3 is quite dense; consider moving this logic into a named variable or helper for clarity and easier future updates.

Suggested change
if ((RuntimeInformation.ProcessArchitecture == Architecture.Arm) || ((RuntimeInformation.ProcessArchitecture == Architecture.Arm64) && (Environment.OSVersion.Platform == PlatformID.Unix)))
{
// Disabled on ARM due to https://github.com/dotnet/runtime/issues/107184
// Disabled on Unix ARM64 due to https://github.com/dotnet/runtime/issues/110173 which needs investigation.
return;
}
if (ShouldDisableTest())
{
return;
}
private static bool ShouldDisableTest()
{
// Disabled on ARM due to https://github.com/dotnet/runtime/issues/107184
// Disabled on Unix ARM64 due to https://github.com/dotnet/runtime/issues/110173 which needs investigation.
return (RuntimeInformation.ProcessArchitecture == Architecture.Arm) ||
((RuntimeInformation.ProcessArchitecture == Architecture.Arm64) && (Environment.OSVersion.Platform == PlatformID.Unix));
}

Copilot uses AI. Check for mistakes.

@janvorli
Copy link
Member

I agree it is a pain to have a test failing few times a week. But it seems that if we disable the test, there is no chance to figure out what's wrong, as I was unable to reproduce this problem locally, trying by running the test in a loop for weeks. Or even in CI by re-running relevant CI legs over and over in a targeted PR. The frequency of the CI test runs is the only thing that makes the problem fire from time to time. It seems there is some extremely rare combination of an async event and the stack overflow that triggers the behavior.

But maybe disabling it now and re-enabling it temporarily in the CI every time I figure out some extra logging / checks to try to figure out more details would work.

@kunalspathak
Copy link
Contributor Author

But maybe disabling it now and re-enabling it temporarily in the CI every time I figure out some extra logging / checks to try to figure out more details would work.

I agree, that way the CI is clean and you can trigger CI from your branch with extra logging.

@janvorli
Copy link
Member

Triggering it from my branch would not help, as it is unlikely it would ever repro that way. I was rather thinking to enable it globally for a while when I have some more ideas on added logging.

@kunalspathak
Copy link
Contributor Author

/ba-g failure seems to be #116520

@kunalspathak kunalspathak merged commit e0eea1f into dotnet:main Jun 11, 2025
72 of 74 checks passed
@kunalspathak kunalspathak deleted the disable-stackoverflow branch June 11, 2025 16:42
@AndyAyersMS
Copy link
Member

Can we set up a special CI leg that runs this test over and over, and schedule that to run every day?

@github-actions github-actions bot locked and limited conversation to collaborators Jul 12, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants