test(source): remove flaky log assertions from pod tests #5517

u-kai · 2025-06-11T01:03:40Z

What does it do ?

Remove flaky log assertions from TestPodSource to fix race conditions that caused intermittent test failures with "Expected no debug messages" errors.

Example of the race condition failure: : link

Motivation

The TestPodSource test was experiencing flaky failures due to race conditions when running in parallel.
The issue occurred because:

Global logger state conflicts: Multiple tests with t.Parallel() were simultaneously
modifying the global logrus logger through LogsUnderTestWithLogLevel()
Implementation detail testing: Testing log output (an implementation detail) rather than
focusing on the core functionality

Solution: Remove log testing entirely and focus on endpoint generation correctness, which is
the actual functionality being tested.

More

Yes, this PR title follows Conventional Commits
Yes, I added unit tests
Yes, I updated end user documentation accordingly

k8s-ci-robot · 2025-06-11T01:03:49Z

Hi @u-kai. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

mloiseleur · 2025-06-11T06:16:08Z

/ok-to-test

source/pod_test.go

ivankatliarchuk · 2025-06-11T07:16:14Z

If you removing this capability from tests, we need a compensation. Testing similar capabilities but without paralel execution

u-kai · 2025-06-11T08:22:11Z

@ivankatliarchuk
Thank you for review!

I personally don't think the log assertions in this test are essential, especially since it's just Debug logging.
I prioritized enabling t.Parallel() to speed up the test suite, and I believe removing the log assertions makes the test more robust since log output tends to change frequently.
Could you clarify why you think it's important to keep testing the logs in this case?

ivankatliarchuk · 2025-06-11T08:24:26Z

/hold

ivankatliarchuk · 2025-06-11T08:24:55Z

compensation for removed testing capability is required.

ivankatliarchuk · 2025-06-11T08:26:27Z

as well as static analysis for correct use of test.Parallel() is needed

u-kai · 2025-06-11T10:14:55Z

@ivankatliarchuk

compensation for removed testing capability is required.

The only part that is no longer being tested after my change is the log output.
As I understand it, your suggestion for a compensation would mean explicitly verifying that the expected log messages are produced — is that correct?
To my knowledge, very few of the existing tests currently verify that the log output is correct, so I just want to make sure I am interpreting your request properly.
Could you please confirm whether this understanding is correct?

ivankatliarchuk · 2025-06-11T11:28:40Z

Yes, there is no need to validate same logs for each test, so only few tests have logging valided. We could have separate test that just verify log messages. Agree that it does single log level testing, not efficient.

u-kai · 2025-06-11T13:47:55Z

@ivankatliarchuk
Thanks for the helpful advice! I've added a separate test for log verification as you suggested.
It would be great if you could take another look when you have a chance.

ivankatliarchuk

/lgtm

ivankatliarchuk · 2025-06-11T13:49:49Z

source/pod_test.go

+}
+
+func TestPodSourceLogs(t *testing.T) {
+	t.Parallel()


do we need Parallel for that? Just a comment, not necessary need to action

I think it's needed here, because as discussed below, using t.Parallel at this level is safe and allows us to parallelize the tests.

ivankatliarchuk · 2025-06-11T13:53:03Z

/unhold

source/pod_test.go

mloiseleur

See my suggestion: t.Parallel() should be inside the for loop.
apart from that, LGTM. Thanks for your help on this 👍

u-kai · 2025-06-12T12:41:59Z

@mloiseleur
Thanks for your review.
I applied your suggestion and ran the tests multiple times. However, I noticed that the test sometimes fails like this:

--- FAIL: TestPodSourceLogs (0.00s)
    --- FAIL: TestPodSourceLogs/when_ignoreNonHostNetworkPods=false,_no_skip_logs_should_be_generated (0.10s)
        pod_test.go:895:
                Error Trace:    /Users/kai/external-dns/internal/testutils/log.go:91
                Error:          Should be false
                Test:           TestPodSourceLogs/when_ignoreNonHostNetworkPods=false,_no_skip_logs_should_be_generated
                Messages:       Expected log message found when should not: skipping pod my-pod1-65699. hostNetwork=false
FAIL

This happens because the parallel tests share state and have conflicting expectations about whether certain log messages should or should not appear.
While this can be avoided by making sure test data (like variable names) don’t overlap between tests, this requires extra care and could easily introduce mistakes in the future.

For this reason, I personally prefer keeping the current approach without using t.Parallel() here — what do you think?

mloiseleur · 2025-06-12T12:47:30Z

For this reason, I personally prefer keeping the current approach without using t.Parallel() here — what do you think?

Make sense to me. We want the CI to be as reliable as we can 👍

u-kai · 2025-06-12T12:54:07Z

Make sense to me. We want the CI to be as reliable as we can 👍

Got it, I'll leave it as is then.

mloiseleur · 2025-06-13T06:16:44Z

/approve

ivankatliarchuk · 2025-06-13T07:01:52Z

/test pull-external-dns-unit-test

ivankatliarchuk · 2025-06-13T07:03:54Z

/test all

ivankatliarchuk

/lgtm

k8s-ci-robot · 2025-06-13T07:08:27Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ivankatliarchuk, mloiseleur

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [ivankatliarchuk,mloiseleur]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

test(source): remove flaky log assertions from pod tests

2377aae

k8s-ci-robot requested a review from ivankatliarchuk June 11, 2025 01:03

k8s-ci-robot added the source label Jun 11, 2025

k8s-ci-robot requested a review from szuecs June 11, 2025 01:03

k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Jun 11, 2025

k8s-ci-robot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Jun 11, 2025

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Jun 11, 2025

ivankatliarchuk suggested changes Jun 11, 2025

View reviewed changes

source/pod_test.go Show resolved Hide resolved

k8s-ci-robot assigned ivankatliarchuk Jun 11, 2025

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jun 11, 2025

test(source): add comprehensive log testing for pod source

ab24862

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Jun 11, 2025

ivankatliarchuk reviewed Jun 11, 2025

View reviewed changes

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jun 11, 2025

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jun 11, 2025

mloiseleur reviewed Jun 12, 2025

View reviewed changes

source/pod_test.go Show resolved Hide resolved

mloiseleur reviewed Jun 12, 2025

View reviewed changes

source/pod_test.go Show resolved Hide resolved

mloiseleur reviewed Jun 12, 2025

View reviewed changes

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 13, 2025

ivankatliarchuk approved these changes Jun 13, 2025

View reviewed changes

k8s-ci-robot merged commit f5a3667 into kubernetes-sigs:master Jun 13, 2025
15 checks passed

test(source): remove flaky log assertions from pod tests #5517

test(source): remove flaky log assertions from pod tests #5517

Uh oh!

Conversation

u-kai commented Jun 11, 2025

What does it do ?

Motivation

More

Uh oh!

k8s-ci-robot commented Jun 11, 2025

Uh oh!

mloiseleur commented Jun 11, 2025

Uh oh!

Uh oh!

ivankatliarchuk commented Jun 11, 2025

Uh oh!

u-kai commented Jun 11, 2025

Uh oh!

ivankatliarchuk commented Jun 11, 2025

Uh oh!

ivankatliarchuk commented Jun 11, 2025

Uh oh!

ivankatliarchuk commented Jun 11, 2025

Uh oh!

u-kai commented Jun 11, 2025

Uh oh!

ivankatliarchuk commented Jun 11, 2025

Uh oh!

u-kai commented Jun 11, 2025

Uh oh!

ivankatliarchuk left a comment

Choose a reason for hiding this comment

Uh oh!

ivankatliarchuk Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

u-kai Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

ivankatliarchuk commented Jun 11, 2025

Uh oh!

Uh oh!

Uh oh!

mloiseleur left a comment

Choose a reason for hiding this comment

Uh oh!

u-kai commented Jun 12, 2025

Uh oh!

mloiseleur commented Jun 12, 2025

Uh oh!

u-kai commented Jun 12, 2025

Uh oh!

mloiseleur commented Jun 13, 2025

Uh oh!

ivankatliarchuk commented Jun 13, 2025

Uh oh!

ivankatliarchuk commented Jun 13, 2025

Uh oh!

ivankatliarchuk left a comment

Choose a reason for hiding this comment

Uh oh!

k8s-ci-robot commented Jun 13, 2025

Uh oh!

Uh oh!

Uh oh!