
Conversation

Pijukatel
Collaborator

Description

The `max_request_retries` argument of `BasicCrawler` previously counted the initial request as one of the retries. Now it counts only the actual retries.
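For illustration, a minimal sketch of the new behavior (not part of this PR; it assumes the crawlee-python public API with `crawlee.crawlers.BasicCrawler` and a deliberately failing handler):

import asyncio

from crawlee.crawlers import BasicCrawler, BasicCrawlingContext


async def main() -> None:
    # Under the new semantics, max_request_retries=3 allows up to
    # 1 initial attempt + 3 retries = 4 handler calls per request.
    crawler = BasicCrawler(max_request_retries=3)
    attempts: list[str] = []

    @crawler.router.default_handler
    async def handler(context: BasicCrawlingContext) -> None:
        attempts.append(context.request.url)
        raise RuntimeError('Simulated failure to force a retry.')

    await crawler.run(['https://a.placeholder.com'])
    print(len(attempts))  # Expected: 4 (previously 3 under the old counting).


if __name__ == '__main__':
    asyncio.run(main())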

Issues

Pijukatel added the labels bug (Something isn't working.) and t-tooling (Issues with this label are in the ownership of the tooling team.) on Jul 29, 2025
github-actions bot added this to the 120th sprint - Tooling team milestone on Jul 29, 2025
github-actions bot added the label tested (Temporary label used only programmatically for some analytics.) on Jul 29, 2025
Pijukatel requested a review from vdusek on July 29, 2025 07:04
Pijukatel marked this pull request as ready for review on July 29, 2025 07:04
'https://c.placeholder.com',
'https://b.placeholder.com',
'https://b.placeholder.com',
'https://b.placeholder.com',
Collaborator

Is this intentional?

Collaborator Author

Yes, the default number of retries is 3, so with this PR it will make 4 calls (the original request + 3 retries).

Comment on lines -191 to -192
'https://c.placeholder.com',
'https://c.placeholder.com',
Collaborator

Is this intentional?

Collaborator Author

Yes, 1 retry means 1 call + 1 retry, i.e. 2 calls in total.
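For completeness, the arithmetic under the new semantics (illustrative only, not code from the PR):

max_request_retries = 1
total_calls = 1 + max_request_retries  # 1 initial attempt + 1 retry == 2 calls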

Comment on lines -216 to -226
# Retrieve or initialize the headers, and extract the current custom retry count.
headers = context.request.headers or HttpHeaders()
custom_retry_count = int(headers.get('custom_retry_count', '0'))

# Append the current call information.
calls.append(Call(context.request.url, error, custom_retry_count))

# Update the request to include an incremented custom retry count in the headers and return it.
request = context.request.model_dump()
request['headers'] = HttpHeaders({'custom_retry_count': str(custom_retry_count + 1)})
return Request.model_validate(request)
Collaborator

Is this just some optimization?

Collaborator Author

Yes, I think this was more complicated than the test's intention required, and the same can be achieved with a simpler setup.
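As an aside, one hypothetical shape such a simpler setup could take (assuming the `@crawler.error_handler` decorator and the built-in `context.request.retry_count` counter, with `crawler`, `calls`, and `Call` coming from the test's existing setup; a sketch, not the code merged in this PR):

# Hypothetical simplification: read the retry count the crawler already
# maintains on the request instead of threading a custom counter
# through the headers.
@crawler.error_handler
async def error_handler(context: BasicCrawlingContext, error: Exception) -> None:
    calls.append(Call(context.request.url, error, context.request.retry_count))
    # Returning None (implicitly) keeps the original request for the retry.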

Pijukatel requested a review from vdusek on July 29, 2025 08:34
@janbuchar
Collaborator

Just curious, is there a test that checks this in the JS version?

Collaborator

@vdusek left a comment

lgtm

@Pijukatel
Collaborator Author

Just curious, is there a test that checks this in the JS version?

I think this should cover it:
https://github.com/apify/crawlee/blob/master/test/core/crawlers/basic_crawler.test.ts#L414

Pijukatel merged commit 74fa1d9 into master on Jul 29, 2025
19 checks passed
Pijukatel deleted the fix-retry-count branch on July 29, 2025 12:42
Pijukatel added a commit that referenced this pull request Jul 30, 2025
The `max_request_retries` argument of `BasicCrawler` previously counted
the initial request as one of the retries. Now it counts only the actual retries.

- Closes: #1326
Successfully merging this pull request may close these issues.

max_request_retries should not include the original request