-
Notifications
You must be signed in to change notification settings - Fork 482
refactor!: сhange default incognito context
to persistent context
for Playwright
#985
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Playwright
incognito context
to persistent context
for Playwright
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since it contains a breaking change, could you please describe the breaking changes in the PR's description and also summarize it in the Upgrading guide?
Co-authored-by: Vlada Dusek <[email protected]>
Co-authored-by: Vlada Dusek <[email protected]>
Co-authored-by: Vlada Dusek <[email protected]>
### Description - fix public imports in `__init__` files - Add `rich` to direct dependencies. It is one of `cookiecutter`'s dependencies, but we use it directly in `statistics._models.py` --------- Co-authored-by: Vlada Dusek <[email protected]>
### Description Add adaptive context helpers and documentation for AdaptivePlaywrightCrawler. ### Issues - Closes: apify#249 --------- Co-authored-by: Jan Buchar <[email protected]> Co-authored-by: Jan Buchar <[email protected]>
### Description - update curl-cffi version requirement to >=0.9.0. - update default `impersonate` from `chrome124` to `chrome131`
- Migrate from `poetry` to `uv`. - Relates: apify#628 - The update of templates to use `uv` will be implemented separately.
- `project.urls` - python 3.13 in ci - unify name "Set up uv package manager" - fix contributing guide - add all extra, remove dev extra (move to dev deps) - relates: apify#628
…pify#959) Add `additional_http_error_status_codes` and `ignore_http_error_status_codes` to PlaywrightCrawler. Since they exist now on all crawlers, move them to `BasicCrawler` level. Do not use `_http_client` attributes for getting additional status codes related variables. **Breaking:** Remove `HttpCrawlerOptions` -> No unique options compared to `BasicCrawlerOptions` anymore. - Closes: apify#953
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are failing tests for Windows Python 3.12.
6b6236d
to
5e97e31
Compare
The problem is not related to this PR. It's solved in the PR #1007 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the tests, just two minor comments.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, just a few comments.
Co-authored-by: Vlada Dusek <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Description
PlaywrightCrawler
from using the standard browser context to using a persistent browser context.user_data_dir
with the path to the directory for the context. Ifuser_data_dir
is not provided, a temporary directory will be created.Issues