[3.13] gh-88375, gh-111788: Fix parsing errors and normalization in robotparser (GH-138502) by miss-islington · Pull Request #138549 · python/cpython · GitHub
@miss-islington commented Sep 5, 2025

  • Don't fail trying to parse weird patterns.
  • Don't fail trying to decode non-UTF-8 "robots.txt" files.
  • No longer ignore trailing "?" in patterns and URLs.
  • Distinguish raw special characters "?", "=" and "&" from the
    percent-encoded ones.
  • Remove tests that do nothing.
    (cherry picked from commit cb7ef18)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
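
To see which inputs the changes listed above affect, here is a minimal sketch using the public `urllib.robotparser.RobotFileParser` API. The rules and URLs are hypothetical, made up for illustration, and are not taken from the PR or its tests. The first two checks should behave the same with or without this fix. The loop over the `/search` URLs probes the trailing "?" handling, where the result is expected to depend on whether the running Python includes this change, so no expected output is given for it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, made up for illustration only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /search?
"""


def build_parser(text):
    """Build a parser from in-memory robots.txt text (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Plain prefix rules: not affected by the trailing "?" change.
blocked = parser.can_fetch("*", "https://example.com/private/data")
allowed = parser.can_fetch("*", "https://example.com/public/page")
print("private allowed:", blocked)
print("public allowed:", allowed)

# Trailing "?" handling: results here may differ between Python builds
# with and without this fix, so they are printed rather than asserted.
for url in (
    "https://example.com/search",
    "https://example.com/search?",
    "https://example.com/search?q=1",
):
    print(url, parser.can_fetch("*", url))
```

Running the same script on interpreters before and after the backport is one way to check whether a crawler's behavior changes for a given site's rules.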

@serhiy-storchaka serhiy-storchaka merged commit 509ea39 into python:3.13 Sep 5, 2025
41 checks passed