KEMBAR78
gh-46236: PyUnicode docs improvements by encukou · Pull Request #129966 · python/cpython · GitHub
Skip to content

Conversation

@encukou
Copy link
Member

@encukou encukou commented Feb 10, 2025

While planning to deprecate the PyASCIIObject structs, I found some docs improvements. IMO, these should be applied (and backported) regardless of what happens to PyASCIIObject.

Move deprecated PyUnicode API docs to new section

I intend to add more here. IMO, it's good practice to separate deprecated API out like this.

  • Move Py_UNICODE here.
  • Formally soft-deprecate PyUnicode_READY, and move it here.
  • Document and soft-deprecate PyUnicode_IS_READY, and move it here.

Document PyUnicode_IS_ASCII, PyUnicode_CHECK_INTERNED

PyUnicode_New: Clarify requirements for "fresh" strings

Also, refer to PyUnicode_New wrom all the "writers" for which you need to follow the requirements

PyUnicodeWriter_DecodeUTF8Stateful: Link "error-handlers"


📚 Documentation preview 📚: https://cpython-previews--129966.org.readthedocs.build/

Move Py_UNICODE to a new "Deprecated API" section.

Formally soft-deprecate PyUnicode_READY, and move it

Document and soft-deprecate PyUnicode_IS_READY, and move it
@encukou
Copy link
Member Author

encukou commented Feb 26, 2025

@vstinner @serhiy-storchaka Do these changes look OK to you?

.. c:function:: unsigned int PyUnicode_IS_ASCII(PyObject *unicode)
Return true if the string only contains ASCII characters.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Return true if the string only contains ASCII characters.
Return non-zero if the string only contains ASCII characters.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"Return true" is common for such functions (see for example PyUnicode_Check()).

@serhiy-storchaka serhiy-storchaka self-requested a review February 27, 2025 08:32
Copy link
Member

@serhiy-storchaka serhiy-storchaka left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have a couple of suggestion, but in general LGTM. 👍

.. c:function:: unsigned int PyUnicode_IS_ASCII(PyObject *unicode)
Return true if the string only contains ASCII characters.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"Return true" is common for such functions (see for example PyUnicode_Check()).

.. c:function:: unsigned int PyUnicode_CHECK_INTERNED(PyObject *str)
Return a non-zero value if *str* is interned, zero if not.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Most documentation uses "Return true" (60 occurrences), some use "Return non-zero" (13 occurrences) and one uses "Return a non-zero".

In this case using "Return a non-zero" looks justified, as it may encode additional information.

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
Copy link
Member

@vstinner vstinner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@encukou
Copy link
Member Author

encukou commented Feb 28, 2025

Thank you for the reviews!

@encukou encukou merged commit e21863c into python:main Feb 28, 2025
42 checks passed
@github-project-automation github-project-automation bot moved this from Todo to Done in Docs PRs Feb 28, 2025
@encukou encukou deleted the pyunicode-docs branch February 28, 2025 14:11
seehwan pushed a commit to seehwan/cpython that referenced this pull request Apr 16, 2025
Move deprecated PyUnicode API docs to new section

Move Py_UNICODE to a new "Deprecated API" section.

Formally soft-deprecate PyUnicode_READY, and move it

Document and soft-deprecate PyUnicode_IS_READY, and move it

Document PyUnicode_IS_ASCII, PyUnicode_CHECK_INTERNED

PyUnicode_New docs: Clarify requirements for "fresh" strings

PyUnicodeWriter_DecodeUTF8Stateful: Link "error-handlers"



Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

docs Documentation in the Doc dir skip news

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants