KEMBAR78
gh-128646: Implement GzipFile.readinto() functions by effigies · Pull Request #128647 · python/cpython · GitHub
Skip to content

Conversation

@effigies
Copy link
Contributor

@effigies effigies commented Jan 8, 2025

This PR adds implementations of .readinto() and .readinto1() for GzipFile which pass the received buffer to the underlying BufferedReader. Without this, the base BufferedIOReader implementation is used, which simply calls .read() and copies into the target buffer.

I don't know a good way to write a test to check memory usage, but see #128646 for a profile that demonstrates the effectiveness of the patch.

@ghost
Copy link

ghost commented Jan 8, 2025

All commit authors signed the Contributor License Agreement.
CLA signed

@effigies effigies force-pushed the fix/gzipfile-readinto branch from 4eea459 to f272a9c Compare January 8, 2025 20:45
@effigies effigies force-pushed the fix/gzipfile-readinto branch from f272a9c to db7f971 Compare January 8, 2025 21:02
@zware
Copy link
Member

zware commented Feb 12, 2025

Disclaimer: not a subject matter expert.

The change looks fine to me. However, I can't shake the feeling that this seems like it ought to be fixable somewhere in io, though I have not actually seen how.

I would like to see some basic tests added for GzipFile.readinto{,1}; they currently seem not to be tested at all.

@effigies
Copy link
Contributor Author

effigies commented Feb 12, 2025

Tests added. I adapted test_read{,1}, but using random data to ensure that the contents could not be read in one call.

I couldn't see how to make either of them a regression test for this issue.

I can't shake the feeling that this seems like it ought to be fixable somewhere in io, though I have not actually seen how.

Here's a possible diff for `_pyio.py`:
diff --git a/Lib/_pyio.py b/Lib/_pyio.py
index 023478aa78c..9197c8628d5 100644
--- a/Lib/_pyio.py
+++ b/Lib/_pyio.py
@@ -721,15 +721,17 @@ def _readinto(self, b, read1):
             b = memoryview(b)
         b = b.cast('B')
 
-        if read1:
+        nread = 0
+        while True:
             data = self.read1(len(b))
-        else:
-            data = self.read(len(b))
-        n = len(data)
-
-        b[:n] = data
+            n = len(data)
+            b[:n] = data
+            nread += n
+            if read1:
+                break
+            b = b[n:]
 
-        return n
+        return nread
 
     def write(self, b):
         """Write the given bytes buffer to the IO stream.

I could try updating the C implementation along those lines as well...

@cmaloney
Copy link
Contributor

cmaloney commented Feb 12, 2025

Not a Python core dev, but have been spending a lot of time around readinto. The C and _pyio BufferedIOReader changes look like they'd be quite a bit more intricate, but definitely where I'd prefer. Keeping in gzip makes it well contained / less likely to change behavior of other code (ex. code where the underlying buffer implemented readinto, but it was not getting run and is broken in some way).

Currently the C has a generic _bufferediobase_readinto_generic, which already handles 2 variants via flag (readinto vs. readinto1), and this would require adding two more to the combinatorics: Underlying stream has readinto, readinto1 vs only has read. I think it's important that continues working if the underlying stream only has read (Common for transformers of data). I definitely think interesting, but increases scope a bit / the gzip change is well contained and a move in the right direction on its own to me.

For testing, the ideal would be something like Go's testing.AllocsPerRun I've found really useful in the past. If you just need to check "there should be 2 read system calls", test.support.strace_helper could help if on Linux (sample).

This adds two more copies of the snippet:

        if self.mode != READ:
            import errno
            raise OSError(errno.EBADF, "read() on write-only GzipFile object")

That now has at least 4 copies, probably should be turned into a helper function

@effigies
Copy link
Contributor Author

I adapted my above Python code to the C implementation:
diff --git a/Modules/_io/bufferedio.c b/Modules/_io/bufferedio.c
index e45323c93a1..7cdd8c28283 100644
--- a/Modules/_io/bufferedio.c
+++ b/Modules/_io/bufferedio.c
@@ -49,35 +49,46 @@ static PyObject *
 _bufferediobase_readinto_generic(PyObject *self, Py_buffer *buffer, char readinto1)
 {
     Py_ssize_t len;
+    Py_ssize_t nread;
     PyObject *data;
 
-    PyObject *attr = readinto1
-        ? &_Py_ID(read1)
-        : &_Py_ID(read);
-    data = _PyObject_CallMethod(self, attr, "n", buffer->len);
-    if (data == NULL)
-        return NULL;
+    PyObject *attr = &_Py_ID(read1);
 
-    if (!PyBytes_Check(data)) {
-        Py_DECREF(data);
-        PyErr_SetString(PyExc_TypeError, "read() should return bytes");
-        return NULL;
-    }
+    nread = 0;
+    do {
+        data = _PyObject_CallMethod(self, attr, "n", buffer->len - nread);
+        if (data == NULL)
+            return NULL;
 
-    len = PyBytes_GET_SIZE(data);
-    if (len > buffer->len) {
-        PyErr_Format(PyExc_ValueError,
-                     "read() returned too much data: "
-                     "%zd bytes requested, %zd returned",
-                     buffer->len, len);
-        Py_DECREF(data);
-        return NULL;
-    }
-    memcpy(buffer->buf, PyBytes_AS_STRING(data), len);
+        if (!PyBytes_Check(data)) {
+            Py_DECREF(data);
+            PyErr_SetString(PyExc_TypeError, "read() should return bytes");
+            return NULL;
+        }
+
+        len = PyBytes_GET_SIZE(data);
+        if (len == 0) {
+            break;
+        }
+        if (nread + len > buffer->len) {
+            PyErr_Format(PyExc_ValueError,
+                         "read1() returned too much data: "
+                         "%zd bytes requested, %zd returned",
+                         buffer->len, nread + len);
+            Py_DECREF(data);
+            return NULL;
+        }
+        memcpy(buffer->buf + nread, PyBytes_AS_STRING(data), len);
+        nread += len;
+
+        if (readinto1) {
+            break;
+        }
+    } while (nread < buffer->len);
 
     Py_DECREF(data);
 
-    return PyLong_FromSsize_t(len);
+    return PyLong_FromSsize_t(nread);
 }
 
 /*[clinic input]

Using the memray test in #128646, it doesn't achieve the desired result. I suspect this is because it still has to go through self.read1() (GzipFile) -> self._buffer.read1 (BufferedReader), which for some reason does not limit itself to one read, instead filling the buffer.

That does seem like a bigger problem, and I'd be happy to tackle it in another PR, but that could be a rabbit hole, while this is an achievable win right now.

@effigies
Copy link
Contributor Author

Okay, I got nerd-sniped and have now convinced myself that it is absolutely not worth the trouble of resolving this in _io/_pyio.

Even if we can resolve this in _io/_pyio, any attempt to use the base-class implementations will result in more indirection, not less, as the implementation will eventually have to come around to GzipFile.read() or GzipFile.read1(). In addition to simplifying the call stack, this PR makes reading the code simpler as well, as the chain of delegation is now more straightforward.

This adds two more copies of the snippet:

        if self.mode != READ:
            import errno
            raise OSError(errno.EBADF, "read() on write-only GzipFile object")

We could just use self._check_can_read():

class BaseStream(io.BufferedIOBase):
"""Mode-checking helper functions."""
def _check_not_closed(self):
if self.closed:
raise ValueError("I/O operation on closed file")
def _check_can_read(self):
if not self.readable():
raise io.UnsupportedOperation("File not open for reading")

That exception subclasses OSError and ValueError, but doesn't set EBADF. I will include that change in this PR if asked by a core dev, but I will not expand the scope at present.

@cmaloney
Copy link
Contributor

cmaloney commented Feb 12, 2025

👍 to not doing readinto in _io/_pyio for now. I think it would be worthwhile to have readinto across this sort of object, but is very much a large scale tangential project (And one I've come close to but am explicitly avoiding myself). For better or worse, the errno.EBADF is effectively encapsulated in the API, suspect would need a new _check_can_read_ebadf. I really don't like the expansion of the deferred import errno from 2 -> 4 times is good... (It saves import time if everything goes well, at the cost of explicit code complexity).

@effigies
Copy link
Contributor Author

@zware Just checking: Are you waiting on something from me, or are you leaving approval to a subject matter expert?

Copy link
Contributor

@cmaloney cmaloney left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is a good step for now. I looked at doing the BufferedIOBase changes, and it's complicated as the current code says "readinto is always implemented on top of read", and that would need to be changed to "read is implemented on top of readinto, if readinto is provided". In both cases, readall (size=-1) could be implemented on top of readinto, which likely would be a win, but not the problem this issue is looking at / trying to solve.

@zware zware added the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Mar 7, 2025
@bedevere-bot
Copy link

🤖 New build scheduled with the buildbot fleet by @zware for commit 9ded50d 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F128647%2Fmerge

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

@bedevere-bot bedevere-bot removed the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Mar 7, 2025
@zware
Copy link
Member

zware commented Mar 7, 2025

@zware Just checking: Are you waiting on something from me, or are you leaving approval to a subject matter expert?

I was hoping an actual expert would materialize, but at this point I'm ready to just go ahead with this as long as the buildbots are happy with it. I expect to merge this later today, but if I haven't made it back by Wednesday1 please ping me to get this in before 3.14.0a6.

Footnotes

  1. "Time is an illusion, [free] time doubly so"

@zware zware merged commit 72e5b25 into python:main Mar 8, 2025
115 of 116 checks passed
@bedevere-bot
Copy link

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

Hi! The buildbot aarch64 Fedora Stable Refleaks 3.x (tier-2) has failed when building commit 72e5b25.

What do you need to do:

  1. Don't panic.
  2. Check the buildbot page in the devguide if you don't know what the buildbots are or how they work.
  3. Go to the page of the buildbot that failed (https://buildbot.python.org/#/builders/123/builds/789) and take a look at the build logs.
  4. Check if the failure is related to this commit (72e5b25) or if it is a false positive.
  5. If the failure is related to this commit, please, reflect that on the issue and make a new Pull Request with a fix.

You can take a look at the buildbot page here:

https://buildbot.python.org/#/builders/123/builds/789

Failed tests:

  • test_perf_profiler

Failed subtests:

  • test_python_calls_appear_in_the_stack_if_perf_activated - test.test_perf_profiler.TestPerfProfilerWithDwarf.test_python_calls_appear_in_the_stack_if_perf_activated

Summary of the results of the build (if available):

==

Click to see traceback logs
Traceback (most recent call last):
  File "/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/Lib/test/test_perf_profiler.py", line 362, in test_python_calls_appear_in_the_stack_if_perf_activated
    self.assertEqual(stderr, "")
    ~~~~~~~~~~~~~~~~^^^^^^^^^^^^
AssertionError: 'Warning:\nProcessed 613 events and lost 1[34 chars]\n\n' != ''
- Warning:
- Processed 613 events and lost 1 chunks!
- 
- Check IO/CPU overload!
- 


Traceback (most recent call last):
  File "/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/Lib/test/test_perf_profiler.py", line 364, in test_python_calls_appear_in_the_stack_if_perf_activated
    self.assertIn(f"py::foo:{script}", stdout)
    ~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AssertionError: 'py::foo:/tmp/test_python_605b3nmu/tmpjyziy3xt/perftest.py' not found in 'python 3751067 276471.958865:          1 cycles:Pu: \n\t    ffff82e50ac0 _start+0x0 (/usr/lib/ld-linux-aarch64.so.1)\n\npython 3751067 276471.958890:          1 cycles:Pu: \n\tffffc95efcdffc78 [unknown] ([unknown])\n\tffffc95efce0049c [unknown] ([unknown])\n\tffffc95efb9a15e4 [unknown] ([unknown])\n\t    ffff82e50ac0 _start+0x0 (/usr/lib/ld-linux-aarch64.so.1)\n\npython 3751067 276471.959052:          1 cycles:Pu: \n\t    ffff82e3b954 _dl_init_paths+0x154 (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e4df33 call_init_paths+0x1133 (inlined)\n\t    ffff82e4df33 dl_main+0x1133 (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e4b5ff _dl_sysdep_start+0x1df (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e4cb17 _dl_start_final+0x5ab (inlined)\n\t    ffff82e4cb17 _dl_start+0x5ab (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e50ad3 _start+0x13 (/usr/lib/ld-linux-aarch64.so.1)\n\npython 3751067 276471.959068:        327 cycles:Pu: \n\t    ffff82e3bb30 _dl_init_paths+0x330 (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e4df33 call_init_paths+0x1133 (inlined)\n\t    ffff82e4df33 dl_main+0x1133 (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e4b5ff _dl_sysdep_start+0x1df (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e4cb17 _dl_start_final+0x5ab (inlined)\n\t    ffff82e4cb17 _dl_start+0x5ab (/usr/lib/ld-linux-aarch64.so.1)\n\t    ffff82e50ad3 _start+0x13 (/usr/lib/ld-linux-aarch64.so.1)\n\npython 3751067 276471.960104:       2836 cycles:Pu: \n\t    ffff82cb4c38 sbrk+0x18 (inlined)\n\t    ffff82c5aeff __default_morecore@GLIBC_2.17+0x1f (/usr/lib64/libc.so.6)\n\t    ffff82c5bddb sysmalloc+0x3db (/usr/lib64/libc.so.6)\n\t    ffff82c5cf37 _int_malloc+0xd97 (/usr/lib64/libc.so.6)\n\t    ffff82c5d037 tcache_init.part.0+0x47 (/usr/lib64/libc.so.6)\n\t    ffff82c5dac3 tcache_init+0x183 (inlined)\n\t    ffff82c5dac3 malloc+0x183 (/usr/lib64/libc.so.6)\n\t          507ab7 _PyMem_RawMalloc+0x17 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          5070fb _PyMem_DebugRawAlloc+0xbf (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          50714b _PyMem_DebugRawMalloc+0x17 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          51d88b PyMem_RawMalloc+0x23 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          51e70b _PyMem_RawStrdup+0x27 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6675a7 _PyPreConfig_Read+0x67 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          670603 _Py_PreInitializeFromPyArgv+0x103 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6a1777 pymain_init+0x57 (/home/buildbot/buildarea/3.x.cstratak-fedora-stab
    559b23 unicode_hash+0x77 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          559b87 hashtable_unicode_hash+0xb (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          641723 _Py_hashtable_get_entry_generic+0x23 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          641b97 _Py_hashtable_get+0xf (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          55aeab intern_static+0x67 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          575343 _PyUnicode_InternStatic+0x27 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          57f8c3 _PyUnicode_InitStaticStrings+0xa523 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          58d76b init_global_interned_strings+0x77 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          58d9b3 _PyUnicode_InitGlobalObjects+0x63 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6687ef pycore_init_global_objects+0x23 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          668d97 pycore_interp_init+0x2b (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          669073 pyinit_config+0xaf (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6708d3 pyinit_core+0xdb (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6709ab Py_InitializeFromConfig+0x97 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6a181f pymain_init+0xff (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6a18ef pymain_main+0xf (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          6a1983 Py_BytesMain+0x27 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          41f0b7 main+0xb (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t    ffff82be625b __libc_start_call_main+0x7b (/usr/lib64/libc.so.6)\n\t    ffff82be633b __libc_start_main@GLIBC_2.17+0x9b (inlined)\n\t          41efaf _start+0x2f (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\npython 3751067 276471.962552:    1808671 cycles:Pu: \n\t          5022c4 reftotal_add+0xc (inlined)\n\t          5022c4 _Py_IncRefTotal+0xc (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          502d3b _Py_NewReference+0x23 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          54de5f _PyObject_Init+0x43 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          55b907 PyUnicode_New+0xc3 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          56ecd7 unicode_decode_utf8+0x113 (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          56ee4b PyUnicode_DecodeUTF8Stateful+0x1b (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          56ef03 PyUnicode_FromString+0x2b (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          58df2f PyUnicode_InternFromString+0xb (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          4b21b7 descr_new+0x3f (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          4b33b3 PyDescr_NewWrapper+0x2b (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          5361bb add_operators+0x5f (/home/buildbot/buildarea/3.x.cstratak-fedora-stable-aarch64.refleak/build/python)\n\t          536517 type_ready_fill_dict+0x13 (/home/buildbot/buildarea

@cmaloney
Copy link
Contributor

cmaloney commented Mar 8, 2025

The bot failure looks like an unrelated failure; the test_perf_profiler test doesn't rely on gzip (directly, or the python code it is running under perf from what I can find). It looks like there were supposed to be a number of Python code frames shown by perf but there weren't any.

@effigies effigies deleted the fix/gzipfile-readinto branch March 8, 2025 09:26
@effigies
Copy link
Contributor Author

effigies commented Mar 10, 2025

@Yhg1s I just want to check if this can be considered for backporting to 3.12 and 3.13. It resolves what I consider a bug, but you may consider it an optimization.

@vstinner
Copy link
Member

@Yhg1s I just want to check if this can be considered for backporting to 3.12 and 3.13. It resolves what I consider a bug, but you may consider it an optimization.

IMO it's an optimization and it should not be backported.

seehwan pushed a commit to seehwan/cpython that referenced this pull request Apr 16, 2025
…28647)

The new methods simply delegate to the underlying buffer, much like the existing GzipFile.read[1] methods.  This avoids extra allocations caused by the BufferedIOBase.readinto implementation previously used.

This commit also factors out a common readability check rather than copying it an additional two times.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants