Skip to content

Commit 0fd4fd4

Browse files
[3.13] gh-148529: Minor improvements of the struct module documentation (GH-148565) (GH-149072)
* Document that 's' and 'p' accept bytes and bytearray. * Fix some footnotes. * Clarify that "string" is a byte string. * Fix the module docstring. (cherry picked from commit 3e5a3cb)
1 parent ab5ef98 commit 0fd4fd4

2 files changed

Lines changed: 36 additions & 33 deletions

File tree

Doc/library/struct.rst

Lines changed: 24 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -227,32 +227,32 @@ platform-dependent.
227227
+--------+--------------------------+--------------------+----------------+------------+
228228
| ``c`` | :c:expr:`char` | bytes of length 1 | 1 | |
229229
+--------+--------------------------+--------------------+----------------+------------+
230-
| ``b`` | :c:expr:`signed char` | integer | 1 | \(1), \(2) |
230+
| ``b`` | :c:expr:`signed char` | int | 1 | \(2) |
231231
+--------+--------------------------+--------------------+----------------+------------+
232-
| ``B`` | :c:expr:`unsigned char` | integer | 1 | \(2) |
232+
| ``B`` | :c:expr:`unsigned char` | int | 1 | \(2) |
233233
+--------+--------------------------+--------------------+----------------+------------+
234234
| ``?`` | :c:expr:`_Bool` | bool | 1 | \(1) |
235235
+--------+--------------------------+--------------------+----------------+------------+
236-
| ``h`` | :c:expr:`short` | integer | 2 | \(2) |
236+
| ``h`` | :c:expr:`short` | int | 2 | \(2) |
237237
+--------+--------------------------+--------------------+----------------+------------+
238-
| ``H`` | :c:expr:`unsigned short` | integer | 2 | \(2) |
238+
| ``H`` | :c:expr:`unsigned short` | int | 2 | \(2) |
239239
+--------+--------------------------+--------------------+----------------+------------+
240-
| ``i`` | :c:expr:`int` | integer | 4 | \(2) |
240+
| ``i`` | :c:expr:`int` | int | 4 | \(2) |
241241
+--------+--------------------------+--------------------+----------------+------------+
242-
| ``I`` | :c:expr:`unsigned int` | integer | 4 | \(2) |
242+
| ``I`` | :c:expr:`unsigned int` | int | 4 | \(2) |
243243
+--------+--------------------------+--------------------+----------------+------------+
244-
| ``l`` | :c:expr:`long` | integer | 4 | \(2) |
244+
| ``l`` | :c:expr:`long` | int | 4 | \(2) |
245245
+--------+--------------------------+--------------------+----------------+------------+
246-
| ``L`` | :c:expr:`unsigned long` | integer | 4 | \(2) |
246+
| ``L`` | :c:expr:`unsigned long` | int | 4 | \(2) |
247247
+--------+--------------------------+--------------------+----------------+------------+
248-
| ``q`` | :c:expr:`long long` | integer | 8 | \(2) |
248+
| ``q`` | :c:expr:`long long` | int | 8 | \(2) |
249249
+--------+--------------------------+--------------------+----------------+------------+
250-
| ``Q`` | :c:expr:`unsigned long | integer | 8 | \(2) |
250+
| ``Q`` | :c:expr:`unsigned long | int | 8 | \(2) |
251251
| | long` | | | |
252252
+--------+--------------------------+--------------------+----------------+------------+
253-
| ``n`` | :c:type:`ssize_t` | integer | | \(3) |
253+
| ``n`` | :c:type:`ssize_t` | int | | \(2), \(3) |
254254
+--------+--------------------------+--------------------+----------------+------------+
255-
| ``N`` | :c:type:`size_t` | integer | | \(3) |
255+
| ``N`` | :c:type:`size_t` | int | | \(2), \(3) |
256256
+--------+--------------------------+--------------------+----------------+------------+
257257
| ``e`` | :c:expr:`_Float16` | float | 2 | \(4), \(6) |
258258
+--------+--------------------------+--------------------+----------------+------------+
@@ -264,7 +264,7 @@ platform-dependent.
264264
+--------+--------------------------+--------------------+----------------+------------+
265265
| ``p`` | :c:expr:`char[]` | bytes | | \(8) |
266266
+--------+--------------------------+--------------------+----------------+------------+
267-
| ``P`` | :c:expr:`void \*` | integer | | \(5) |
267+
| ``P`` | :c:expr:`void \*` | int | | \(2), \(5) |
268268
+--------+--------------------------+--------------------+----------------+------------+
269269

270270
.. versionchanged:: 3.3
@@ -329,27 +329,31 @@ Notes:
329329
The ``'p'`` format character encodes a "Pascal string", meaning a short
330330
variable-length string stored in a *fixed number of bytes*, given by the count.
331331
The first byte stored is the length of the string, or 255, whichever is
332-
smaller. The bytes of the string follow. If the string passed in to
332+
smaller. The bytes of the string follow. If the byte string passed in to
333333
:func:`pack` is too long (longer than the count minus 1), only the leading
334-
``count-1`` bytes of the string are stored. If the string is shorter than
334+
``count-1`` bytes of the string are stored. If the byte string is shorter than
335335
``count-1``, it is padded with null bytes so that exactly count bytes in all
336336
are used. Note that for :func:`unpack`, the ``'p'`` format character consumes
337-
``count`` bytes, but that the string returned can never contain more than 255
337+
``count`` bytes, but that the :class:`!bytes` object returned can never contain more than 255
338338
bytes.
339+
When packing, arguments of types :class:`bytes` and :class:`bytearray`
340+
are accepted.
339341

340342
(9)
341343
For the ``'s'`` format character, the count is interpreted as the length of the
342-
bytes, not a repeat count like for the other format characters; for example,
344+
byte string, not a repeat count like for the other format characters; for example,
343345
``'10s'`` means a single 10-byte string mapping to or from a single
344346
Python byte string, while ``'10c'`` means 10
345347
separate one byte character elements (e.g., ``cccccccccc``) mapping
346348
to or from ten different Python byte objects. (See :ref:`struct-examples`
347349
for a concrete demonstration of the difference.)
348-
If a count is not given, it defaults to 1. For packing, the string is
350+
If a count is not given, it defaults to 1. For packing, the byte string is
349351
truncated or padded with null bytes as appropriate to make it fit. For
350-
unpacking, the resulting bytes object always has exactly the specified number
351-
of bytes. As a special case, ``'0s'`` means a single, empty string (while
352+
unpacking, the resulting :class:`!bytes` object always has exactly the specified number
353+
of bytes. As a special case, ``'0s'`` means a single, empty byte string (while
352354
``'0c'`` means 0 characters).
355+
When packing, arguments of types :class:`bytes` and :class:`bytearray`
356+
are accepted.
353357

354358
A format character may be preceded by an integral repeat count. For example,
355359
the format string ``'4h'`` means exactly the same as ``'hhhh'``.

Modules/_struct.c

Lines changed: 12 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
/* struct module -- pack values into and (out of) bytes objects */
22

33
/* New version supporting byte order, alignment and size options,
4-
character strings, and unsigned numbers */
4+
byte strings, and unsigned numbers */
55

66
#ifndef Py_BUILD_CORE_BUILTIN
77
# define Py_BUILD_CORE_MODULE 1
@@ -1947,7 +1947,7 @@ Struct_iter_unpack(PyStructObject *self, PyObject *buffer)
19471947
*
19481948
* Takes a struct object, a tuple of arguments, and offset in that tuple of
19491949
* argument for where to start processing the arguments for packing, and a
1950-
* character buffer for writing the packed string. The caller must insure
1950+
* character buffer for writing the packed data. The caller must ensure
19511951
* that the buffer may contain the required length for packing the arguments.
19521952
* 0 is returned on success, 1 is returned if there is an error.
19531953
*
@@ -2484,8 +2484,8 @@ static struct PyMethodDef module_functions[] = {
24842484

24852485
PyDoc_STRVAR(module_doc,
24862486
"Functions to convert between Python values and C structs.\n\
2487-
Python bytes objects are used to hold the data representing the C struct\n\
2488-
and also as format strings (explained below) to describe the layout of data\n\
2487+
Python bytes objects are used to hold the data representing the C struct.\n\
2488+
The format string (explained below) describes the layout of data\n\
24892489
in the C struct.\n\
24902490
\n\
24912491
The optional first format char indicates byte order, size and alignment:\n\
@@ -2495,18 +2495,17 @@ The optional first format char indicates byte order, size and alignment:\n\
24952495
>: big-endian, std. size & alignment\n\
24962496
!: same as >\n\
24972497
\n\
2498-
The remaining chars indicate types of args and must match exactly;\n\
2498+
The remaining characters indicate types of args and must match exactly;\n\
24992499
these can be preceded by a decimal repeat count:\n\
2500-
x: pad byte (no data); c:char; b:signed byte; B:unsigned byte;\n\
2501-
?:_Bool; h:short; H:unsigned short; i:int; I:unsigned int;\n\
2502-
l:long; L:unsigned long; f:float; d:double; e:half-float.\n\
2500+
x: pad byte (no data); c: char; b: signed byte; B: unsigned byte;\n\
2501+
?: _Bool; h: short; H: unsigned short; i: int; I: unsigned int;\n\
2502+
l: long; L: unsigned long; q: long long; Q: unsigned long long;\n\
2503+
f: float; d: double; e: half-float;\n\
25032504
Special cases (preceding decimal count indicates length):\n\
2504-
s:string (array of char); p: pascal string (with count byte).\n\
2505+
s: byte string (array of char); p: Pascal string (with count byte).\n\
25052506
Special cases (only available in native format):\n\
2506-
n:ssize_t; N:size_t;\n\
2507-
P:an integer type that is wide enough to hold a pointer.\n\
2508-
Special case (not in native mode unless 'long long' in platform C):\n\
2509-
q:long long; Q:unsigned long long\n\
2507+
n: ssize_t; N: size_t;\n\
2508+
P: an integer type that is wide enough to hold a pointer.\n\
25102509
Whitespace between formats is ignored.\n\
25112510
\n\
25122511
The variable struct.error is an exception raised on errors.\n");

0 commit comments

Comments
 (0)