File: mbstate_t.3type

package info (click to toggle)
manpages 6.16-1
  • links: PTS, VCS
  • area: main
  • in suites: sid
  • size: 20,476 kB
  • sloc: sh: 576; python: 222; perl: 191; makefile: 29; lisp: 22
file content (84 lines) | stat: -rw-r--r-- 2,057 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
.\" Copyright, the authors of the Linux man-pages project
.\"
.\" SPDX-License-Identifier: GPL-2.0-or-later
.\"
.TH mbstate_t 3type 2025-10-29 "Linux man-pages (unreleased)"
.SH NAME
mbstate_t
\-
multi-byte-character conversion state
.SH LIBRARY
Standard C library
.RI ( libc )
.SH SYNOPSIS
.nf
.B #include <wchar.h>
.P
.BR typedef " /* ... */  " mbstate_t;
.fi
.SH DESCRIPTION
Character conversion between
the multibyte representation
and the wide character representation
uses conversion state,
of type
.IR mbstate_t .
Conversion of a string uses a finite-state machine;
when it is interrupted after the complete conversion of a number of characters,
it may need to save a state for processing the remaining characters.
Such a conversion state
is needed for the sake of encodings such as ISO/IEC\~2022 and UTF-7.
.P
The initial state is the state at the beginning of conversion of a string.
There are two kinds of state:
the one used by multibyte to wide character conversion functions,
such as
.BR mbsrtowcs (3),
and the one used by wide character to multibyte conversion functions,
such as
.BR wcsrtombs (3),
but they both fit in a
.IR mbstate_t ,
and they both have the same representation for an initial state.
.P
For 8-bit encodings,
all states are equivalent to the initial state.
For multibyte encodings like UTF-8, EUC-*, BIG5, or SJIS,
the wide character to multibyte conversion functions
never produce non-initial states,
but the multibyte to wide-character conversion functions like
.BR mbrtowc (3)
do
produce non-initial states when interrupted in the middle of a character.
.P
One possible way to create an
.I mbstate_t
in initial state is to set it to zero:
.P
.in +4n
.EX
mbstate_t state;
memset(&state, 0, sizeof(state));
.EE
.in
.P
On Linux,
the following works as well,
but might generate compiler warnings:
.P
.in +4n
.EX
mbstate_t state = { 0 };
.EE
.in
.SH STANDARDS
C11, POSIX.1-2024.
.SH HISTORY
C99, POSIX.1-2001.
.SH SEE ALSO
.BR mbrlen (3),
.BR mbrtowc (3),
.BR mbsinit (3),
.BR mbsrtowcs (3),
.BR wcrtomb (3),
.BR wcsrtombs (3)