File: String-encoding.html

package info (click to toggle)
octave 10.3.0-1
  • links: PTS, VCS
  • area: main
  • in suites:
  • size: 145,388 kB
  • sloc: cpp: 335,976; ansic: 82,241; fortran: 20,963; objc: 9,402; sh: 8,756; yacc: 4,392; lex: 4,333; perl: 1,544; java: 1,366; awk: 1,259; makefile: 659; xml: 192
file content (101 lines) | stat: -rw-r--r-- 6,247 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
<!DOCTYPE html>
<html>
<!-- Created by GNU Texinfo 7.1.1, https://www.gnu.org/software/texinfo/ -->
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<title>String encoding (GNU Octave (version 10.3.0))</title>

<meta name="description" content="String encoding (GNU Octave (version 10.3.0))">
<meta name="keywords" content="String encoding (GNU Octave (version 10.3.0))">
<meta name="resource-type" content="document">
<meta name="distribution" content="global">
<meta name="Generator" content="makeinfo">
<meta name="viewport" content="width=device-width,initial-scale=1">

<link href="index.html" rel="start" title="Top">
<link href="Concept-Index.html" rel="index" title="Concept Index">
<link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
<link href="Converting-Strings.html" rel="up" title="Converting Strings">
<link href="Numerical-Data-and-Strings.html" rel="next" title="Numerical Data and Strings">
<style type="text/css">
<!--
a.copiable-link {visibility: hidden; text-decoration: none; line-height: 0em}
span:hover a.copiable-link {visibility: visible}
strong.def-name {font-family: monospace; font-weight: bold; font-size: larger}
-->
</style>
<link rel="stylesheet" type="text/css" href="octave.css">


</head>

<body lang="en">
<div class="subsection-level-extent" id="String-encoding">
<div class="nav-panel">
<p>
Next: <a href="Numerical-Data-and-Strings.html" accesskey="n" rel="next">Numerical Data and Strings</a>, Up: <a href="Converting-Strings.html" accesskey="u" rel="up">Converting Strings</a> &nbsp; [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Concept-Index.html" title="Index" rel="index">Index</a>]</p>
</div>
<hr>
<h4 class="subsection" id="String-encoding-1"><span>5.4.1 String encoding<a class="copiable-link" href="#String-encoding-1"> &para;</a></span></h4>

<a class="anchor" id="XREFunicode2native"></a><span style="display:block; margin-top:-4.5ex;">&nbsp;</span>


<dl class="first-deftypefn">
<dt class="deftypefn" id="index-unicode2native"><span><code class="def-type"><var class="var">native_bytes</var> =</code> <strong class="def-name">unicode2native</strong> <code class="def-code-arguments">(<var class="var">utf8_str</var>, <var class="var">codepage</var>)</code><a class="copiable-link" href="#index-unicode2native"> &para;</a></span></dt>
<dt class="deftypefnx def-cmd-deftypefn" id="index-unicode2native-1"><span><code class="def-type"><var class="var">native_bytes</var> =</code> <strong class="def-name">unicode2native</strong> <code class="def-code-arguments">(<var class="var">utf8_str</var>)</code><a class="copiable-link" href="#index-unicode2native-1"> &para;</a></span></dt>
<dd><p>Convert UTF-8 string <var class="var">utf8_str</var> to byte stream using <var class="var">codepage</var>.
</p>
<p>The character vector <var class="var">utf8_str</var> is converted to a byte stream
<var class="var">native_bytes</var> using the code page given by <var class="var">codepage</var>.  The
string <var class="var">codepage</var> must be an identifier of a valid code page.
Examples for valid code pages are <code class="code">&quot;ISO-8859-1&quot;</code>,
<code class="code">&quot;Shift-JIS&quot;</code>, or <code class="code">&quot;UTF-16&quot;</code>.  For a list of supported code
pages, see <a class="url" href="https://www.gnu.org/software/libiconv">https://www.gnu.org/software/libiconv</a>.  If <var class="var">codepage</var>
is omitted or empty, the system default codepage is used.
</p>
<p>If any of the characters cannot be mapped into the codepage <var class="var">codepage</var>,
they are replaced with the appropriate substitution sequence for that
codepage.
</p>

<p><strong class="strong">See also:</strong> <a class="ref" href="#XREFnative2unicode">native2unicode</a>.
</p></dd></dl>


<a class="anchor" id="XREFnative2unicode"></a><span style="display:block; margin-top:-4.5ex;">&nbsp;</span>


<dl class="first-deftypefn">
<dt class="deftypefn" id="index-native2unicode"><span><code class="def-type"><var class="var">utf8_str</var> =</code> <strong class="def-name">native2unicode</strong> <code class="def-code-arguments">(<var class="var">native_bytes</var>, <var class="var">codepage</var>)</code><a class="copiable-link" href="#index-native2unicode"> &para;</a></span></dt>
<dt class="deftypefnx def-cmd-deftypefn" id="index-native2unicode-1"><span><code class="def-type"><var class="var">utf8_str</var> =</code> <strong class="def-name">native2unicode</strong> <code class="def-code-arguments">(<var class="var">native_bytes</var>)</code><a class="copiable-link" href="#index-native2unicode-1"> &para;</a></span></dt>
<dd><p>Convert byte stream <var class="var">native_bytes</var> to UTF-8 using <var class="var">codepage</var>.
</p>
<p>The numbers in the vector <var class="var">native_bytes</var> are rounded and clipped to
integers between 0 and 255.  This byte stream is then mapped into the
code page given by the string <var class="var">codepage</var> and returned in the string
<var class="var">utf8_str</var>.  Octave uses UTF-8 as its internal encoding.  The string
<var class="var">codepage</var> must be an identifier of a valid code page.  Examples for
valid code pages are <code class="code">&quot;ISO-8859-1&quot;</code>, <code class="code">&quot;Shift-JIS&quot;</code>, or
<code class="code">&quot;UTF-16&quot;</code>.  For a list of supported code pages, see
<a class="url" href="https://www.gnu.org/software/libiconv">https://www.gnu.org/software/libiconv</a>.  If <var class="var">codepage</var> is omitted
or empty, the system default codepage is used.
</p>
<p>If <var class="var">native_bytes</var> is a string vector, it is returned as is.
</p>

<p><strong class="strong">See also:</strong> <a class="ref" href="#XREFunicode2native">unicode2native</a>.
</p></dd></dl>


</div>
<hr>
<div class="nav-panel">
<p>
Next: <a href="Numerical-Data-and-Strings.html">Numerical Data and Strings</a>, Up: <a href="Converting-Strings.html">Converting Strings</a> &nbsp; [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Concept-Index.html" title="Index" rel="index">Index</a>]</p>
</div>



</body>
</html>