1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99
|
/* The FreeDOS-32 Unicode Support Library version 2.1
* Copyright (C) 2001-2006 Salvatore ISAJA
*
* This file "utf16be.c" is part of the FreeDOS-32 Unicode
* Support Library (the Program).
*
* The Program is free software; you can redistribute it and/or modify
* it under the terms of the GNU General Public License as published by
* the Free Software Foundation; either version 2 of the License, or
* (at your option) any later version.
*
* The Program is distributed in the hope that it will be useful,
* but WITHOUT ANY WARRANTY; without even the implied warranty of
* MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
* GNU General Public License for more details.
*
* You should have received a copy of the GNU General Public License
* along with the Program; see the file GPL.txt; if not, write to
* the Free Software Foundation, Inc.,
* 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
*/
#include <config.h>
#include "unicode.h"
#include <endian.h>
/**
* \addtogroup unicode
* @{
*/
static inline uint16_t big_endian_16(uint16_t v)
{
#if __BYTE_ORDER == __LITTLE_ENDIAN /* Machine is little endian */
return (v << 8) | (v >> 8);
#else /* Machine is big endian */
return v;
#endif
}
/**
* \brief Gets the length of a UTF-16BE character.
* \remarks This is the big endian version of unicode_utf16le_len()
*/
int unicode_utf16be_len(uint16_t lead_word)
{
int res = 1;
if ((big_endian_16(lead_word) & 0xFC00) == 0xD800) res = 2;
return res;
}
/**
* \brief UTF-16BE to wide character.
* \remarks This is the big endian version of unicode_utf16le_to_wchar()
*/
int unicode_utf16be_to_wchar(wchar_t *restrict result, const uint16_t *restrict string, size_t size)
{
uint16_t v;
if (!size) return -ENAMETOOLONG;
v = big_endian_16(*string);
if ((v & 0xFC00) != 0xD800)
{
*result = (wchar_t) v;
return 1;
}
if (size < 2) return -ENAMETOOLONG;
*result = ((v & 0x03FF) << 10) + 0x010000;
v = big_endian_16(*(++string));
if ((v & 0xFC00) != 0xDC00) return -EILSEQ;
*result |= v & 0x03FF;
return 2;
}
/**
* \brief Wide character to UTF-16BE.
* \remarks This is the big endian version of unicode_wchar_to_utf16le()
*/
int unicode_wchar_to_utf16be(uint16_t *s, wchar_t wc, size_t size)
{
if (wc >= 0)
{
if (wc < 0x010000)
{
*s = big_endian_16((uint16_t) wc);
return 1;
}
if (wc < 0x200000)
{
*s = big_endian_16((uint16_t) (0xD800 + (((wc >> 16) - 1) << 6) + ((wc & 0x00FC00) >> 2)));
*(s + 1) = big_endian_16((uint16_t) (0xDC00 + (wc & 0x0003FF)));
return 2;
}
}
return -EINVAL;
}
/* @} */
|