1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128
|
/*@ Convert in between UnicodeTranformationFormats.
*
* Copyright (c) 2012 - 2020 Steffen (Daode) Nurpmeso <steffen@sdaoden.eu>.
* SPDX-License-Identifier: ISC
*
* Permission to use, copy, modify, and/or distribute this software for any
* purpose with or without fee is hereby granted, provided that the above
* copyright notice and this permission notice appear in all copies.
*
* THE SOFTWARE IS PROVIDED "AS IS" AND THE AUTHOR DISCLAIMS ALL WARRANTIES
* WITH REGARD TO THIS SOFTWARE INCLUDING ALL IMPLIED WARRANTIES OF
* MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR
* ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES
* WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN
* ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF
* OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
*/
#ifndef su_UTF_H
#define su_UTF_H
/*!
* \file
* \ingroup UTF
* \brief \r{UTF}
*/
#include <su/code.h>
#define su_HEADER
#include <su/code-in.h>
C_DECL_BEGIN
/*!
* \defgroup UTF Unicode Transformation Formats
* \ingroup TEXT
* \brief Convert in between UnicodeTranformationFormats (\r{su/utf.h})
* @{
*/
/*!
* \defgroup UTF8 UTF-8
* \ingroup UTF
* \brief UTF-8 (\r{su/utf.h})
* @{
*/
enum{
/*! Maximum buffer size of an UTF-8 sequence including terminating NUL. */
su_UTF8_BUFFER_SIZE = 5u
};
/*! The Unicode replacement character \c{0xFFFD} as an UTF-8 literal. */
#define su_UTF8_REPLACER "\xEF\xBF\xBD"
/*! Compiled in version of \r{su_UTF8_REPLACER}. */
EXPORT_DATA char const su_utf8_replacer[sizeof su_UTF8_REPLACER];
/*! Convert, and update arguments to point after range.
* Returns \r{su_U32_MAX} on error, in which case the arguments will have been
* stepped one byte. */
EXPORT u32 su_utf8_to_32(char const **bdat, uz *blen);
/*! @} */
/*!
* \defgroup UTF32 UTF-32
* \ingroup UTF
* \brief UTF-32 (\r{su/utf.h})
* @{
*/
/*! The Unicode replacement character \c{0xFFFD} as an UTF-32 codepoint. */
#define su_UTF32_REPLACER 0xFFFDu
/*! Convert an UTF-32 character to an UTF-8 sequence.
* \a{bp} must be large enough also for the terminating NUL (see
* \r{su_UTF8_BUFFER_SIZE}), its length will * be returned. */
EXPORT uz su_utf32_to_8(u32 c, char *bp);
/*! @} */
/*! @} */
C_DECL_END
#include <su/code-ou.h>
#if !su_C_LANG || defined CXX_DOXYGEN
# define su_CXX_HEADER
# include <su/code-in.h>
NSPC_BEGIN(su)
class utf8;
class utf32;
/*!
* \ingroup UTF8
* C++ variant of \r{UTF8} (\r{su/utf.h})
*/
class EXPORT utf8{
public:
enum{
/*! \copydoc{su_UTF8_BUFFER_SIZE} */
buffer_size = su_UTF8_BUFFER_SIZE
};
/*! \copydoc{su_UTF8_REPLACER} */
static char const replacer[sizeof su_UTF8_REPLACER];
/*! \copydoc{su_utf8_to_32()} */
static u32 convert_to_32(char const **bdat, uz *blen){
return su_utf8_to_32(bdat, blen);
}
};
/*!
* \ingroup UTF32
* C++ variant of \r{UTF32} (\r{su/utf.h})
*/
class utf32{
public:
/*! \copydoc{su_UTF32_REPLACER} */
static u32 const replacer = su_UTF32_REPLACER;
/*! \copydoc{su_utf32_to_8()} */
static uz convert_to_8(u32 c, char *bp) {return su_utf32_to_8(c, bp);}
};
NSPC_END(su)
# include <su/code-ou.h>
#endif /* !C_LANG || CXX_DOXYGEN */
#endif /* su_UTF_H */
/* s-it-mode */
|