File: README.md

package info (click to toggle)
rustc 1.85.0%2Bdfsg2-3
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid, trixie
  • size: 893,176 kB
  • sloc: xml: 158,127; python: 35,830; javascript: 19,497; cpp: 19,002; sh: 17,245; ansic: 13,127; asm: 4,376; makefile: 1,051; lisp: 29; perl: 29; ruby: 19; sql: 11
file content (18 lines) | stat: -rw-r--r-- 823 bytes parent folder | download | duplicates (24)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
# futf

[![Build Status](https://travis-ci.org/servo/futf.svg?branch=master)](https://travis-ci.org/kmcallister/futf)

futf is a library for *flexible* UTF-8, or UTF-8 *fragments*. I don't know.
Check out the [API documentation](http://doc.servo.org/futf/index.html).

Anyway, it takes an index into a byte buffer and tells you things about the
UTF-8 codepoint containing that byte. It can deal with incomplete codepoint
prefixes / suffixes at the ends of a buffer, which is useful for incremental
I/O. It can also handle UTF-16 surrogate code units encoded in the manner of
[CESU-8][] or [WTF-8][].

This is a low-level helper for [tendril][] that might be useful more generally.

[CESU-8]: http://www.unicode.org/reports/tr26/
[WTF-8]: http://simonsapin.github.io/wtf-8/
[tendril]: https://github.com/kmcallister/tendril