1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
|
==========================================
Sombok - Unicode Text Segmentation Package
==========================================
Sombok is Copyright (C) 2009-2014, by Hatuka*nezumi - IKEDA Soji.
It is free software; you can redistribute it and/or modify it under the
terms of either:
a) the GNU General Public License as published by the Free Software
Foundation; either version 1, or (at your option) any later version,
or
b) the "Artistic License".
----
See the COPYING and the ARTISTIC files for more details.
What is this
============
Sombok library package performs Line Breaking Algorithm described in
Unicode Standard Annex #14 (UAX #14). East_Asian_Width informative
properties defined by Annex #11 (UAX #11) may be concerned to determin
breaking positions. This package also implements "default" Grapheme
Cluster segmentation described in Annex #29 (UAX #29).
Getting Sombok
==============
You can get Sombok from:
https://github.com/hatukanezumi/sombok.git
Installing
==========
See INSTALL.
Name
====
"Sombok" (or "sambak") is Korean onomatopeic word to represent "cutting
cleanly". It is not connected to Khmer word "sombok" nor Afrikaans word
"sjambok".
Language bindings
=================
Perl
Unicode-LineBreak: http://search.cpan.org/dist/Unicode-LineBreak/
Python
pytextseg: http://pypi.python.org/pypi/pytextseg/
Author
======
Hatuka*nezumi - IKEDA Soji <hatuka(at)nezumi.nu>.
|