File: README

package info (click to toggle)
libtext-undiacritic-perl 0.07-2
  • links: PTS, VCS
  • area: main
  • in suites: bookworm, forky, sid, trixie
  • size: 148 kB
  • sloc: perl: 284; makefile: 2
file content (56 lines) | stat: -rw-r--r-- 1,482 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
NAME
    Text::Undiacritic - remove diacritics from a string

VERSION
    This document describes Text::Undiacritic 0.01

SYNOPSIS
        use Text::Undiacritic qw(undiacritic);
        $ascii_string = undiacritic( $czech_string );

DESCRIPTION
    Changes characters with diacritics into their base characters.

    Also changes into base character in cases where UNICODE does not provide
    a decomposition.

    E.g. all characters '... WITH STROKE' like 'LATIN SMALL LETTER L WITH
    STROKE' do not have a decomposition. In the latter case the result will
    be 'LATIN SMALL LETTER L'.

    Removing diacritics is useful for matching text independent of spelling
    variants.

SUBROUTINES/METHODS
  undiacritic
        $ascii_string = undiacritic( $characters );

    Removes diacritics from $characters and returns a simplified character
    string.

    The input string must be in character modus, i.e. UNICODE code points.

DIAGNOSTICS
CONFIGURATION AND ENVIRONMENT
DEPENDENCIES
    *   version

    *   charnames

    *   Unicode::Normalize

INCOMPATIBILITIES
BUGS AND LIMITATIONS
    There is no experience if this module gives useful results for scripts
    other than Latin.

AUTHOR
    Helmut Wollmersdorfer "<WOLLMERS@cpan.org>"

LICENSE AND COPYRIGHT
    Copyright (c) 2007, Helmut Wollmersdorfer "<WOLLMERS@cpan.org>". All
    rights reserved.

    This module is free software; you can redistribute it and/or modify it
    under the same terms as Perl itself.