File: reuse.re

package info (click to toggle)
re2c 4.4-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 51,512 kB
  • sloc: cpp: 34,160; ml: 8,494; sh: 5,311; makefile: 1,014; haskell: 611; python: 431; ansic: 234; javascript: 113
file content (30 lines) | stat: -rw-r--r-- 715 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
# re2py $INPUT -o $OUTPUT --input-encoding utf8

# This example supports multiple input encodings: UTF-8 and UTF-32.
# Both lexers are generated from the same rules block, and the use
# blocks add only encoding-specific configurations.
%{rules
    re2c:yyfill:enable = 0;
    re2c:indent:top = 1;

    "∀x ∃y" { return yycursor }
    *       { return None }
%}

def lex_utf8(yyinput):
    yycursor = 0
    %{use
        re2c:encoding:utf8 = 1;
    %}

def lex_utf32(yyinput):
    yycursor = 0
    %{use
        re2c:encoding:utf32 = 1;
    %}

s8 = [0xe2, 0x88, 0x80, 0x78, 0x20, 0xe2, 0x88, 0x83, 0x79]
assert lex_utf8(s8) == len(s8)

s32 = [0x2200, 0x78, 0x20, 0x2203, 0x79]
assert lex_utf32(s32) == len(s32)