package Plucene::Analysis::Analyzer;

=head1 NAME 

Plucene::Analysis::Analyzer - base class for Analyzers

=head1 SYNOPSIS

	my $analyzer = Plucene::Analysis::Analyzer::Subclass->new;

=head1 DESCRIPTION

This is the abstract base class for Analyzers. It is not meant to be used
directly; subclasses supply the C<tokenstream> method.

An Analyzer builds TokenStreams, which analyze text. It thus represents 
a policy for extracting index terms from text.

Typical implementations first build a Tokenizer, which breaks the stream 
of characters from the Reader into raw Tokens. One or more TokenFilters 
may then be applied to the output of the Tokenizer.
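
The Tokenizer-then-TokenFilter arrangement described above can be sketched
roughly as follows. This is only an illustration under stated assumptions:
every C<My::*> package is hypothetical, the base class is a stand-in copied
from this file so that the sketch runs without Plucene installed, and the
plain-string argument to C<tokenstream> and the C<next> method are assumed
conventions, not the actual Plucene interfaces.

```perl
use strict;
use warnings;

# Stand-in for Plucene::Analysis::Analyzer, copied from this file so the
# sketch runs without Plucene installed.
package My::Analyzer;
sub new { bless {}, shift }
sub tokenstream { die "tokenstream must be defined in a subclass" }

# Hypothetical tokenizer: splits text on whitespace into raw tokens.
package My::WhitespaceTokenizer;
sub new {
	my ($class, $text) = @_;
	return bless { tokens => [ split ' ', $text ] }, $class;
}
# Returns the next token, or undef when the stream is exhausted.
sub next { shift @{ $_[0]{tokens} } }

# Hypothetical filter: wraps another stream and lowercases each token.
package My::LowerCaseFilter;
sub new {
	my ($class, $input) = @_;
	return bless { input => $input }, $class;
}
sub next {
	my $self  = shift;
	my $token = $self->{input}->next;
	return defined $token ? lc $token : undef;
}

# Hypothetical analyzer: the policy is "split on whitespace, then lowercase".
package My::SimpleAnalyzer;
our @ISA = ('My::Analyzer');
sub tokenstream {
	my ($self, $text) = @_;
	return My::LowerCaseFilter->new(My::WhitespaceTokenizer->new($text));
}

package main;

# Drains a token stream into a list.
sub collect_tokens {
	my $stream = shift;
	my @tokens;
	while (defined(my $token = $stream->next)) {
		push @tokens, $token;
	}
	return @tokens;
}

my @terms = collect_tokens(
	My::SimpleAnalyzer->new->tokenstream("Hello Plucene World"));
print join(",", @terms), "\n";
```

Because the filter only needs an object with a C<next> method, further
filters could be stacked around it in the same way without changing the
tokenizer.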

=head1 METHODS

=cut

use strict;
use warnings;

=head2 new

	my $analyzer = Plucene::Analysis::Analyzer::Subclass->new;

=cut

sub new { bless {}, shift }

=head2 tokenstream

This method must be defined in a subclass. The implementation in this
base class dies with an error when called.
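
As a small sketch of what that means for callers: a subclass that does not
override C<tokenstream> inherits the dying implementation, and the error
can be trapped with C<eval>. The C<My::*> packages below are hypothetical;
C<My::BaseAnalyzer> simply repeats the C<new> and C<tokenstream> bodies
from this file so the example runs on its own.

```perl
use strict;
use warnings;

# Stand-in with the same new/tokenstream bodies as this file.
package My::BaseAnalyzer;
sub new { bless {}, shift }
sub tokenstream { die "tokenstream must be defined in a subclass" }

# Hypothetical subclass that forgets to override tokenstream.
package My::IncompleteAnalyzer;
our @ISA = ('My::BaseAnalyzer');

package main;

# new blesses into the invoking class, so this is a My::IncompleteAnalyzer.
my $analyzer = My::IncompleteAnalyzer->new;

# The inherited tokenstream dies; eval traps the error in $@.
my $stream = eval { $analyzer->tokenstream };
my $error  = $@;
print "caught: $error" if !defined $stream;
```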

=cut

sub tokenstream { die "tokenstream must be defined in a subclass" }

1;