File: dfa.texi

package info (click to toggle)
gnugo-dv 3.1.27-1
links: PTS
area: main
in suites: woody
size: 6,724 kB
ctags: 3,307
sloc: ansic: 46,051; perl: 1,303; sh: 710; makefile: 578; tcl: 401; lisp: 193
file content (450 lines) | stat: -rw-r--r-- 14,730 bytes
In this chapter, we describe the principles of the gnugo DFA
pattern matcher.  The aim of this system is to permit a fast
pattern matching when it becomes time critical like in owl
module (@ref{The Owl Code}).  The actual version is still
experimental but is expected to be fully integrated in later
versions of gnugo.  If you want to test it with version
@value{VERSION} you must run @command{configure --enable-dfa}
then recompile GNU Go (@ref{Using DFA}).  The basic principle
is to generate off line a finite state machine called a
Deterministic Finite State Automaton (@ref{What is a DFA})
from the pattern database and then use it at runtime to
speedup pattern matching (@ref{Pattern matching with DFA} and
@ref{Incremental Algorithm}).


@menu
* Using DFA::  How to use DFA's with gnugo.
* Scan Path::  The path used to scan the board.
* What is a DFA:: A recall of language theory.
* Pattern matching with DFA:: How to retrieve go patterns with a dfa ?
* Building the DFA:: Playing with explosives.
* Incremental Algorithm:: The joy of determinism.
* DFA Optimizations:: Some possible optimizations.
@end menu


@cindex pattern matching
@cindex fast pattern matching
@cindex pattern database
@cindex finite state automaton
@cindex automaton
@cindex dfa
@cindex dfa.h
@cindex dfa.c
@cindex matchpat.c
@cindex product


@node Using DFA, Scan Path, DFA, DFA
@comment  node-name,  next,  previous,  up
@subsection Using DFA

First build the program with
'configure --enable-dfa', then type 'make' as usual.

Some .db files will be compiled into DFA's by the program mkpat.
DFA are stored into C files and compiled with the engine.
When a DFA is found, gnugo write
"<pattern database name> --> using dfa" at startup and use the dfa
to "filter" patterns.
When no DFA is found, the standard pattern matcher is used.


@node Scan Path, What is a DFA, Using DFA, DFA
@comment  node-name,  next,  previous,  up
@subsection Scan Path

The board is scanned following a predefined path.
The default path is a spiral starting from 
the center of the pattern.
This path is used both to build the DFA and to scan the board.

@ifinfo
@example
  +---B--------------+
  | C 4 A . . . . . .|
  D 5 1 3 9 . . . . .|
  E 6 2 8 . . X . . .|
  | F 7 . . . . . . .|
  | . +-> . . . . . .|
  | . . . . . . . . .|
  | . O . . . X . . .|
  | . . . . . . . . .|
  | . . . . . . . . .|
  +------------------+
@end example
@end ifinfo
@ifnotinfo
@image{path}
@end ifnotinfo


This path is encoded by
two arrays of integers order_i[k] and order_j[k]
giving the offset where to read the values on the board.

Reading the board following a predefined path reduces
the two dimentional pattern matching to a linear text searching problem.
This pattern for example:

@example
?X?
.O?
?OO
@end example

@noindent
scanned following the path

@example
149
238
567

(i,j)->(i+1,j)->(i+1,j+1)->(i,j+1)->(i+2,j+0)->(i+2,j+1)->(i+2,j+2)...
@end example

@noindent
gives the string
@b{"?.OX?OO??"}
where @b{"?"} means @b{'don't care'}.
We can forget the two dimensional patterns for a time to
focus on linear patterns.

@node What is a DFA, Pattern matching with DFA, Scan Path, DFA
@comment  node-name,  next,  previous,  up
@subsection What is a DFA

The acronym DFA means Deterministic Finite state Automaton
(See @uref{http://www.eti.pg.gda.pl/~jandac/thesis/node12.html}
or @cite{Hopcroft & Ullman "Introduction to Language Theory"}
for more details).
DFA are common tools in compilers design 
(Read @cite{Aho, Ravi Sethi, Ullman "COMPILERS: Principles, Techniques and Tools"}
for a complete introduction), a lot of
powerfull text searching algorithm like @cite{Knuth-Morris-Pratt}
or @cite{Boyer-Moore} algorithms are based on DFA's
(See @uref{http://www-igm.univ-mlv.fr/~lecroq/string/} for a bibliography
of pattern matching algorithms).

Basically, 
a DFA is a set of @dfn{states} connected by labeled @dfn{transitions}.
The labels are the values read on the board,
in gnugo these values are EMPTY, WHITE, BLACK or OUT_BOARD, denoted
respectively by '.','O','X' and '#'.


The best way to represent a dfa is to draw its transition graph:
the pattern @b{"????..X"} is recognized by the following DFA:

@ifinfo
@example
@group
   .,X,O     .,X,O    .,X,O    .,X,O     .      .      X
[1]------>[2]----->[3]----->[4]----->[5]--->[6]--->[7]--->[8 OK!]
Start
@end group
@end example
@end ifinfo

@ifnotinfo
@image{dfa}
@end ifnotinfo


This means that
starting from state [1], if you read '.','X' or 'O' on the board, 
go to state [2] and so on until you reach state [5].
From state [5], if you read '.', go to state [6]
otherwise go to error state [0]. 
And so on until you reach state [8].
As soon as you reach state [8], 
you recognize Pattern @b{"????..X"}

Adding a pattern like @b{"XXo"} ('o' is a wildcard for not 'X')
will transform directly the automaton 
by synchronization product (@ref{Building the DFA}).
Consider the following DFA:

@ifinfo
@example
@group
Start .,O   .,X,O    .,O,X   .,X,O      .      .       X
[1]---->[2]----->[3]----->[4]------>[5]--->[6]---->[7]--->[8 OK!]
 |                ^        ^         ^
 |            .,O |        |         |
 |            ----         |         |
 |           |          X  |         |
 |           |          ---    .,X,O |
 |           |         |             |
 |     X     |   X     | O,.         |
  --------->[9]------>[A]--->[B OK!]-
@end group
@end example
@end ifinfo

@ifnotinfo
@image{dfa2}
@end ifnotinfo

By adding a special @dfn{error} state and completing each state
by a transition to error state when there is none, we transform
easily a DFA in a @dfn{Complete Deterministic Finite state
Automaton} (CDFA).  The synchronization product
(@ref{Building the DFA}) is only possible on CDFA's.


@ifinfo
@example
@group
Start .,O   .,X,O    .,O,X   .,X,O      .      .       X
[1]---->[2]----->[3]----->[4]------>[5]--->[6]---->[7]--->[8 OK!]
 |                ^        ^         ^      |       |      |
 |            .,O |        |         |      |       |      |
 |            ----         |         |      |       |      |
 |           |          X  |         |      |X,O    | .,O  |X,.,O
 |           |          ---    .,X,O |      |       |      |
 |           |         |             |      |       |      |
 |     X     |   X     | O,.         |     \ /     \ /    \ /
  --------->[9]------>[A]--->[B OK!]-      [0  Error state !]
@end group
@end example
@end ifinfo
@ifnotinfo
@image{cdfa}
@end ifnotinfo

The graph of a CDFA is coded by an array of states:
The 0 state is the "error" state and
the start state is 1.

@example
@group
----------------------------------------------------
 state  |   .    |   O    |   X    |   #    |  att
----------------------------------------------------
      1 |      2 |      2 |      9 |      0 |
      2 |      3 |      3 |      3 |      0 |
      3 |      4 |      4 |      4 |      0 |
      5 |      6 |      0 |      0 |      0 |
      6 |      7 |      0 |      0 |      0 |
      7 |      0 |      0 |      8 |      0 |
      8 |      0 |      0 |      0 |      0 | Found pattern "????..X"
      9 |      3 |      3 |      A |      0 |
      A |      B |      B |      4 |      0 |
      B |      5 |      5 |      5 |      0 | Found pattern "XXo"
----------------------------------------------------
@end group
@end example

To each state we associate an often empty
list of attributes which is the  
list of pattern indexes recognized when this state is reached.
In '@file{dfa.h}' this is basically represented by two stuctures:

@example
@group
@code{
/* dfa state */
typedef struct state
@{
  int next[4]; /* transitions for EMPTY, BLACK, WHITE and OUT_BOARD */
  attrib_t *att;
@}
state_t;

/* dfa */
typedef struct dfa
@{
  attrib_t *indexes; /* Array of pattern indexes */
  int maxIndexes;

  state_t *states; /* Array of states */
  int maxStates;
@}
dfa_t;}
@end group
@end example

@node Pattern matching with DFA, Building the DFA, What is a DFA, DFA
@comment  node-name,  next,  previous,  up
@subsection Pattern matching with DFA

Recognizing with a DFA is very simple
and thus very fast 
(See '@code{scan_for_pattern()}' in the '@file{engine/matchpat.c}' file).


Starting from the start state, we only need to read the board
following the spiral path, jump from states to states following
the transitions labelled by the values read on the board and
collect the patterns indexes on the way.  If we reach the error
state (zero), it means that no more patterns will be matched.
The worst case complexity of this algorithm is o(m) where m is
the size of the biggest pattern.

Here is an example of scan:

First we build a minimal dfa recognizing these patterns:
@b{"X..X"}, @b{"X???"}, @b{"X.OX"} and @b{"X?oX"}.
Note that wildcards like '?','o', or 'x' give multiple out-transitions.

@example
@group
----------------------------------------------------
 state  |   .    |   O    |   X    |   #    |  att
----------------------------------------------------
      1 |      0 |      0 |      2 |      0 |
      2 |      3 |     10 |     10 |      0 |
      3 |      4 |      7 |      9 |      0 |
      4 |      5 |      5 |      6 |      0 |
      5 |      0 |      0 |      0 |      0 |    2
      6 |      0 |      0 |      0 |      0 |    4    2    1
      7 |      5 |      5 |      8 |      0 |
      8 |      0 |      0 |      0 |      0 |    4    2    3
      9 |      5 |      5 |      5 |      0 |
     10 |     11 |     11 |      9 |      0 |
     11 |      5 |      5 |     12 |      0 |
     12 |      0 |      0 |      0 |      0 |    4    2
----------------------------------------------------
@end group
@end example

We perform the scan of the string 
@b{"X..XXO...."} starting from state 1:

Current state:     1, substring to scan :@b{ X..XXO....}

We read an 'X' value, so from state 1 we must go to
state 2.

Current state:     2, substring to scan : @b{..XXO....}

We read a '.' value, so from state 2 we must go to
state 3 and so on ...

@example
Current state:     3, substring to scan : .XXO....
Current state:     4, substring to scan : XXO....
Current state:     6, substring to scan : XO....
Found pattern 4
Found pattern 2
Found pattern 1                 
@end example

After reaching state 6 where we match patterns
1,2 and 4, there is no out-transitions so we stop the matching.
To keep the same match order as in the standard algorithm, 
the patterns indexes are collected in an array and sorted by indexes.

@node Building the DFA, Incremental Algorithm, Pattern matching with DFA, DFA
@comment  node-name,  next,  previous,  up
@subsection Building the DFA

The most flavouring point is the building of the minimal DFA 
recognizing a given set of patterns.
To perform the insertion of a new
pattern into an already existing DFA one must completly rebuild
the DFA: the principle is to build the minimal CDFA recognizing
the new pattern to replace the original CDFA with its
@dfn{synchronised product} by the new one.

We first give a formal definition:
Let @b{L} be the left CDFA and @b{R} be the right one.
Let @b{B} be the @dfn{synchronised product} of @b{L} by @b{R}.
Its states are the couples @b{(l,r)} where @b{l} is a state of @b{L} and 
@b{r} is a state of @b{R}.
The state @b{(0,0)} is the error state of @b{B} and the state
@b{(1,1)} is its initial state.
To each couple @b{(l,r)} we associate the union of patterns
recognized in both @b{l} and @b{r}.
The transitions set of @b{B} is the set of transitions
@b{(l1,r1)---a--->(l2,r2)} for
each symbol @b{'a'} such that both @b{l1---a--->l2} in @b{L} 
and @b{r1---a--->r2} in @b{R}.

The maximal number of states of @b{B} is the product of the
number of states of @b{L} and @b{R} but almost all this states
are non reachable from the initial state @b{(1,1)}.

The algorithm used in function '@code{sync_product()}' builds
the minimal product DFA only by keeping the reachable states.
It recursively scans the product CDFA by following simultaneously
the transitions of @b{L} and @b{R}. A hast table
(@code{gtest}) is used to check if a state @b{(l,r)} has
already been reached, the reachable states are remapped on
a new DFA. The CDFA thus obtained is minimal and recognizes the
union of the two patterns sets.

@ifnotinfo
For example these two CDFA's:


@image{sync-prod1}

Give by synchronization product the following one:


@image{sync-prod2}
@end ifnotinfo

It is possible to construct a special pattern database that
generates an "explosive" automaton: the size of the DFA is in
the worst case exponential in the number of patterns it
recognizes.  But it doesn't occur in pratical situations: 
the dfa size tends to be @dfn{stable}.  By @dfn{stable} we mean that if we
add a pattern which greatly increases the size of the dfa it
also increases the chance that the next added pattern does not
increase its size at all.  Nevertheless there are many ways to
reduce the size of the DFA. Good compression methods are
explained in @cite{Aho, Ravi Sethi, Ullman "COMPILERS:
Principles, Techniques and Tools" chapter Optimization of
DFA-based pattern matchers}.

@node Incremental Algorithm, DFA Optimizations, Building the DFA, DFA
@comment  node-name,  next,  previous,  up
@subsection Incremental Algorithm

The incremental version of the DFA pattern matcher is not yet implemented
in gnugo but we explain here how it will work.
By definition of a deterministic automaton, scanning the same
string will reach the same states every time.

Each reached state during pattern matching is stored in a stack
@code{top_stack[i][j]} and @code{state_stack[i][j][stack_idx]}
We use one stack by intersection @code{(i,j)}.  A precomputed reverse
path list allows to know for each couple of board intersections
@code{(x,y)} its position @code{reverse(x,y)} in the spiral scan path
starting from @code{(0,0)}.

When a new stone is put on the board at @code{(lx,ly)}, the only work
of the pattern matcher is:

@example
@group
@code{
 for(each stone on the board at (i,j))
    if(reverse(lx-i,ly-j) < top_stack[i][j])
      @{
         begin the dfa scan from the state
         state_stack[i][j][reverse(lx-i,ly-j)];
      @}
}
@end group
@end example

In most situations reverse(lx-i,ly-j) will be inferior to
top_stack[i][j]. This should speedup a lot pattern matching.

@node DFA Optimizations,  , Incremental Algorithm, DFA
@comment  node-name,  next,  previous,  up
@subsection Some DFA Optimizations

The dfa is constructed to minimize jumps in memory making some
assumptions about the frequencies of the values: the EMPTY
value is supposed to appear often on the board, so the the '.'
transition are almost always successors in memory.  The
OUT_BOARD are supposed to be rare, so '#' transitions will
almost always imply a big jump.