File: automaton_get_stats.rst

package info (click to toggle)
python-pyahocorasick 1.4.1-2
  • links: PTS, VCS
  • area: main
  • in suites: sid, trixie
  • size: 748 kB
  • sloc: ansic: 4,554; python: 2,823; sh: 312; makefile: 242
file content (28 lines) | stat: -rw-r--r-- 910 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
get_stats() -> dict
----------------------------------------------------------------------

Return a dictionary containing Automaton statistics.

- *nodes_count*  - total number of nodes
- *words_count*  - number of distinct words (same as ``len(automaton)``)
- *longest_word* - length of the longest word
- *links_count*  - number of edges
- *sizeof_node*  - size of single node in bytes
- *total_size*   - total size of trie in bytes (about
  nodes_count * size_of node + links_count * size of pointer).

Examples
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code:: python

    >>> import ahocorasick
    >>> A = ahocorasick.Automaton()
    >>> A.add_word("he", None)
    True
    >>> A.add_word("her", None)
    True
    >>> A.add_word("hers", None)
    True
    >>> A.get_stats()
    {'nodes_count': 5, 'words_count': 3, 'longest_word': 4, 'links_count': 4, 'sizeof_node': 40, 'total_size': 232}