File: nltk-migration.rst

package info (click to toggle)
python-wn 1.0.0-3
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 1,100 kB
  • sloc: python: 8,429; xml: 566; sql: 238; makefile: 12
file content (111 lines) | stat: -rw-r--r-- 4,370 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
Migrating from the NLTK
=======================

This guide is for users of the `NLTK <https://www.nltk.org/>`_\ 's
``nltk.corpus.wordnet`` module who are migrating to Wn. It is not
guaranteed that Wn will produce the same results as the NLTK's module,
but with some care its behavior can be very similar.

Overview
--------

One important thing to note is that Wn will search all wordnets in the
database by default where the NLTK would only search the English.

>>> from nltk.corpus import wordnet as nltk_wn
>>> nltk_wn.synsets('chat')                 # only English
>>> nltk_wn.synsets('chat', lang='fra')     # only French
>>> import wn
>>> wn.synsets('chat')                      # all wordnets
>>> wn.synsets('chat', lang='fr')           # only French

With Wn it helps to create a :class:`wn.Wordnet` object to pre-filter
the results by language or lexicon.

>>> en = wn.Wordnet('omw-en:1.4')
>>> en.synsets('chat')                     # only the OMW English Wordnet

Equivalent Operations
---------------------

The following table lists equivalent API calls for the NLTK's wordnet
module and Wn assuming the respective modules have been instantiated
(in separate Python sessions) as follows:

NLTK:

>>> from nltk.corpus import wordnet as wn
>>> ss = wn.synsets("chat", pos="v")[0]

Wn:

>>> import wn
>>> en = wn.Wordnet('omw-en:1.4')
>>> ss = en.synsets("chat", pos="v")[0]

.. default-role:: python

Primary Queries
'''''''''''''''

=========================================  ===============================================
NLTK                                       Wn
=========================================  ===============================================
`wn.langs()`                               `[lex.language for lex in wn.lexicons()]`
`wn.lemmas("chat")`                        --
--                                         `en.words("chat")`
--                                         `en.senses("chat")`
`wn.synsets("chat")`                       `en.synsets("chat")`
`wn.synsets("chat", pos="v")`              `en.synsets("chat", pos="v")`
`wn.all_synsets()`                         `en.synsets()`
`wn.all_synsets(pos="v")`                  `en.synsets(pos="v")`
=========================================  ===============================================

Synsets -- Basic
''''''''''''''''

===================  =================
NLTK                 Wn
===================  =================
`ss.lemmas()`        --
--                   `ss.senses()`
--                   `ss.words()`
`ss.lemmas_names()`  `ss.lemmas()`
`ss.definition()`    `ss.definition()`
`ss.examples()`      `ss.examples()`
`ss.pos()`           `ss.pos`
===================  =================

Synsets -- Relations
''''''''''''''''''''

==========================================  =====================================
NLTK                                        Wn
==========================================  =====================================
`ss.hypernyms()`                            `ss.get_related("hypernym")`
`ss.instance_hypernyms()`                   `ss.get_related("instance_hypernym")`
`ss.hypernyms() + ss.instance_hypernyms()`  `ss.hypernyms()`
`ss.hyponyms()`                             `ss.get_related("hyponym")`
`ss.member_holonyms()`                      `ss.get_related("holo_member")`
`ss.member_meronyms()`                      `ss.get_related("mero_member")`
`ss.closure(lambda x: x.hypernyms())`       `ss.closure("hypernym")`
==========================================  =====================================

Synsets -- Taxonomic Structure
''''''''''''''''''''''''''''''

================================  =========================================================
NLTK                              Wn
================================  =========================================================
`ss.min_depth()`                  `ss.min_depth()`
`ss.max_depth()`                  `ss.max_depth()`
`ss.hypernym_paths()`             `[list(reversed([ss] + p)) for p in ss.hypernym_paths()]`
`ss.common_hypernyms(ss)`         `ss.common_hypernyms(ss)`
`ss.lowest_common_hypernyms(ss)`  `ss.lowest_common_hypernyms(ss)`
`ss.shortest_path_distance(ss)`   `len(ss.shortest_path(ss))`
================================  =========================================================

.. reset default role
.. default-role::

(these tables are incomplete)