File: wordnet-sense-index.README.debian

package info (click to toggle)
wordnet 1:3.0-36
  • links: PTS, VCS
  • area: main
  • in suites: bullseye, sid
  • size: 25,360 kB
  • sloc: sh: 10,763; ansic: 5,881; yacc: 758; ruby: 634; lex: 417; python: 317; makefile: 137
file content (30 lines) | stat: -rw-r--r-- 1,449 bytes parent folder | download | duplicates (6)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
WordNet sense index for DEBIAN
------------------------------

Quoting one of the upstream authors of WordNet:

-----------------------------------------------------------------------------------
Date: Thu, 8 May 2008 11:29:30 -0400 (EDT)
From: Ben Haskell <ben@clarity.princeton.edu>

...
It shouldn't change the output files' offsets. Though I'm not sure I'd worry about this
on the Debian version, since you have other patches that do. I forget whether I ever
circled back to you (Andreas) on this before I left WordNet, but is there some kind of
disclaimer in the Debian version that points out this problem?

The short version: the file offsets, which are the easiest-to-use ID's in WordNet,
change anytime there's a change in the lexicographer files. The synset keys, which are
harder to compute, don't change nearly as haphazardly. But, since they're harder to
compute, no one uses them. So, anyone who wants to use WordNet for research purposes
will undoubtedly come across groups that use the file-offset style of ID's, which,
since you incorporate Debian-specific patches, will be wrong for the Debian package of
WordNet.
------------------------------------------------------------------------------------

So it might or might not be that index.sense does not work like expected.
If a user of this package notices any problem please report this via the
reportbug tool!


Andreas Tille <tille@debian.org>  Thu, 08 May 2008 10:20:36 +0200