File: ToDo

package info (click to toggle)
similarity-tester 2.77-1
  • links: PTS
  • area: main
  • in suites: jessie, jessie-kfreebsd
  • size: 572 kB
  • ctags: 512
  • sloc: ansic: 3,197; lex: 1,562; makefile: 301
file content (49 lines) | stat: -rw-r--r-- 1,462 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
- in hash.c, some size_t -> uint64_t? for a 64-bit hash code?

- min_run_string, thresh.. = clean up sim.[ch]

- get rid of the nl_buff mechanism (2012!)

- report runs as '... matching text only ...' (proper name for Retrieve_Runs())

? plug memory leaks (and still report memory usage with -M!)

- unify idf2token() in *lang.l

- db_ not protected by #ifdef but by compilation to void ????

Done ================================================================

+ register - removed

+ Run hashing OK: average chain length = 1.5, for sim-ing the sources of MCD2
+ Idf hashing OK: smooth distribution when sim-ing the sources of MCD2

+ use two-byte tokens to obtain better resolution for sim_text and on -F option
  and UTF-8 (Johnson, Benjamin (US - Chicago))

+ different defaults per program

+ cleaning up sim.c & names

+ Microsoft comment (// ... unescaped \n)

+ emails 2009-2011 (A = I answered, R= they replied)
+AR	Marcus Brinkmann, separate letters
+AR	Scott Kuhl, percentages
+AR	Yaroslav Halchenko, identifying non-existent lines
+A	Rumen Stefanov, UTF-8
+A	Jonathan Martin, UTF-8
+AR	UTF-8 (Johnson, Benjamin (US - Chicago))

+ better structure between X.h and X.c

+ clean-up language.h and its sub-class algollike.h

+ warning in README to correct for non-MSDOS

Rejected ================================================================

X remove Miranda
X Mon Apr 11 13:23:41 1994: sim_orca
X Thu May 13 23:02:46 1993: sim ook voor C++ en Ada