File: ChangeLog

package info (click to toggle)
uni2ascii 4.20-1
  • links: PTS, VCS
  • area: main
  • in suites: sid, trixie
  • size: 992 kB
  • sloc: ansic: 8,730; sh: 4,471; tcl: 1,914; python: 53; makefile: 42
file content (543 lines) | stat: -rw-r--r-- 16,761 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
2022-08-21  Bill Poser  <poser@khunekyaz>

	* uni2ascii.c:
	Fixed bug in which in certain formats input from stdin worked
	input directly from a file caused a hang.

2013-08-18  Bill Poser  <poser@khunekyaz>

	* uni2ascii.c (ExpandToAscii): 
	Added expansion of trademark sign U+2122 -> (tm)

2011-08-24  Bill Poser  <poser@khunekcho>

	* ascii2uni.c (main): 
	Fixed bug in handling of L (WWW) format. \uXXXX is now handled correctly
	even if immediately followed by a possible hex digit. \U
	followed by fewer than 8 hex digits is not treated as an escape.
	A warning is issued if the \U format encodes a character within
	the BMP.
	

2011-07-01  Bill Poser  <poser@khunekcho>

	* uni2ascii.c (ShowUsage): 
	Added -A flag to usage message.

2011-06-30  Bill Poser  <poser@khunekcho>

	* uni2ascii.c (AscifyEquiv): 
	Added conversion of U+2032 prime to single quote.

2011-05-14  Bill Poser  <poser@khunekcho>
	Release of version 4.18.
	* ascii2uni.c (main): 
	Applied patch supplied by Bartosz Kuzma to handle the situation
	in NetBSD which lacks getline.

	* uni2ascii.c (AscifyStyle): 
	Fixed bug in which a couple of ascii characters were
	included in the list of characters replaced by them.
	This did not affect the output but threw off the substitution
	count.

2011-04-10  Bill Poser  <poser@khunekcho>

	* uni2ascii.c: 
	Clarified semantics of pure option as converting characters
	in ascii range other than space and newline.
	
	Fixed bug in which this was not implemented correctly for UTF8 types.

2011-02-04  Bill Poser  <poser@khunekcho.khunek>
	Release of version 4.17
	* uni2ascii.c (AscifyEquiv): 
	Now convert U+00B7 middle dot to period.
	Now convert U+0085 next line and U+2028 line separator to newline.

2011-01-20  Bill Poser  <poser@khunekcho.khunek>

	* uni2ascii.c (AscifyEquiv): 
	Now convert U+2022 bullet to 'o'.

	
2010-12-12  Bill Poser  <poser@khunekcho.khunek>
	Release of version 4.16.
	* uni2ascii.c (main): 
	in -a option now translate FType of CHENT to HTMLX to resolve problem of compatibility
	between the two directions.

	* SetFormat.c (SetFormat): 
	Q now sets FType to CHENT rather than HTMLX.

	
	* uni2ascii.c (AscifyEquiv): 
	Added U+2033 to characters converted to nearest ascii equivalent by B option.

2010-08-29  Bill Poser  <poser@khunekcho.khunek>

	* ascii2uni.c: 
	Removed getline.c dependent on GNU_SOURCE since getline is standard as of POSIX2008.

	* uni2ascii.c: 
	Renamed endian.h u2a_endian.h to avoid conflict with external endian.h.

2010-04-12  Bill Poser  <poser@khunekcho>

	* ascii2uni.c (main): 
	Changed Lineno from int to unsigned long and resolved half-a-dozen
	incompatibilities between its type and conversion specs.

2009-08-04  Bill Poser  <poser@bill-laptop>
	Release of version 4.14.
	
	* uni2ascii.c SetFormat.c:
	Now set escape type to hexadecimal HTML numeric reference in Q option
	rather than incorrect CHENT.

2009-05-11  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c: 
	Fixed bug in which ascification of U+2502 and U+2503 added double quote to output.

2009-04-22  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c:
	Fixed a bug in which -a S option generated a "Converted so many chars" line
	for each character due to leaving in debugging code.
	
2009-04-21  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 4.13
	
	* uni2ascii.c: 
	Fixed bug that caused excessive number of characters changed to ASCII to be
	reported.

2009-03-25  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 4.12

	* uni2ascii.c ascii2uni.c: 
	The input file may now be specified on the command line without redirection.

2008-10-02  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 4.11.
	
	* ascii2uni.c uni2ascii.c SetFormat.c formats.h:
	Added support for <XX><XX> format.
	
2008-09-27  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c uni2ascii.c SetFormat.c formats.h:
	Added support for %uXXXX format.
	
2008-08-30  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* SetFormat.c (SetFormat): 
	Added setting of *Ftype to CHENT in Y case. This fixes bug in which Y was a no-op.

	* ascii2uni.1:
	Added documentation for Y argument to -a flag.
	Corrected documentation for Q argument to -a flag.
	
	* SetFormat.c: 
	Modified ListFormatArguments to give slightly different explanations for Q
	according to the direction of the conversion.

	* uni2ascii.c (main): 
	Made Y argument to -a flag illegal.

2008-07-30  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c uni2ascii.c: 
	Switched to more informative ShowVersion and
	separated out Copyright().

2008-06-03  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c:
	Shifted to new, more detailed action summary. Due to
	incompatibility with u2a, which I am no longer updating,
	the old action summary can be chosen at configuration time
	by using the option --disable-newsummary.

2008-05-06  Bill Poser  <poser@khunekcho.khunek.poser.org>	
	Release of version 4.9
	
2008-05-04  Bill Poser  <poser@khunekcho.khunek.poser.org>
	* ascii2uni.c: 
	Fixed bug that produced bad output or segfault if a line
	ended in the prefix to an escape, e.g. just an =-sign
	in quoted-printable format.

	In quoted-printable format if a line ends in an =-sign,
	both the equal sign and the following newline are skipped
	in accordance with RFC2045.
	
2008-05-04  Bill Poser  <poser@khunekcho.khunek.poser.org>
	
	* ascii2uni.c: 

	Release of version 4.8
	
	Restored correct version of ascii2uni.c. Version 4.7 was
	inadvertently released with the 4.5 version of ascii2uni
	with a constant set much too low for testing purposes.
	Release 4.8 has ascii2uni.c with getline as in 4.6,
	with the unnecessary strlen call eliminated.

2008-04-26  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c (main): 
	Added option -P which passes through Unicode rather than converting
	to an escape if it is not transformed. This is intended to allow
	options like diacritic stripping to be used followed by another program.
	
	Added option -B which transforms to ASCII if possible. It is equivalent
	to the combination defxy.

	Added expansion of ligatures U+FB00,U+FB01,U+FB02,U+FB03,U+FB04, U+FB06 to -x option.

	Added conversion of U+00AF, U+2215, U+2216, U+2223 to -e option.

2008-04-02  Bill Poser  <poser@khunekcho.khunek.poser.org>
	* ascii2uni.c:
	Eliminated unnecessary strlen call now that we're using getline.
	
2008-04-02  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 4.6.
	
	* ascii2uni.c: 
	Replaced fgets call with GNU getline so as to handle arbitrarily
	long input lines.

2008-03-25  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c (main): 
	Added support for OOXML format.
	Fixed bug affecting BYTEQ, BYTED, and BYTEH formats.

	* ascii2uni.c:
	Added support for OOXML format.
	
2008-03-20  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c:
	Release of version 4.5
	Error messages now include the line number.

	Microsoft-style HTML entities lacking final semi-colon are now
	passed on by default rather than converted. The new -m flag
	causes them to be converted.

2008-03-19  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c (main): 
	Initialized FType to STDX, fixing bug arising when user fails
	to specify type on command line.

2008-03-10  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c: 
	Corrected FSF address in header as per bug report by Kartik Mistry.
	Made minor improvements in man pages as per patch by Kartik Mistry.

2008-01-14  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c: 
	Added -y option for single-character ascii approximations
	as per patch provided by Jesse Peterson (jesse.peterson@exbiblio.com)
	Changed license to GPL v.3

2007-08-07  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c (main): 
	Version 4.3.2
	Fixed bug that deleted blank lines in certain cases.
	Removed obsolete -8 flag from usage message.

2007-03-11  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c (main): 
	Added ability to define custom substitutions and deletions.

	Added 0x2500-0x2503 to -e flag.

2007-03-02  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Released version 4.2
	
	* SetFormat.c: 
	Added some format names. 
	Added missing pattern matches for examples of I, J, and K formats.
	Added return of error code for unrecognized format spec, with tests
	   in both uni2ascii and ascii2uni.

2007-03-01  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Released version 4.1.1
	
	* ascii2uni.c uni2ascii.c:
	Removed inadertently introduced direct calls to gettext.
	These will prevent compilation on systems without gettext.

2007-02-26  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Released version 4.1
	
	* TestSuiteAscii2Uni:
	Added entries for missing formats.

	* ascii2uni.c: 
	Removed leftover format spec info from ShowUsage();
	Fixed bugs bugs that screwed up several conversions.

2007-02-22  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* u2a.tcl (ExplainExpansions): 
	Added information about new expansions.

	* uni2ascii.c: 
	Corrected typos in usage info.
	Moved list of expansions from general info to its own info flag.
	Added expansions proposed and implemented by Cedric Luthi.

2007-02-20  Bill Poser  <poser@localhost>

	* u2a.tcl: 
	Modified to handle new method of specifying formats.

2007-02-14  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c uni2ascii.c:
	Fixed bug introduced in 3.10 in which an HTML numeric character reference
	lacking the final semi-colon led to the program not termiminating.

	Added check to Z flag argument to make sure that user supplied formats
	do not contain more than one conversion specification.

	Replaced the numerous format options with arguments to the
	-a flag.

2007-02-13  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c (main): 
	Replaced the numerous format options with arguments to the
	-a flag.

2007-02-10  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c ascii2uni.c:
	Adds support for hexadecimal numbers with prefix "16#" as in Postscript.
	Adds support for hexadecimal numbers with prefix "#16r" as in Common Lisp.
	Adds support for hexadecimal numbers with prefix "16#" and suffix "#" as in ADA.
	
	* u2a.tcl:
	Added the above.
	Improved look a little, I hope.

2007-01-09  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c (main): 
	Added support for the format consisting of a decimal integer prefixed by "v"
	as used in Perl.

2006-12-20  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c: 
	Added 38 missing characters to AscifyDiacritics and moved one that
	was mapped to the wrong ASCII value.

2006-12-02  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c (main): 
	Added G to getopt string. Somehow it got left out, causing
	option to be treated as erroneous.

	Fixed error messages for missing arguments to command line options.

	Fixed bug in -J format.

2006-07-31  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* ascii2uni.c (main): 
	Now suppress info about use of Unicode Replacement Character if none
	were emitted.

	Now print info about individual ill-formed HTML entities missing final semi-colon.

2006-07-05  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* uni2ascii.c (main): 
	Fixed bug in which a space was not added after spaces or newlines
	when AddWhitespaceP was set in non-UTF-8 formats.

2006-07-03  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* Get_UTF32_From_UTF8.c: 
	Incorporated a patch by Dylan Thurston that allows
	correct handling of a read interrupted in the middle of
	a UTF-8 sequence.

	* Release 3.9.4.

2006-06-06  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>
	
	* ascii2uni.1 uni2ascii.1:
	Corrected incorrect references to uni2ascii in ascii2uni.1.
	Added mention of default format.
	This fixes Debian Bug#367546.

2006-05-11  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* uni2ascii.c (AscifyStyle):
	Fixed bugs in which -f option changed "9" to "y" and "Z" to "a"
	per Debian bug report http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=365962

	Now convert superscript and subscript digits and plus sign to ASCII equivalents.

2006-04-30  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* u2a.tcl, uni2ascii.c
	Fixed a bug that reversed the value of the switch for converting
	ASCII characters in going from Unicode to ASCII.

	Fixed miscellaneous bugs in the reporting of the number of characters
	converted, replaced, etc. Uni2ascii now reports the total number
	of characters processed and the number actually converted.

2006-04-28  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* uni2ascii.c: 
	Corrected handling of small caps. Added those that were missing
	and changed some from converting to plain upper case to converting
	to plain lower case as per Unicode classification.

	Added expansion of plain single and double arrows.

	Added replacement of union with U.

2006-04-21  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* uni2ascii.c: 
	Added expansion of U+2026 to ... under -x per user request.

2006-04-17  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>
	
	* uni2ascii.c: 
	Added detection of bad option argument in getopt.
	
	* ascii2uni.c: 
	Added the three POSIX charmap formats - octal, decimal, and hex escapes for the
	three low bytes of a UTF-32 character in big-endian order.

	Added detection of bad option argument in getopt.
	
2006-04-16  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* uni2ascii.c: 
	Added the three POSIX charmap formats - octal, decimal, and hex escapes for the
	three low bytes of a UTF-32 character in big-endian order.

2006-02-24  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* ascii2uni.c (main): 
	Added detection of HTML character entities and numeric character references
	lacking the final semi-colon. These are converted but a warning message
	is printed.


	* ascii2uni.c (main): 
	Q mode now works in pure mode.

	* ascii2uni.1 uni2ascii.1:
	Corrected examples missing final semi-colon.

2006-01-22  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* uni2ascii.c: 
	Added support for SGML numeric character references.

2006-01-14  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* ascii2uni.c:
	Added RTF format (\uN with N decimal).

	* uni2ascii.c: 
	Fixed bug that prevented -G option from working.
	Added RTF format (\uN with N decimal).
	Added option of expanding some characters to sequences, e.g. ts-digraph to ts.	

2006-01-12  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* uni2ascii.c:
	Extended the ascii replacement options to the UTF-8 formats.

2006-01-11  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* uni2ascii.c:
	Added options for replacing Unicode characters with ascii equivalents
	rather than a textual representation, e.g. various dashes with hyphen,
	e-acute with e, boldface with plain.

2005-12-12  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* Added GUI U2A.tcl.

2005-12-06  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* uni2ascii.c: 
	The option is now available of converting Unicode to 
	HTML character entities if one exists.

	Fixed bug in which -q option was not recognized.

2005-12-05  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* uni2ascii.c ascii2uni: 
	Added support for format X'00E9'.

2005-09-27  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c ascii2uni.c: 
	Added support for octal-escaped UTF-8.

2005-09-21  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c uni2ascii.c: 
	Added the two UTF-8 formats =XX and %XX.
	Updated man pages to reflect this.

2005-09-15  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* ascii2uni.c: 
	Added option of converting all three HTML escapes,
	including character entities, for which support was
	added.

2005-09-10  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* Added to both programs the -Z command line flag
	which allows the user to set the conversion format
	directly.

2005-09-09  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c (main): 
	Initialized FType to STDX to prevent segfault when called
	with no arguments.

	*uni2ascii.c:
	Added -q flag to suppress chat. 
	
	*uni2ascii.c and ascii2uni.c:
	Added formats <U00E9>, U00E9, u00E9, U+00E9

2005-09-08  Bill Poser  <poser@khunekcho.khunek.poser.org>
	* uni2ascii.c (main)
	Added formats \x00E9, =x{00E9}, and Tcl \u~\U.
	* ascii2uni.c (main)
	Added program that does the inverse mapping.
	
2005-09-04  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* uni2ascii.c (main): 
	Added -B flag to generate backslash-escaped hex, e.g. \x0561.