File: ChangeLog

package info (click to toggle)
msort 8.53-2.2
  • links: PTS
  • area: main
  • in suites: bullseye, buster, sid
  • size: 2,360 kB
  • sloc: sh: 10,138; ansic: 10,031; makefile: 51
file content (1206 lines) | stat: -rw-r--r-- 39,783 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
1145
1146
1147
1148
1149
1150
1151
1152
1153
1154
1155
1156
1157
1158
1159
1160
1161
1162
1163
1164
1165
1166
1167
1168
1169
1170
1171
1172
1173
1174
1175
1176
1177
1178
1179
1180
1181
1182
1183
1184
1185
1186
1187
1188
1189
1190
1191
1192
1193
1194
1195
1196
1197
1198
1199
1200
1201
1202
1203
1204
1205
1206
2010-01-10  Bill Poser  <poser@bill-laptop>

	* msort.c:
	Now use new function names with prefix tre_ exported by libtre.
	
2009-08-23  Bill Poser  <poser@bill-laptop>

	* msort.c: 
	Removed unnecessary conditioning of Hybrid mapping code on LOCALE_SORT_ORDER.

2009-05-30  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Now reverse digits of Unix time before using as random number
	seed so as to make the seed change visibly from run to run.

2009-01-26  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Changed type of ExclusionTables, fixing a bug
	in the reporting of exclusions and saving considerable
	memory when exclusions are used.

2009-01-13  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msg.tcl: 
	Labelled algorithm choices as stable and unstable.

	* msort.c: 
	Added -Z option for copying the first record to the output without
	sorting it. This is useful for files with a header.
	Added a regression test for this option.

2008-12-05  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release 8.52.
	
	* msort.c (GetKeys): 
	Don't bother to extract and store field if comparison type for key
	is random since we aren't going to use it. This saves a little time
	and possibly a good bit of storage.

	* RegressionTests
	Separated the tests whose success depends on locales
	from the rest. The main set are run by "make test" as before.
	To run the locale-dependent tests, type "make localetest".
	The results appear in LocaleTestResults.

2008-11-02  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* input.c (ReportReadError): 
	Changed stray explicit calls of gettext to calls
	to macro _, which is properly conditioned.

2008-10-27  Bill Poser  <poser@khunekcho.khunek.poser.org>

	Added regression test for more complex substition.

2008-10-20  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msg.tcl (GetLogInfo): 
	Modified to accomodate modifications to msort.log.

	* msort.c: 
	Corrected format specifier from d to u for printing number of records.
	Moved closing of log to very end, avoiding problem of early closing,
	and moved this into function CloseLog, which adds log-termination.

2008-10-19  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	If one or more records have been discarded due to problems in key
	extraction but the run is otherwise successful, the exit code is
	now RECORDEXCLUDED rather than BADRECORD.

	* misc.c:
	Modified GetISO8601Key to handle an optional leading sign and
	moved it into iso8601.c.
	
2008-10-18  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c:
	Made error-checking and reporting finer-grained in GetMonthNames.
	
	Split time and iso8601 date/time regression tests so as not to
	mix data with and without time zone offsets since mixing them causes
	tests to fail if executed in some time zones.
	
2008-10-11  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.51.
	
	* msg.tcl:
	Added setting of random number generator seed.
	Added regression test for angles.
	Added regression tests for collating sequences.

2008-10-08  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Moved completion log time stamp ahead of other information so
	that it immediately follows start time stamp, facilitating comparison.

2008-10-05  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Moved initial log time stamp and key description to just before
	point at which records are read.

2008-09-30  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Added -P option for setting randomization seed so that runs
	will be reproducible.

	Now report the randomization seed used in the log for future reference.

2008-09-28  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.50.
	
	* msort.c: 
	Fixed bug in which compound positional keys (those involving portions of more
	than one field) were not properly null terminated, resulting in various sorts
	of errors, including crashes. 

	Key specifications with a character offset of zero are now treated
	as fatal errors and reported to the user.
	
	Added 17 more regression tests.
	
2008-09-26  Bill Poser  <poser@khunekcho.khunek.poser.org>
	* configure.ac:
	Made failure of check for libintl non-fatal.
	
	* msort.c:
	Made report of failure of key extraction on one or more records
	subject to Verbose flag.

	Added ISO8601 time/date stamps to beginning and end of log.
	
	* msort.c (GetKeys): 
	Now trap fields with empty value, which previously
	could cause segfaults.
	

2008-09-24  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.49.
	
	* RegressionTests
	Added nine regression tests.
	
	* README:
	Added section on Dependencies, discussing use of non-standard
	libraries and how to build the simplest configuration.

2008-09-22  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* misc.c: 
	Eliminated direct call to libgmp, which should simplify installation.
	Now get gmp version info from function in libuninum.

2008-09-21  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* configure.ac: 
	Added check for libintl.

	* msort.c: 
	Eliminated printing of message "Consult the log for details" if
	logging is not enabled.

	* strs.c (ws2u8):
	Fixed bug resulting from allocation of insufficient memory.
	
	* msort.c (GetWhiteSpaceDefinition):
	Removed debugging line inadvertently left in.
	
	(GetSubstitutions): 
	Removed inappropriate free of stack storage.

2008-09-20  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.48.	

	* misc.c (GetISO8601Key): 
	Fixed bug that caused inappropriate return of error
	if no numerical time zone offset is present.

	(GetTimeKey): 
	Fixed bug that caused inappropriate return of error
	from well-formed ISO8601 times.

	(GetDateKey): 
	Fixed a bug that caused a segmentation fault if
	an attempt was made to analyze as a date a string not
	containing the expected first date component separator.
	
2008-09-19  Bill Poser  <poser@khunekcho.khunek.poser.org>
	
	* msort.c (CaseFoldTbl): 
	Updated case folding to Unicode 5.1.
	
	* msort.c (DescribeKeys): 
	Added indication of use of only first character if that option is selected.

2008-07-30  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* misc.c (ShowVersion): 
	Added information about system and machine type.

2008-07-14  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Added option for lexicographic sorts of using only the
	first character (after subs, exclusions, etc.). This
	emulates pre-modern alphabetization practice.

2008-07-10  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* unicode.h: 
	Changed UTF32 etc. from standard types (unsigned long etc.) to
	minimum size types (u_int32_t) for portability.

2008-06-30  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.47.
	
	* msort.c (ConstructSortOrder): 
	Fixed storage allocation bug when using utf8proc as per
	patch by Steve Tinney.

2008-05-28  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Modified domain name processing to handle optional protocol
	segment, which is treated as lowest priority component like
	the username in an email address.

2008-05-23  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.46.
	
	* msort.c (ReverseDomainComponents):
	Fixed bugs in rearrangement of email addresses.

2008-05-18  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c (MapHybrid): 
	Fixed potential malloc bug (inconsistent use of long and int)

2008-05-14  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c (GetKeys): 
	Fixed small bug in Turkic case folding.

2008-04-19  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* manual.tex
	Filled in missing text at end of section 10.7 (IP address
	example).

2008-04-10  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msg.tcl:
	Now check for Tk 8.5 and adapt font sizes accordingly.

2008-04-09  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Fixed ValidateCRange so as not to detect error when
	first index of a character range is positive but second
	index is negative, fixing bug reported by Jeppe Oland.

	* misc.c: 
	Added ifdef for include of uninum.h as per patch
	by Jeppe Oland. This eliminates a problem compiling
	without libuninum if libuninum's header files are not
	installed.

2008-01-27  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* misc.c: 
	Moved include of nsdefs.h within ifdef for uninum
	as per patch by Steve Tinney.

2007-10-05  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Changed license to GPL3.
	Eliminated printing of version info when called with no arguments.

2007-10-02  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Added command-line options for specifying input and output
	files. Also, if the specified input and output files are
	the same, the input is now overwritten.

	Added option of suppressing log.
	Refactored so most writes to log are via PrintLog and wPrintLog.

2007-09-04  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* misc.c (ShowVersion): 
	Now print version of glibc if present.

2007-09-02  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* misc.c  (GetTimeKey, GetISO8601Key): :
	Added code for handling ISO 8601 time zone info.
	Also, modified GetISO8601Key so that, like GetTimeKey,
	it works on a UTF8 string.
	The separators in ISO time-date strings are now optional,
	as per the standard.
	

2007-09-01  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* misc.c: 
	Added function TimezoneOffset for use in normalizing times
	to UTC.

2007-06-08  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Now increase number of records by InitialMaxRecords
	rather than half that number.

2007-03-28  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Now flush log during startup to preserve info on crashes.

2007-03-23  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.40
	
	* msort.c: 
	It is now possible to choose between the utf8proc and icu
	libraries for normalization.	

2007-03-21  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release of version 8.39.

	* msort.c: 
	Added conditionals so that it is possible to use an
	option to configure to compile msort without
	libuninum or libgmp. This simplifies installation for those
	who do not need this capability.

	* misc.c (OpenFile): 
	Inability to open a file now causes an error exit.

2007-03-20  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c (main): 
	Added unicode normalization of input records.
	Added unicode normalization of tag regexps.
	Added unicode normalization of exclusions read from files.
	Added unicode normalization of exclusions read from command line.
	Added unicode normalization of sort order file.
	Added unicode normalization of substitutions.
	Added unicode normalization of record separator.
	Added unicode normalization of field separators.

	* misc.c: 
	Renamed IdentifySelf to ShowVersion for consistency with other programs.
	Added ICU version info to ShowVersion.

	* uniio.c: 
	Unified uout.c and UTF8in.c into uniio.c.

2007-03-04  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Month names may now be used in the month field of dates,
	with their ordering determined as with independent month names.

	* misc.c (GetDateKey): 
	Added stricter error checking on number conversion so as
	to reject ill-formed numbers in dates like "4b".

2007-02-28  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* misc.c (GetDateKey): 
	Added ability to handle day of year keys.

2007-02-07  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msg.tcl (ExplainEndOfLineMarker): 
	Adapted list of number systems to livuninum version 2.1.

	* info.c: 
	Adapted ListNumberSystems to libuninum version 2.1.

2007-01-26  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* info.c (PrintLimits): 
	Made thousands separation consistent and ifdefed for C libraries that
	do not support it.

	* limits.h: 
	Abstracted multigraph numbers and improved comments.

	* misc.c (IdentifySelf): 
	Version info now provides the version of all three of the
	non-standard libraries used.

2007-01-25  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Release 8.36.1
	
	* configure.ac: 
	Changed AC_FUNC_MALLOC in configure.ac to AC_CHECK_FUNCS([...malloc...])
	since on Ubuntu malloc apparently fails the GNU test of returning non-null
	when passed an argument of zero. This causes the compilation to fail
	on unresolved references to rpl_malloc.

	* info.c misc.c: 
	Added includes of <uninum/unicode.h>.

2007-01-24  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c (GetKeys): 
	Adapted to version 2.0 of libuninum by changing NS_RETURN_*
	to NS_TYPE_*.

2007-01-12  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* configure.ac: 
	Added configuration option --disable-comparisoncount
	which removes the code that reports the number of comparisons.
	This speeds things up a little bit for sorts involving a lot
	of comparisons.

2007-01-09  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Version 8.35 released.
	
	* msort.c (main): 
	Added checks to several key-specific options to ensure that a key has
	been specified earlier on the command line. This eliminates segfaults
	when such incorrect command-lines are encountered. All key-specific
	options are now covered. For some reason this check was missing
	from a few of them. Also made error message more informative and
	moved the check into a function.

	* Makefile.am (AM_LDFLAGS): 
	Force linkage with gmp library. Apparently this is not taken care
	of by the autoconf process on Mac OS X and OpenBSD.

2007-01-05  Bill Poser  <poser@khunekcho.khunek.poser.org>
	Version 8.34 released.
	
	* msg.tcl:
	Modified GUI to accomodate number system specification for
	numeric string keys.

	* msort.c (GetKeys): 
	Added support for number system specification for numeric string keys.

2006-12-31  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c (GetKeys):
	Modifed to use libuninum for non-Western numeric keys.

2006-12-29  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c input.c:
	Modified input routines to make more informative reports on
	input errors including bad UTF-8.

2006-12-25  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msg.tcl (SetupKey): 
	Added support for writing system specification for numeric keys.

2006-12-24  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Added ability for numeric keys to use pretty much any number system,
	with -y flag for specifying number system.

2006-12-18  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* sorts.c (Compare): 
	Fixed bug that caused segfaults on numeric keys.

2006-12-17  Bill Poser  <poser@khunekcho.khunek.poser.org>
	* msort.c sorts.c: 
	Fixed subtle bug in multigraph mapping (MapNonHybridLI).
	
	Redid the handling of non-locale-dependent hybrid keys,
	which required modifications of both Compare and
	GetKeys. This eliminates occasional, apparently random,
	segfaults.
	
	Corrected storage allocated for tmp in ExtractNum and ExtractTxt.
	
2006-12-10  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c info.c: 
	Separated listing of informational options from other general
	options in order to shorten the latter listing.

2006-12-09  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Modified domain name comparison to handle email addresses
	as well, that is, domain names with a leading component separated
	from the domain name by an at-sign.

	Improved description of keys in log and added logging of domain name
	comparison type.

2006-11-29  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Added domain name sort type.

2006-11-24  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c (main): 
	The report of the command line by which msort was invoked is now indented slightly.

2006-10-09  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* info.c (PrintDefaults): 
	Added default algorithm to list of defaults.

	* info.c msort.c (PrintGNUEquivalences):
	Added command line option -G which prints GNU sort equivalences.

	* msort.c: 
	Added ifdef for alloca.h which fixes build problem on FreeBSD.

2006-09-17  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msg.tcl (PopupIPAEntryC): 
	Fixed non-functional uvular trill button.

2006-09-06  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* msort.c (DescribeKeys): 
	Fixed Debian bug 383230 as per patch by celelibi@gmail.com
	(key index i instead of 0).
	
	(main): 
	Fixed Debian bug 383232 as per patch by celebi@gmail.com.
	(Extra fclose on log.)

2006-08-10  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	Updated case-folding to Unicode 5.0.

2006-08-09  Bill Poser  <poser@khunekcho.khunek.poser.org>

	* msort.c: 
	If there is only one record, we still mention that there
	is no point in sorting only one record but we now
	exit successfully after writing out that record.
	This is necessary for non-interactive use.

2006-07-06  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* input.c strs.c:
	Modified to use new prefixed UTF8 error codes.

	* Get_UTF32_From_UTF8i.c: 
	Patched to handle read interrupted in middle of UTF-8 sequence.

2006-05-22  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* msg.tcl (About): 
	Changed help popups to toggle.

	Made various sections of pages roll-up, reducing footprint.

2006-05-20  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* msort.c (GetKeys): 
	Updated case-folding to Unicode 4.1.
	
	Added the option of Turkic case-folding.

2006-05-18  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* msort.c sorts.c record.h: 
	Revamped hybrid locale comparison, eliminating bugs.

2006-05-13  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* msort.c (GetKeys): 
	Eliminated small memory leak (LocaleTempKey).

2006-05-12  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* msg.tcl:
	Added explanatory popups for conversion of stylistic variants and enclosed forms,
	and stripping of diacritics.

	* conversions.c: 
	Added Hebrew presentation forms to ConvertStylistic.
	This required storage reallocation.

	* msort.c (main): 
	Added option of just checking whether the input is sorted.

2006-05-12  Bill Poser  <poser@S010600e0185aee1a>

	* conversions.c: 
	Corrected a bug that erroneously converted "9" to "y"
	when the option for converting stylistic equivalents
	is used.

	Superscript and subscript forms are now converted
	to their plain equivalents.

2006-05-06  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* info.c (PrintUsage): 
	Updated usage message to reflect split between help for global and
	key-specific options.

2006-05-05  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* msort.c:
	Added support for month names.
	
	Now distinguish count of well-formed records from count of input records,
	which corrects record numbers output with error messages. 
	
	*passim:
	Sorted out configuration conditions into three groups, LOCALE_SORT_ORDER,
	LOCALE_MONTH_NAME, and LOCALE_GETTEXT, so that we now check for just
	the right combinations.
	
2006-04-28  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>
	Added options for treating certain classes of characters as their "basic" counterparts:
	      (a) strip diacritics;
              (b) treat enclosed characters as basic.
              (c) treat stylistic variants as basic.
	      	  At present this includes the stylistic variants of the ASCII characters,
	  	  the Latin but non-ASCII symbols, the Arabic positional variants,
	  	  the half-width CJK punctuation, the half-width kana, and the half-width
		  hangul.

	
2006-04-21  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* misc.c:
	Eliminated minor memory corruption arising in DelimitNumber.

	* msort.c:
	Eliminated significant memory leakage from GetKeys.
	Neatened up info generated about dynamic memory, comparisons, and record count.

2006-04-11  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* info.c
	Split PrintFlags into PrintGeneralFlags and PrintKeySpecificFlags.

2006-04-10  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* msort.c (main): 
	Added long options if supported.
	
2006-04-08  Bill Poser  <poser@khunekyaz.khunek.poser.org>

	* msg.tcl: 
	Changed 0xff to 255 in three color default settings
	as a workaround to a bug in some MS Windows implementations
	of Tcl.

2006-04-03  Bill Poser  <poser@khunekyaz.khunek.poser.org>
	* msort.c:
	If the argument to the -s flag is a locale name, the rules
	of that locale are now used. If the argument to the -s
	flag is "current" or "locale", the rules of the current
	locale are used.

	Added record of amount of memory dynamically allocated to log.

2006-03-26  Bill Poser  <poser@S010600023f310f85.ca.shawcable.net>

	* msort.c: 
	Fixed bug that prevented multigraph mapping from affecting
	string length.
	
2006-03-09  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* msort.c: 
	In atofwc now strip space characters so as to prevent
	strto(l)d from reporting an error due to, e.g., a newline
	following the number when records are not lines and fields
	are not lines.

2006-03-05  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* configure.ac: 
	Added check for libtre so that absence will be detected prior to
	compilation.
	
2006-02-25  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>
	passim:
	Added option of numeric string comparison.
	
2006-02-17  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>
	passim:
	
	Key selection by position has been extended to allow keys to span
	contiguous fields from one position to another, where a position
	consists of a field number and a character offset. This makes key
	selection by position approximately the same as in GNU sort. One
	difference is that unlike GNU sort, msort permits field numbers to be
	negative, indicating a count from the end. The other is that in msort
	if only a single position is specified only that field is included in
	the key, whereas in GNU sort the key extends from the specified
	position to the end of the record.
	

2006-02-11  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>	
	passim:
	Adds to the permitted format for angles the sexagesimal format in which
	the components are separated by whitespace rather than colon.

	Fixes a bug affecting negative angles.


2006-02-08  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* misc.c (GetAngleKey): 
	Added GetAngleKey.

	msort.c:
	Added support for angle key.
	
	info.c:
	Added info about angle key.
	

2006-02-05  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* msort.c: 
	Moved the four sort functions, ancillary routines, and Compare
	into sorts.c This required moving some definitions out of msort.c
	into header files, including the creation of comparisons.h and keys.h.

	* info.c:
	Added command line option for fixed length records to option summary.
	
	* msort.c: 
	Added support for fixed length records.

	Fixed bug that added extra newlines to output.

	* input.c:
	Added GetFixedLengthRecord().

2006-02-04  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* msort.c (main): 
	Added check of whether input is restricted to BMP if -B option is specified.
	
	Added choice of algorithm, and added Mergesort and InsertionSort as possible algorithms.
	
	Also added a configure option --(dis|en)able-allocaok, which controls
	whether alloca is used in Mergesort. By default it is, but storage may be
	allocated on the heap instead if alloca is buggy, as it is on some systems.
	
2006-01-17  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* msort.c misc.c record.h:
	Now use long double internally for numeric keys rather than double
	if machine supports it.

Version 8.11
	
2006-01-17  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* info.c (PrintLimits): 
	The maximum length of a numeric key is now listed as a limit.
	
2006-01-17  Bill Poser  <poser@S010600e0185aee1a.ca.shawcable.net>

	* msort.c (GetKeys): 
	Now check the conversion of numeric strings to internal floats for errors.
	Fixed bug in which msort segfaulted if given a very long  numeric string
	as key.

Version 8.9

Carried out a thorough code-cleanup. Ran gcc with -Wall -pedantic and eliminated
all but 2 warnings.

Eliminated the restriction on number of keys.

Fixed a bug that made the widget for inserting characters
by their Unicode codepoint insert them into itself.

Version 8.8
Modified msort to make inability to open log file non-fatal.
It now tries to open a log in /tmp. Only if that fails
does it exit.

Updated msg to use improved font control panel.

Updated character entry widgets.

Updated configuration system.
	Incompatibilities: character entry widget definition format
			   Boolean command names

Added binding on <<B3>> to all scrollable windows in which a significant number of
lines might appear that scrolls in larger increments than the left mouse button.

version 8.7
Added hybrid sort type.
Added random sort type.
Positional key field now defaults to key number.
Improved log/stderr info output.
Changed ComparisonCnt to long long to prevent overflow. 

Right-click in trough of scale now changes font size by increment of five.
Improved the font control panel in various ways.

version 8.6.1
Fixes a bug in the exclusion entry widget in the GUI.

version 8.6
Added the ability to configure msg via an initialization file.
Added the ability to select fonts interactively.
Moved the toggle for balloon help from the Help menu to the
      new Configure menu.
Considerable cosmetic improvement to the GUI.
Lines beginning with crosshatch in substitution files are now comments.
Fixed a bug in GetSubstitutions that truncated long substitutions.

Made a change that may result in browser execution working under MacOSX .
Minor tweaking of info messages.

version 8.5.1
Added include of stdlib to dstr.c in response to report of compiler warnings.

version 8.5
Added character ranges as keys to msort and msg.
Added regular expression substitution on keys.
Fixed a bug in Unicode case folding.
Added ability to specify CR (0x0D) as end-of-line character instead of LF (0x0A).
Rearranged General page.
Removed unused images and conditionals for using them.
Updated manual.
Removed DOS stuff from manual.

version 8.4
Fixed serious bug affecting numeric keys.
Fixed bug in msort that garbled copy of bad record when key extraction failed.
Improved detection of OS and base graphics system by msg.
Fixed bug in msg that triggered an error when the "How to Use this Program" popup
      was popped up.
Added missing balloon help for save and cancel buttons of sort order file separator
      control panel.
The Tk-Aqua adaptation of msg was improved. The command buttons are now positioned at the
top of the window and are available at all times, not just when the General page is selected.

2005/07/26 Bill Poser version 8.3
The GUI does a better job of identifying the operating system.
The GUI's adaptation to Tk-Aqua under Mac OS X has been improved.

2005/07/24 Bill Poser version 8.2
Added -W flag to allow the user to set which characters are treated
      as separators in the sort order file.
Improved error messages regarding invalid UTF-8 input.
Modified msg to handle -W flag.
Fixed bug with optional keys.
Fixed bug in sort order specification in msg.
Fixed a bug in ASCII case-folding.
Updated manual page and reference manual.

2005-07-21 Bill Poser version 8.1
Fixed a bug that interfered with use of multigraphs.
Adds a modified configuration of msg adapted to the Aqua implementation of Tk. 
msort can now be conditionally compiled without the internationalization and
      localization libraries that are not available under Mac OS X.
	
2005-07-20 Bill Poser version 8.0
msort now accepts UTF-8 Unicode.
The command line flag -B was added.
   This flag, if present, restricts the characters in the input
   and sort order specifications to the BMP. 
The command line flag -p was added. 
    This flag instructs msort not to make internal use of the
    Unicode Private Use areas.
In GUI added character entry tools.
Replaced Henry Spencer regexp with TRE in msort.
Added help link for regular expression syntax.
Instead of just trying to use the default browser, msg now
	works through a list of browsers, trying each in turn.

2005-05-20 Bill Poser version 7.1
It is now possible to reorder keys by dragging one button over the other.
Miscellaneous improvements in error-handling and error-reporting in msg.

2005-04-25 Bill Poser version 7.0
Eliminated the restriction of exclusions to lexicographic and string length keys.
Added iso8601 date/time combinations as a key type.
Added gui msg.tcl.

2005-04-24 Bill Poser version 6.20
Added case-folding option.
Changed default date format to International Date Format (y-m-d).

2005-04-22 Bill Poser version 6.19
Correct manual, man page, and option printout
to show date format as a key-specific option.

2005-02-03 Bill Poser version 6.18
Improved man page and reference manual.

2005-01-21 Bill Poser version 6.17
Eliminated -a option since it was not useful.

2003-12-15 Bill Poser version 6.14
 Changed MAXMULTIGRAPHS in limits.h to 65279 from 65280. The maximum value now fits
 within an unsigned short, as it should.

2003-11-19 Bill Poser version 6.13
 Fixed bug in error message for bad position code in GetExclusions.

2003-11-14 Bill Poser version 6.12
 Fixed bug in allocation of temporary storage and cleaned up a bit.

2003-11-11 Bill Poser version 6.11
 Now allocate temporary storage for tag strings dynamically, eliminating
 unlikely but possible overflow bug.

2003-02-27 Bill Poser version 6.10
 Added test for field reduced to zero length by exclusions. If the field is
 optional, such a field is treated as an optional field that was missing outright.
 If not, the error is reported and execution aborted.
 Also corrected error in Compare()  that reversed the intended semantics of the
 arguments to -o.

2003-02-24 Bill Poser version 6.9
 Fixed bug that caused a crash when -t flag was used.

2003-02-22 Bill Poser version 6.8
 Added comma delimitation of comparison count to make more readable.

2003-02-21 Bill Poser version 6.7
 Fixed serious bug in storage reallocation for InputText.

2003-02-19 Bill Poser version 6.6
 Now catch and abort on field numbered 0.

2003-02-18 Bill Poser version 6.5
 Eliminated limit on record size by allocating dynamically.
 Eliminated associated command line flag B.

2003-02-17 Bill Poser version 6.4
 Eliminated external sort option.
 Eliminated DOS-related conditional code.

2003-02-17 Bill Poser version 6.3
 Added ability to sort on key size.
 Added count of number of comparisons, which is useful for understanding what
       is happening.
 Added logging-report of global inversion of order.

2003-02-16 Bill Poser version 6.2 = RCS version 1.10 of msort.c
 Modified DescribeKeys to produce more readable description of positional keys.

2003-02-16 Bill Poser version 6.1
 Fixed bug that allowed no keys to be specified without specifying sort on whole record.
 This resulted in a non-sort since the key count was 0.
 Fixed bug that resulted in presence of keys not being correctly detected in
 Compare due to reversed test.

2002-12-23 Bill Poser version 6.0
 Added argument to -o flag for specifying missing key comparison.
 Added field to keyinfo structure to store this.
 Modified comparison code to test for missing keys and handle them
 as specified by the argument to -o.

2002-12-22 Bill Poser version 5.23
 Increased 8-bit MAXMULTIGRAPHS to 65,279, which is the maximum possible.
 all values above 0xFF are available as codes for multigraphs.
 That is 2^16 - 2^8 = 65,280.

2002-11-16 Bill Poser version 5.22
 Added X flag to permit context-free exclusions to be specified on command line.

2002-06-02 Bill Poser version 5.21
 Eliminated log entry of what file exclusions were read from.

2002-05-05 Bill Poser version 5.20
 Added more specific information to log entry regarding ill-formed keys.
 Previously this information was written on stderr, but the log only
 received a copy of the ill-formed record.

2002-05-03 Bill Poser version 5.19
 Increased number of digits for count of records processed.
 Also increased REPORTINTERVAL since machines have gotten so much faster.

2000-05-18 Bill Poser version 5.18
 Now don't print warning about singleton record if not verbose.

2000-01-03 Bill Poser version 5.17
  Eliminated limit on length of sort order file line by allocating
  dynamically

1999-04-04 Bill Poser version 5.16
  Eliminated error exit on fewer than two records. A warning is still
  printed, but the run is completed. This facilitates use in shell scripts.

1998-12-21 Bill Poser version 5.15
  Fixed bug in DescribeKeys.
  Improved information about command line flags.

1996-11-14 Bill Poser version 5.14
  Eliminited limit on length of input file name by allocating
  dynamically.

1996-06-30 Bill Poser version 5.13
  Eliminated limit on number of field delimiter characters by allocating
  dynamically.

1996-01-19 Bill Poser version 5.12
  Eliminated limit on length of tag regexp by allocating dynamically.

1995-12-23 Bill Poser version 5.11
  Eliminated limit on maximum number of fields per record by allocating
  storage for pointers dynamically.

1995-08-20 Bill Poser version 5.10
  Improved format of report of exclusions and eliminated report
  on stderr of reading exclusion file.

1995-05-16 Bill Poser version 5.9
  Eliminated elapsed time computation and report.

1994-12-18 Bill Poser version 5.8 
  Now allow spaces as separators in sort order file, with backslash
  escape when space is to be entered in sort order.

1994-09-23 Bill Poser version 5.7
  Increased DEFMAXCHUNK. 

1994-09-22 Bill Poser version 5.6
  Fixed bug in -o flag.
  Eliminated typo in usage.

1994-06-21 Bill Poser version 5.5
  Added initial newlines to error reports from PutExtRecord.

1994-04-26 Bill Poser version 5.4
  Moved description of exclusions into DescribeKeys so as to associate
  with appropriate key.

1994-04-05 Bill Poser version 5.3
  Slightly modified report of elapsed time and time per record.

1994-04-3 Bill Poser version 5.2
  Corrected size of ymdinfo.fmt.

1994-03-24 Bill Poser version 5.1
  Added specification of separators to date format.

1994-03-19 Bill Poser version 5.0
  Increased maximum number of keys to 16.
  Moved limits into limits.h and made it unnecessary to transfer
     information manually into PrintLimits().
  Generalized forward-inverse option so as to apply to textual
     keys as well as numeric ones. 
     Added separate -i flag for setting inversion. Renamed
     old -i flag, which specified global inversion, -I.
  Removed -N flag and added -c flag with argument to specify
    comparison type.
  Added time key type.
  Eliminated multigraph count in DescribeKeys if count is 0. 
  Added date key type, with date format info in keyinfo and init,
    and -f flag for setting date format.
  Added -o flag for making key optional instead of using ? before key spec.
  Eliminated option of writing on file. 

1994-03-10 Bill Poser version 4.13
  Eliminated dump of sort order in log, which was once useful for
  debugging but has little use now.

1994-03-03 Bill Poser version 4.12
  Fixed bug in DescribeKeys when sorting on whole record. 

1994-02-08 Bill Poser version 4.11
  Now allow numeric key field specification to be negative,
  interpretated as counting from end of record.

1994-01-20 Bill Poser version 4.10
  Now allocate MapTable dynamically, when needed, saving a fair
  amount of space when a key does not use any multigraphs.
  Added log entries regarding dynamic memory usage. 
  Now allocate numeric keys dynamically. This makes numeric comparisons
  very slightly slower and wastes 4 bytes per record per key for numeric
  keys, but it saves 4 bytes per record per key for textual keys,
  which are far more common in the intended applications of this
  program. 

1993-12-21 Bill Poser version 4.9
  Reserved highest rank table slot for default key, reducing
    number of multigraphs permitted by one.
  Added copy of command line to log.
  Fixed small bug in exclusion code.

1993-12-19 Bill Poser version 4.8
  Fixed bug in default key for records without optional key.
  Changed reports of records processed and written to in place. 

1993-11-28 Bill Poser version 4.7 
  Inlined comparisons, saving a little code space and maybe some time.

1993-11-27 Bill Poser version 4.6
  Reduced size of each record slightly (typically 4 bytes) by making
  file position and text pointer a union.

1993-11-27 Bill Poser version 4.5
   Rescinded replacement of lexicographic keys with ranks, which was
   buggy and not worth fixing since the speedup was so small.
   Now use quicksort instead of shellsort for large numbers of records.
   Added -a flag to warn that input is almost in order, in which case
   shellsort is used for any number of records. 
   Also changed report of time per record to integral milliseconds. 

1993-10-28  Bill Poser version 4.4
   Replaced lexicographic keys with ranks.

1993-10-27  Bill Poser version 4.3
   Added reversal of key field, for producing reverse dictionaries.
   Changed max record flag to -M so as to use -R for this.

1993-07-05  Bill Poser version 4.2
   Made key code type compile time option. If only 7-bit ASCII
   input is desired and 128 multigraphs are sufficient, msort can
   be compiled with SEVENBIT defined and will use half the space for
   textual keys. 

1993-07-04  Bill Poser version 4.1
   Modified format of elapsed time report and moved into library functions.

1993-06-26 Bill Poser version 4.0
   Added exclusions. 
   Reduced other memory usage, mostly by allocating RankTables only
      as needed. 

1993-06-20 Bill Poser version 3.3
   Sent info on flags, defaults, and limits to log as well as stdout.
   Sent ill-formed regexp to log, and copy of beginning of overlong
      record.
   Fixed bug in bug fix in version 3.2 in report of ill-formed keys.

1993-06-15 Bill Poser version 3.1
   Add information to log.
   Improved formatting of summary of keys.
 
1993-06-19 Bill Poser version 3.2
   Fixed bug that led to incorrect reports of ill-formed keys
   on external sorts. 

1993-06-12 Bill Poser version 3.0
   Implemented optional keys.
   Added information to log. 
   Fixed bug fixed in version 1.6 and re-introduced in version 2.2
      that prevented a string from sorting before a string of
      which it is a prefix.

1993-06-10 Bill Poser version2.4
Improved correspondance of output to input.
Fixed bugs in input routines.

1993-05-30 Bill Poser version2.3
Made minor modifications to PrintDefaults and PrintFlags.

1993-05-19 Bill Poser version 2.2
   Added backslash escapes to sort order file, -d and -r flags.
   Added logging to msort.log.
Arranged for characters for which no rank is specified to
be ranked after all others, in ASCII order.

1993-05-17 Bill Poser version 2.1
   Added specification of record separator.

1993-05-17 Bill Poser version 2.0
   Added subsorts.

1993-04-24 Bill Poser version 1.6
   Fixed bug in ConstructSortOrder. Strings are now guaranteed
   to sort after prefixes to them.

1993-04-16 Bill Poser version1.5
   Fixed bug in ConstructSortOrder. Spaces are now handled properly.

1993-04-12 Bill Poser version 1.4
   Changed tag specification from strings to regular expressions.
   Improved formatting of record processing progress report.
   Added progress report on write-out of sorted records.

1993-04-11 Bill Poser version 1.3
   Eliminated limitation on number of records.

1993-04-10 Bill Poser version 1.2
   Added listing of multigraphs.
   Added numerical sort option.
   Rationalized flags.

1993-04-05 Bill Poser version 1.0