File: CHANGELOG.rst

package info (click to toggle)
python-clevercsv 0.8.3%2Bds-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid, trixie
  • size: 2,076 kB
  • sloc: python: 6,184; ansic: 870; makefile: 90
file content (407 lines) | stat: -rw-r--r-- 9,716 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407

Changelog
=========

Version 0.8.3
-------------


* Allow users to specify output encoding for some CLI commands (thanks to 
  @jbdesbas)
* Optimize the normal-form detection (thanks to @no23reason)
* Internal: fix names of C modules

Version 0.8.2
-------------


* Add more type hints to CleverCSV
* Move the import of the optional tabview dependency to where it's needed (for 
  #101)
* Allow inspecting more rows for header detection (fixes #98)

Version 0.8.1
-------------


* Add type hints to CleverCSV
* Disable 32-bit builds on Windows and Linux
* Bump minimal Python version to 3.8
* Minor documentation improvements

Version 0.8.0
-------------


* Improve median runtime by ~68% (~52% on average) by: 1) more caching, 2) 
  implementing a heavy function in C.
* Redesign computation of consistency measure to a class: 
  ``ConsistencyDetector``.
* Fix potential memory leak in C code for base abstraction
* Fixes to escape sequences in regexes (thanks to @JakobGM!)
* Various improvements to code quality
* Switch documentation style to `furo <https://pypi.org/project/furo/>`_.

Version 0.7.7
-------------


* Use r-prefix for regex patterns (thanks to @JakobGM!)
* Fix documentation typo (thanks to @Aritra8438!)

Version 0.7.6
-------------


* Simplify faust-cchardet import for Windows builds

Version 0.7.5
-------------


* Add support for Python 3.11 by fixing a bug regarding empty strings in 
  dialects (thanks to @stefanor!)
* Fix installation error due to change in internals at setuptools (thanks to 
  @mweinelt!)
* Migrate to `faust-cchardet <https://github.com/faust-streaming/cChardet/>`_ as 
  cChardet fails to install on Python 3.11 (on Windows, currently only chardet 
  will work for Python 3.11)
* Migrate to `packaging <https://github.com/pypa/packaging>`_ for version 
  comparison

Version 0.7.4
-------------


* Add wrapper for writing a list of dictionaries (write_dicts)
* Fix bug when writing CSVs using the ``csv`` module dialects
* Add the builtin dialects to CleverCSV (e.g., ``clevercsv.excel``\ )

Version 0.7.3
-------------


* Release to build wheels for Python 3.10

Version 0.7.2
-------------


* Re-implement command line interface using Wilderness
* Add man-pages to package

Version 0.7.1
-------------


* Remove deprecated wrapper functions
* Expand URL regex to support ``localhost:<port>`` urls
* Minor changes to the TypeDetector API
* Add cChardet as optional dependency (fixes 
  `#48 <https://github.com/alan-turing-institute/CleverCSV/issues/48>`_\ )

Version 0.7.0
-------------


* Add a JSON object data type to address a specific failure case 
  (\ `#37 <https://github.com/alan-turing-institute/CleverCSV/issues/37>`_\ ).
* Add support for timezones for time data type
* Add support for building wheels on non-native architectures 
  (\ `#39 <https://github.com/alan-turing-institute/CleverCSV/issues/39>`_\ ).
* Add a flag to disable skipping type detection using the command line 
  interface.

Version 0.6.8
-------------


* Add a "bytearray" type to address a specific failure case 
  (\ `#35 <https://github.com/alan-turing-institute/CleverCSV/issues/35>`_\ ).
* Minor clarifications to licensing.

Version 0.6.7
-------------


* Updates to release process. This version introduces pre-compiled wheels for 
  Python 3.9.

Version 0.6.6
-------------


* Add an ``encoding`` argument to ``write_table`` to allow specifying the output 
  encoding. Thanks to @mitchgrogg for reporting `issue 
  #27 <https://github.com/alan-turing-institute/CleverCSV/issues/27>`_.

Version 0.6.5
-------------


* Add support for standardizing in-place and standardizing multiple files.
* Add warning on duplicate field names in DictReader
* Add return value to writers to match the standard library.

Version 0.6.4
-------------


* Various speed ups to constructing the list of potential dialects. This 
  removes a costly step of the detection process that will likely add a few 
  more potential dialects, but has the end result of making overall dialect 
  detection faster.

Version 0.6.3
-------------


* Rename wrapper functions to a more coherent naming scheme. Old names will be 
  available until 0.7.0, but now produce a FutureWarning.
* Add ``stream_dicts`` wrapper function.
* Improve handling of file encoding for the ``read_dataframe`` wrapper: 
  detected encoding is now passed on to Pandas.
* Fix handling of optional dependency error for TabView on non-Windows 
  platforms.

Version 0.6.2
-------------


* Update URL regex to avoid catastrophic backtracking and increase 
  performance. See `issue 
  #13 <https://github.com/alan-turing-institute/CleverCSV/issues/13>`_ and 
  `issue #15 <https://github.com/alan-turing-institute/CleverCSV/issues/15>`_. 
  Thanks to @kaskawu for the fix and @jlumbroso for re-raising the issue.
* Add ``num_chars`` keyword argument to ``read_as_dicts`` and ``csv2df`` 
  wrappers.
* Improve documentation w.r.t. handling large files. Thanks to @jlumbroso for 
  raising this issue.

Version 0.6.1
-------------


* Add an ``explore`` command to the command line application for CleverCSV. 
  This command makes it easy to start exploring a CSV file using the Python 
  interactive shell.

Version 0.6.0
-------------


* Split the package into a "core" and "full" version. This allows users who 
  only need the improved dialect detection functionality to download a version 
  with a smaller footprint. Fixes `issue 
  #10 <https://github.com/alan-turing-institute/CleverCSV/issues/10>`_\ ]. Thanks 
  to @seperman.

Version 0.5.6
-------------


* Fix speed of ``unix_path`` regex used in type detection. (\ `issue 
  #13 <https://github.com/alan-turing-institute/CleverCSV/issues/13>`_\ ). Thanks 
  to @kaskawu.

Version 0.5.5
-------------


* Add ``stream_csv`` wrapper that returns a generator over rows
* Minor update to the URL type detection
* Documentation updates

Version 0.5.4
-------------


* Fix bugs discovered from fuzz testing (\ `issue 
  #7 <https://github.com/alan-turing-institute/CleverCSV/issues/7>`_\ )
* Minor changes to readme and code quality

Version 0.5.3
-------------


* Fix using nan as default value when skipping a dialect (\ `issue 
  #5 <https://github.com/alan-turing-institute/CleverCSV/issues/5>`_\ )

Version 0.5.2
-------------


* Bump version to fix wheel building

Version 0.5.1
-------------


* Bump version to fix wheel building

Version 0.5.0
-------------


* Improve type detection for quoted alphanumeric cells (#4)
* Pass ``strict`` dialect property to parser.

Version 0.4.7
-------------


* Bugfix for ``write_table`` wrapper on Windows.
* Move building Windows platform wheels to Travis.
* Use ``cibuildwheel`` version 1.0.0 for building wheels.

Version 0.4.6
-------------


* Add a wrapper function that writes a table to a CSV file.

Version 0.4.5
-------------


* Update CleverCSV to match updated clikit dependency
* Fix dependency versions for clikit and cleo

Version 0.4.4
-------------


* Update ``standardize`` command to use CRLF line endings on all platforms.
* Add work around for Tabview being unavailable on Windows.
* Remove packaging and dependency management with Poetry.
* Add support for building platform wheels on Travis and AppVeyor.

Version 0.4.3
-------------


* Add optional ``method`` parameter to dialect detector.
* Bugfix for ``clevercsv code`` command when the delimiter is tab.

Version 0.4.2
-------------


* Fix a failing build due to dependency version mismatch

Version 0.4.1
-------------


* Allow underscore in alphanumeric strings
* Update unix path regular expression
* Add more integration tests and log detection method

Version 0.4.0
-------------


* Update URL regular expression and add unit tests
* Add IPv4 type detection
* Add tie-breaker for combined quotechar and escapechar ties

Version 0.3.7
-------------


* Bugfix for console script ``code`` command
* Update readme

Version 0.3.6
-------------


* Cleanly handle failure to detect dialect in console application
* Remove any (partial) support for Python 2

Version 0.3.5
-------------


* Remove Python parser - this speeds up file reading and tie breaking

Version 0.3.4
-------------


* Ensure the C parser is used in the ``reader``.
* Update integration tests to improve error handling
* Readme updates

Version 0.3.3
-------------


* Ensure detected encoding is in the generated Python code for the ``clevercsv 
  code`` command.
* Ensure encoding is detected in ``wrappers.detect_dialect``.
* Bugfix in integration test
* Expand readme

Version 0.3.2
-------------


* Add documentation on `Read the Docs <https://clevercsv.readthedocs.io/>`_
* Use requirements.txt file for dependencies when packaging

Version 0.3.1
-------------


* Add help description to each CLI command
* Update README
* Add transpose flag for ``standardize`` and ``view`` commands

Version 0.3.0
-------------


* Rewrite console application using Cleo
* Add unit tests for console application
* Add ``detect_dialect`` wrapper function
* Add support for "unix_path" data type in type detection
* Add ``encoding`` and ``num_chars`` options to ``read_csv`` wrapper
* Add ``-p/--pandas`` flag to ``code`` command to generate Pandas output.

Version 0.2.5
-------------


* Rename ``read_as_lol`` to ``read_csv``.

Version 0.2.4
-------------


* Allow setting the number of characters to read
* Simplify printing of skipped potential dialects

Version 0.2.3
-------------


* Add ``read_as_lol`` wrapper function.

Version 0.2.2
-------------


* Add ``code`` command to ``clevercsv`` command line program.

Version 0.2.1
-------------


* Bugfix to update executable to new name

Version 0.2.0
-------------


* Rename package to clevercsv