File: chipz-doc.txt

package info (click to toggle)
cl-chipz 20160318-1
  • links: PTS, VCS
  • area: main
  • in suites: stretch
  • size: 260 kB
  • sloc: lisp: 2,515; makefile: 13
file content (286 lines) | stat: -rw-r--r-- 11,687 bytes parent folder | download | duplicates (6)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
(:author "Nathan Froyd"
 :email "froydnj@gmail.com"
 :package "Chipz"
 :cl-package "CHIPZ"
 :version #.(asdf:component-version (asdf:find-system :chipz))
 :homepage "http://www.method-combination.net/lisp/chipz/"
 :download "http://www.method-combination.net/lisp/files/chipz.tar.gz")

(:h1 ${package})

(:p ${package} " is a library for decompressing DEFLATE and BZIP2 data.
DEFLATE data, defined in " (:url "http://www.ietf.org/rfc/rfc1951.txt"
"RFC1951") ", forms the core of popular compression formats such as
zlib (" (:url "http://www.ietf.org/rfc/rfc1950.txt" "RFC 1950") ") and
gzip (" (:url "http://www.ietf.org/rfc/rfc1952.txt" "RFC 1952") ").  As
such, " ${package} " also provides for decompressing data in those
formats as well.  BZIP2 is the format used by the popular compression
tool " (:url "http://www.bzip.org/" "bzip2") ".")

(:p ${package} " is the reading complement to " (:url
"http://www.xach.com/salza2/" "Salza") ".")

(:h2 "Installation")

(:p ${package} " can be downloaded at " (:url ${download} ${download})
".  The latest version is " ${version} ".")

(:p "It comes with an ASDF system definition, so " `(ASDF:OOS
'ASDF:LOAD-OP :CHIPZ)` " should be all that you need to get started.")

(:h2 "License")

(:p ${package} " is released under a MIT-like license; you can do pretty
much anything you want to with the code except claim that you wrote
it.")

(:h2 "Using the library")

(:p "The main function of the library is " `decompress` ":")

(:describe :generic-function (chipz:decompress output))

(:p "Five distinct use cases are covered by this single function:")

(:ul
(:li "Decompressing from an octet vector to a fresh octet vector;")
(:li "Decompressing from a stream to a fresh octet vector;")
(:li "Decompressing from an octet vector to a user-specified octet vector;")
(:li "Decompressing from an octet vector to a stream;")
(:li "Decompressing from a stream to a stream;"))

(:note ${package} " does not provide for decompressing data from a stream
to a user-specified buffer, as the buffer management involved cannot be
done automatically by the library--the application must be involved in
this case.")

(:h3 ((:a name "one-shot")) "One-shot decompression")

(:p "The first and second use cases above are intended to be convenient
\"one-shot\" decompression methods.  Therefore, although the description
of the following methods attached to this generic function have an "
`decompression-state` " parameter, as returned by " @make-dstate ",
respectively, the usual way to use them will be to provide a "
`format` " argument.  This " `format` " argument should be one of:")

(:ul
(:li `chipz:bzip2` " for decompressing data in the bzip2 format;")
(:li `chipz:gzip` " for decompressing data in the gzip format;")
(:li `chipz:zlib` " for decompressing data in the zlib format;")
(:li `chipz:deflate` " for decompressing data in the deflate format."))

(:p "The " `format` " argument can also be a keyword, such as "
`:gzip` ", for backwards compatibility.  Using symbols in the " `CHIPZ`
" package is preferred, however.")

(:p "Most applications will use " `chipz:gzip` " or " `chipz:bzip2` ", a
few applications will use " `chipz:zlib` ", and uses of "
`chipz:deflate` " will probably be few and far between.")

(:p "All the method signatures described below also accept a "
`format` " argument in lieu of an " `decompression-state` " argument.")

(:p "The signatures of the first two methods are as follows.")

(:describe :method (chipz:decompress (null chipz:decompression-state vector) output))
(:describe :method (chipz:decompress (null chipz:decompression-state stream) output))

(:p "A simple function to retrieve the contents of a gzip-compressed
file, then, might be:")

(:pre
"(defun gzip-contents (pathname)
  (with-open-file (stream pathname :direction :input
                                   :element-type '(unsigned-byte 8))
    (chipz:decompress nil 'chipz:gzip stream)))")

(:p "These one-shot methods also support a " `:buffer-size` " argument
as a hint of the size of decompressed data.  The library uses this to
pre-allocate the output buffer to the hinted size.  Therefore, if you
know the size of the decompressed data or have a good estimate, fewer
allocations will be done, leading to slightly better performance.  If "
`:buffer-size` " is not provided or proves to be too small, the library
will of course grow the output buffer as necessary.")

(:h3 "Decompressing to a vector")

(:p "An alternate way to deal with compressed data is to read in a
buffer's worth of data, decompress the buffer, and then deal with any
remaining input and the produced output, looping to read and process
more data as appropriate.  This scheme is the third use case
described above and is handled in zlib with the " (:tt "inflate") "
function.  In " ${package} ", it is just another method of " `decompress`
".")

(:describe :method (chipz:decompress (vector chipz:decompression-state vector)  (values n-bytes-consumed n-bytes-produced)))

(:p "This method decompresses the data from " 'input' " between "
'input-start' " and " 'input-end' " and place the uncompressed data in "
'output' ", limited by " 'output-start' " and " 'output-end' ".  Please
note that it is possible to consume some or all of the input without
producing any output and to produce some or all of the output without
consuming any input.")

(:p "As above, you can use a " `format` " argument instead of an "
`decompression-state` ".  You will usually not want to do this unless
you know exactly how large the decompressed data is going to be;
otherwise, you will only decompress a portion of the data and any
intermediate state required to decompress the remainder of the data will
be thrown away.")

(:h3 "Decompressing to a stream")

(:p "Finally, " `decompress` " can also be used to write the
decompressed data directly to a stream, enabling a poor man's gunzip
function:")

(:pre "(defun gunzip (gzip-filename output-filename)
  (with-open-file (gzstream gzip-filename :direction :input
                            :element-type '(unsigned-byte 8))
    (with-open-file (stream output-filename :direction :output
                            :element-type '(unsigned-byte 8)
                            :if-exists :supersede)
      (chipz:decompress stream 'chipz:gzip gzstream)
      output-filename)))")

(:p "The relevant methods in this case are:")

(:describe :method (chipz:decompress (stream chipz:decompression-state vector) stream))
(:describe :method (chipz:decompress (stream chipz:decompression-state stream) stream))

(:p "Both return the output stream.")

(:h3 "Creating " `decompression-state` " objects")

(:p "The core data structure of " ${package} " is a "
`decompression-state` ", which stores the internal state of an ongoing
decompression process.  You create a " `decompression-state` " with "
@make-dstate ".")

(:describe :function (chipz:make-dstate dstate))

(:p "Return an " `decompression-state` " object suitable for
uncompressing data in " 'data-format' ".  " 'data-format' " should be:")

(:ul
(:li `chipz:bzip2` " for decompressing data in the bzip2 format;")
(:li `chipz:gzip` " for decompressing data in the gzip format;")
(:li `chipz:zlib` " for decompressing data in the zlib format;")
(:li `chipz:deflate` " for decompressing data in the deflate format."))

(:p "As with " @decompress ", you can use keywords instead, but doing so
is deprecated.")

(:p "Prior to adding bzip2 support, " ${package} " supported only
deflate-based formats.  " @make-inflate-state " was the primary
interface then; it is now deprecated, but kept around for backwards
compatibility.")

(:describe :function (chipz:make-inflate-state inflate-state))

(:p @make-inflate-state " supports the same " 'data-format' " arguments
as " @make-dstate " does, with the obvious exception of "
'chipz:bzip2' ".  The " `inflate-state` " object returned is a "
`decompression-state` ", so it can be passed to " @decompress " and "
@finish-dstate ".")

(:p "Once you are done with a " `decompression-state` " object, you must
call " @finish-dstate " on it.  " @finish-dstate " checks that the
given " 'state' " decompressed all the data in a given stream.  It does
not dispose of any resources associated with " 'state' "; it is meant
purely as an error-checking construct.  Therefore, it is inappropriate
to call from, say, the cleanup forms of " (:tt "UNWIND-PROTECT") ".  The
cleanup forms may be run when an error is thrown during decompression
and of course the stream will only be partially decompressed at that
point.")

(:describe :function (chipz:finish-dstate t))

(:p @finish-inflate-state " does the same thing, but only for "
`inflate-state` ".  Its use, like that of " @make-inflate-state " is
deprecated.")

(:describe :function (chipz:finish-inflate-state t))

(:h2 "Gray streams")

(:p ${package} " includes support for creating Gray streams to wrap
streams containing compressed data and read the uncompressed data from
those streams.  SBCL, Allegro, Lispworks, CMUCL, and OpenMCL are
supported at this time.")

(:describe :function (chipz::make-decompressing-stream decompressing-stream))

(:p "Return a stream that provides transparent decompression of the data
from " 'stream' " in " 'format' ".  That is, " `read-byte` " and "
`read-sequence` " will decompress the data read from " 'stream' " and
return portions of the decompressed data as requested.  " 'format' " is
as in the " ((:a href "#one-shot") "one-shot decompression
methods") ".")

(:h2 "Conditions")

(:describe :condition chipz-error)

(:p "All errors signaled by " ${package} " are of this type.")

(:describe :condition invalid-format-error)

(:p "This error is signaled when the " 'format' " argument to "
@decompress " or " @make-dstate " is not one of the symbols specified
for " @make-dstate ".  This error is also signaled in "
@make-inflate-state " if the " 'format' " argument is not valid for that
function.")

(:describe :condition decompression-error)

(:p "All errors signaled during decompression are of this type.")

(:describe :condition invalid-checksum-error)

(:p "The zlib, gzip, and bzip2 formats all contain checksums to verify
the integrity of the uncompressed data; this error is signaled when the
stored checksum is found to be inconsistent with the checksum computed
by " ${package} ".  It indicates that the compressed data has probably
been corrupted in some fashion (or there is an error in " ${package}
").")

(:describe :condition premature-end-of-stream)

(:p "This error is signaled when " @finish-dstate " is
called on an " `decompression-state` " that has not finished processing
an entire decompressed data stream.")

(:describe :condition inflate-error)

(:p "All errors signaled while decompressing deflate-based formats are
of this type.")

(:describe :condition invalid-zlib-header-error)

(:p "This error is signaled when an invalid zlib header is read.")

(:describe :condition invalid-gzip-header-error)

(:p "This error is signaled when an invalid gzip header is read.")

(:describe :condition reserved-block-type-error)

(:p "This error is signaled when a deflate block is read whose
type is 3.  This type is reserved for future expansion and should
not be found in the wild.")

(:describe :condition invalid-stored-block-length-error)

(:p "This error is signaled when the length of a deflate stored
block is found to be invalid.")

(:describe :condition bzip2-error)

(:p "All errors signaled while decompressing bzip2-based formats are of
this type.")

(:describe :condition invalid-bzip2-data)

(:p "This error is signaled when the compressed bzip2 data is found to
be corrupt in some way that prevents further decompression.")