File: changes-30.rst

package info (click to toggle)
feedparser 6.0.12-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 10,540 kB
  • sloc: xml: 11,459; python: 4,575; makefile: 7
file content (226 lines) | stat: -rw-r--r-- 7,761 bytes parent folder | download | duplicates (7)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
Changes in version 3.0
======================


:program:`Universal Feed Parser` 3.0 was released on June 21, 2004.

- don't try ``iso-8859-1`` (can't distinguish between ``iso-8859-1`` and ``windows-1252`` anyway, and most incorrectly marked feeds are ``windows-1252``)

- fixed regression that could cause the same encoding to be tried twice (even if it failed the first time)


:program:`Universal Feed Parser` 3.0fc3 was released on June 18, 2004.

- fixed bug in ``_changeEncodingDeclaration`` that failed to parse UTF-16 encoded feeds

- made ``source`` into a FeedParserDict

- duplicate admin:generatorAgent/@rdf:resource in ``generator_detail.url``

- added support for image

- refactored ``parse()`` fallback logic to try other encodings if SAX parsing fails (previously it would only try other encodings if re-encoding failed)

- remove ``unichr`` madness in normalize_attrs now that we're properly tracking encoding in and out of BaseHTMLProcessor

- set ``feed.language`` from root-level xml:lang

- set ``entry.id`` from rdf:about

- send ``Accept`` header


:program:`Universal Feed Parser` 3.0fc2 was released on May 10, 2004.

- added and passed Sam's amp tests

- added and passed my blink tag tests


:program:`Universal Feed Parser` 3.0fc1 was released on April 23, 2004.

- made ``results.entries[0].links[0]`` and ``results.entries[0].enclosures[0]`` into FeedParserDict

- fixed typo that could cause the same encoding to be tried twice (even if it failed the first time)

- fixed DOCTYPE stripping when DOCTYPE contained entity declarations

- better textinput and image tracking in illformed :abbr:`RSS (Rich Site Summary)` 1.0 feeds


:program:`Universal Feed Parser` 3.0b23 was released on April 21, 2004.

- fixed ``UnicodeDecodeError`` for feeds that contain high-bit characters in attributes in embedded :abbr:`HTML (HyperText Markup Language)` in description (thanks Thijs van de Vossen)

- moved ``guid``, ``date``, and ``date_parsed`` to mapped keys in FeedParserDict

- tweaked FeedParserDict.has_key to return ``True`` if asking about a mapped key


:program:`Universal Feed Parser` 3.0b22 was released on April 19, 2004.

- changed ``channel`` to ``feed``, ``item`` to ``entries`` in ``results`` dict

- changed ``results`` dict to allow getting values with ``results.key`` as well as ``results[key]``

- work around embedded illformed :abbr:`HTML (HyperText Markup Language)` with half a DOCTYPE

- work around malformed ``Content-Type`` header

- if character encoding is wrong, try several common ones before falling back to regexes (if this works, ``bozo_exception`` is set to ``CharacterEncodingOverride``

- fixed character encoding issues in BaseHTMLProcessor by tracking encoding and converting from Unicode to raw strings before feeding data to sgmllib.SGMLParser

- convert each value in results to Unicode (if possible), even if using regex-based parsing


:program:`Universal Feed Parser` 3.0b21 was released on April 14, 2004.

- added Hot RSS support


:program:`Universal Feed Parser` 3.0b20 was released on April 7, 2004.

- added :abbr:`CDF (Channel Definition Format)` support


:program:`Universal Feed Parser` 3.0b19 was released on March 15, 2004.

- fixed bug exploding author information when author name was in parentheses

- removed ultra-problematic :file:`mxTidy` support

- patch to workaround crash in PyXML/expat when encountering invalid entities (MarkMoraes)

- support for textinput/textInput


:program:`Universal Feed Parser` 3.0b18 was released on February 17, 2004.

- always map description to ``summary_detail`` (Andrei)

- use :file:`libxml2` (if available)


:program:`Universal Feed Parser` 3.0b17 was released on February 13, 2004.

- determine character encoding as per `RFC 3023 <http://www.ietf.org/rfc/rfc3023.txt>`_


:program:`Universal Feed Parser` 3.0b16 was released on February 12, 2004.

- fixed support for :abbr:`RSS (Rich Site Summary)` 0.90 (broken in b15)


:program:`Universal Feed Parser` 3.0b15 was released on February 11, 2004.

- fixed bug resolving relative links in wfw:commentRSS

- fixed bug capturing author and contributor :abbr:`URI (Uniform Resource Identifier)`

- fixed bug resolving relative links in author and contributor :abbr:`URI (Uniform Resource Identifier)`

- fixed bug resolving relative links in generator :abbr:`URI (Uniform Resource Identifier)`

- added support for recognizing :abbr:`RSS (Rich Site Summary)` 1.0

- passed Simon Fell's namespace tests, and included them permanently in the test suite with his permission

- fixed namespace handling under :program:`Python` 2.1


:program:`Universal Feed Parser` 3.0b14 was released on February 8, 2004.

- fixed CDATA handling in non-wellformed feeds under :program:`Python` 2.1


:program:`Universal Feed Parser` 3.0b13 was released on February 8, 2004.

- better handling of empty :abbr:`HTML (HyperText Markup Language)` tags (br, hr, img, etc.) in embedded markup, in either :abbr:`HTML (HyperText Markup Language)` or :abbr:`XHTML (Extensible HyperText Markup Language)` form (<br>, <br/>, <br />)


:program:`Universal Feed Parser` 3.0b12 was released on February 6, 2004.

- fiddled with ``decodeEntities`` (still not right)

- added support to Atom 0.2 subtitle

- added support for Atom content model in copyright

- better sanitizing of dangerous :abbr:`HTML (HyperText Markup Language)` elements with end tags (script, frameset)


:program:`Universal Feed Parser` 3.0b11 was released on February 2, 2004.

- added rights to list of elements that can contain dangerous markup

- fiddled with ``decodeEntities`` (not right)

- liberalized date parsing even further


:program:`Universal Feed Parser` 3.0b10 was released on January 31, 2004.

- incorporated ISO-8601 date parsing routines from :file:`xml.util.iso8601`


:program:`Universal Feed Parser` 3.0b9 was released on January 29, 2004.

- fixed check for presence of ``dict`` function

- added support for summary


:program:`Universal Feed Parser` 3.0b8 was released on January 28, 2004.

- added support for contributor


:program:`Universal Feed Parser` 3.0b7 was released on January 28, 2004.

- support Atom-style author element in ``author_detail`` (dictionary of ``name``, ``url``, ``email``)

- map ``author`` to ``author_detail`` if ``author`` contains name + email address


:program:`Universal Feed Parser` 3.0b6 was released on January 27, 2004.

- added feed type and version detection, ``result['version']`` will be one of ``SUPPORTED_VERSIONS.keys()`` or empty string if unrecognized

- added support for creativeCommons:license and cc:license

- added support for full Atom content model in title, tagline, info, copyright, summary

- fixed bug with gzip encoding (not always telling server we support it when we do)


:program:`Universal Feed Parser` 3.0b5 was released on January 26, 2004.

- fixed bug parsing multiple links at feed level


:program:`Universal Feed Parser` 3.0b4 was released on January 26, 2004.

- fixed xml:lang inheritance

- fixed multiple bugs tracking xml:base :abbr:`URI (Uniform Resource Identifier)`, one for documents that don't define one explicitly and one for documents that define an outer and an inner xml:base that goes out of scope before the end of the document


:program:`Universal Feed Parser` 3.0b3 was released on January 23, 2004.

- parse entire feed with real :abbr:`XML (Extensible Markup Language)` parser (if available)

- added several new supported namespaces

- fixed bug tracking naked markup in description

- added support for enclosure

- added support for source

- re-added support for cloud which got dropped somehow

- added support for expirationDate


:program:`Universal Feed Parser` 3.0b2 and 3.0b1 have been lost in the mists of time.