Changes in version 3.0
======================
:program:`Universal Feed Parser` 3.0 was released on June 21, 2004.
- don't try ``iso-8859-1`` (can't distinguish between ``iso-8859-1`` and ``windows-1252`` anyway, and most incorrectly marked feeds are ``windows-1252``)
- fixed regression that could cause the same encoding to be tried twice (even if it failed the first time)
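
The two changes above concern the order in which fallback encodings are attempted. A minimal sketch of that idea follows, assuming a hypothetical helper ``decode_with_fallback`` and an illustrative candidate list; this is not the parser's actual code. It tries the declared encoding first, skips ``iso-8859-1`` in favor of ``windows-1252``, and never attempts the same encoding twice.

```python
def decode_with_fallback(data, declared=None, candidates=("utf-8", "windows-1252")):
    """Return (text, encoding_used), trying each encoding at most once.

    Hypothetical helper for illustration only. The candidate list is an
    assumption; iso-8859-1 is deliberately absent from it.
    """
    tried = set()
    for encoding in (declared, *candidates):
        if not encoding:
            continue
        key = encoding.lower()
        if key in tried:
            continue  # never retry an encoding that has already failed
        tried.add(key)
        try:
            return data.decode(encoding), encoding
        except (UnicodeDecodeError, LookupError):
            continue  # wrong or unknown encoding: move on to the next one
    raise ValueError("could not decode data with any candidate encoding")
```

For example, a feed declared as ``utf-8`` but actually encoded as ``windows-1252`` is decoded on the second distinct attempt, because the repeated ``utf-8`` candidate is skipped.
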
:program:`Universal Feed Parser` 3.0fc3 was released on June 18, 2004.
- fixed bug in ``_changeEncodingDeclaration`` that failed to parse UTF-16 encoded feeds
- made ``source`` into a FeedParserDict
- duplicate admin:generatorAgent/@rdf:resource in ``generator_detail.url``
- added support for image
- refactored ``parse()`` fallback logic to try other encodings if SAX parsing fails (previously it would only try other encodings if re-encoding failed)
- remove ``unichr`` madness in normalize_attrs now that we're properly tracking encoding in and out of BaseHTMLProcessor
- set ``feed.language`` from root-level xml:lang
- set ``entry.id`` from rdf:about
- send ``Accept`` header
:program:`Universal Feed Parser` 3.0fc2 was released on May 10, 2004.
- added and passed Sam's amp tests
- added and passed my blink tag tests
:program:`Universal Feed Parser` 3.0fc1 was released on April 23, 2004.
- made ``results.entries[0].links[0]`` and ``results.entries[0].enclosures[0]`` into FeedParserDict
- fixed typo that could cause the same encoding to be tried twice (even if it failed the first time)
- fixed DOCTYPE stripping when DOCTYPE contained entity declarations
- better textinput and image tracking in illformed :abbr:`RSS (Rich Site Summary)` 1.0 feeds
:program:`Universal Feed Parser` 3.0b23 was released on April 21, 2004.
- fixed ``UnicodeDecodeError`` for feeds that contain high-bit characters in attributes in embedded :abbr:`HTML (HyperText Markup Language)` in description (thanks Thijs van de Vossen)
- moved ``guid``, ``date``, and ``date_parsed`` to mapped keys in FeedParserDict
- tweaked FeedParserDict.has_key to return ``True`` if asking about a mapped key
:program:`Universal Feed Parser` 3.0b22 was released on April 19, 2004.
- changed ``channel`` to ``feed``, ``item`` to ``entries`` in ``results`` dict
- changed ``results`` dict to allow getting values with ``results.key`` as well as ``results[key]``
- work around embedded illformed :abbr:`HTML (HyperText Markup Language)` with half a DOCTYPE
- work around malformed ``Content-Type`` header
- if character encoding is wrong, try several common ones before falling back to regexes (if this works, ``bozo_exception`` is set to ``CharacterEncodingOverride``)
- fixed character encoding issues in BaseHTMLProcessor by tracking encoding and converting from Unicode to raw strings before feeding data to sgmllib.SGMLParser
- convert each value in results to Unicode (if possible), even if using regex-based parsing
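
The change to the ``results`` dict in this release (values reachable as ``results.key`` as well as ``results[key]``) can be sketched with a small ``dict`` subclass. The class name ``AttrDict`` and its ``keymap`` alias table are hypothetical and exist only for illustration; this is not the library's FeedParserDict implementation. Whether the old ``channel`` name keeps resolving to ``feed`` is an assumption made here to show how a mapped key could work.

```python
class AttrDict(dict):
    """Dictionary whose values can also be read as attributes (illustrative)."""

    # Assumed alias table: old key name -> new key name.
    keymap = {"channel": "feed"}

    def __getitem__(self, key):
        # Translate a legacy key to its new name before the lookup.
        return dict.__getitem__(self, self.keymap.get(key, key))

    def __contains__(self, key):
        # Report a legacy key as present when its new name is present.
        return dict.__contains__(self, self.keymap.get(key, key))

    def __getattr__(self, name):
        # Called only when normal attribute lookup fails.
        try:
            return self[name]
        except KeyError:
            raise AttributeError(name) from None


results = AttrDict(feed={"title": "Example"})
assert results.feed is results["feed"]
```

Raising ``AttributeError`` for missing keys keeps ``hasattr()`` and ``getattr()`` with a default behaving as callers expect.
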
:program:`Universal Feed Parser` 3.0b21 was released on April 14, 2004.
- added Hot RSS support
:program:`Universal Feed Parser` 3.0b20 was released on April 7, 2004.
- added :abbr:`CDF (Channel Definition Format)` support
:program:`Universal Feed Parser` 3.0b19 was released on March 15, 2004.
- fixed bug exploding author information when author name was in parentheses
- removed ultra-problematic :file:`mxTidy` support
- patch to work around crash in PyXML/expat when encountering invalid entities (MarkMoraes)
- support for textinput/textInput
:program:`Universal Feed Parser` 3.0b18 was released on February 17, 2004.
- always map description to ``summary_detail`` (Andrei)
- use :file:`libxml2` (if available)
:program:`Universal Feed Parser` 3.0b17 was released on February 13, 2004.
- determine character encoding as per `RFC 3023 <http://www.ietf.org/rfc/rfc3023.txt>`_
:program:`Universal Feed Parser` 3.0b16 was released on February 12, 2004.
- fixed support for :abbr:`RSS (Rich Site Summary)` 0.90 (broken in b15)
:program:`Universal Feed Parser` 3.0b15 was released on February 11, 2004.
- fixed bug resolving relative links in wfw:commentRSS
- fixed bug capturing author and contributor :abbr:`URI (Uniform Resource Identifier)`
- fixed bug resolving relative links in author and contributor :abbr:`URI (Uniform Resource Identifier)`
- fixed bug resolving relative links in generator :abbr:`URI (Uniform Resource Identifier)`
- added support for recognizing :abbr:`RSS (Rich Site Summary)` 1.0
- passed Simon Fell's namespace tests, and included them permanently in the test suite with his permission
- fixed namespace handling under :program:`Python` 2.1
:program:`Universal Feed Parser` 3.0b14 was released on February 8, 2004.
- fixed CDATA handling in non-wellformed feeds under :program:`Python` 2.1
:program:`Universal Feed Parser` 3.0b13 was released on February 8, 2004.
- better handling of empty :abbr:`HTML (HyperText Markup Language)` tags (br, hr, img, etc.) in embedded markup, in either :abbr:`HTML (HyperText Markup Language)` or :abbr:`XHTML (Extensible HyperText Markup Language)` form (<br>, <br/>, <br />)
:program:`Universal Feed Parser` 3.0b12 was released on February 6, 2004.
- fiddled with ``decodeEntities`` (still not right)
- added support for Atom 0.2 subtitle
- added support for Atom content model in copyright
- better sanitizing of dangerous :abbr:`HTML (HyperText Markup Language)` elements with end tags (script, frameset)
:program:`Universal Feed Parser` 3.0b11 was released on February 2, 2004.
- added rights to list of elements that can contain dangerous markup
- fiddled with ``decodeEntities`` (not right)
- liberalized date parsing even further
:program:`Universal Feed Parser` 3.0b10 was released on January 31, 2004.
- incorporated ISO-8601 date parsing routines from :file:`xml.util.iso8601`
:program:`Universal Feed Parser` 3.0b9 was released on January 29, 2004.
- fixed check for presence of ``dict`` function
- added support for summary
:program:`Universal Feed Parser` 3.0b8 was released on January 28, 2004.
- added support for contributor
:program:`Universal Feed Parser` 3.0b7 was released on January 28, 2004.
- support Atom-style author element in ``author_detail`` (dictionary of ``name``, ``url``, ``email``)
- map ``author`` to ``author_detail`` if ``author`` contains name + email address
:program:`Universal Feed Parser` 3.0b6 was released on January 27, 2004.
- added feed type and version detection, ``result['version']`` will be one of ``SUPPORTED_VERSIONS.keys()`` or empty string if unrecognized
- added support for creativeCommons:license and cc:license
- added support for full Atom content model in title, tagline, info, copyright, summary
- fixed bug with gzip encoding (not always telling server we support it when we do)
:program:`Universal Feed Parser` 3.0b5 was released on January 26, 2004.
- fixed bug parsing multiple links at feed level
:program:`Universal Feed Parser` 3.0b4 was released on January 26, 2004.
- fixed xml:lang inheritance
- fixed multiple bugs tracking xml:base :abbr:`URI (Uniform Resource Identifier)`, one for documents that don't define one explicitly and one for documents that define an outer and an inner xml:base that goes out of scope before the end of the document
:program:`Universal Feed Parser` 3.0b3 was released on January 23, 2004.
- parse entire feed with real :abbr:`XML (Extensible Markup Language)` parser (if available)
- added several new supported namespaces
- fixed bug tracking naked markup in description
- added support for enclosure
- added support for source
- re-added support for cloud which got dropped somehow
- added support for expirationDate
:program:`Universal Feed Parser` 3.0b2 and 3.0b1 have been lost in the mists of time.