1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203
|
Revision history for Perl extension HTML::TreeBuilder::LibXML
0.28 2024-06-20T08:48:55Z
0.28 2024-06-20T08:17:27Z
- Same code as 0.27 - republishing to force CPAN reindexing
0.27 2024-01-09T06:19:12Z
- don't have the parser string blanks by default #15
https://github.com/tokuhirom/HTML-TreeBuilder-LibXML/pull/15
- fallback to toString() when toStringC14N(1) gives an empty string #17
https://github.com/tokuhirom/HTML-TreeBuilder-LibXML/pull/17
0.26 2016-10-19T15:03:00Z
- Improve `HTML::TreeBuilder::LibXML::Node->new` compatibility
with `HTML::Element->new`
https://github.com/tokuhirom/HTML-TreeBuilder-LibXML/issues/12
(reported by nanto++)
0.25 2015-04-21T23:39:33Z
commit 5c46775d50678e611729135f3601b66e6a0d10cd
Author: Yanick Champoux <yanick@babyl.dyndns.org>
Date: Thu Jan 15 14:15:04 2015 -0500
silence warnings
... as the arguments can be undefined
Fixes #10
0.24 2014-09-22T09:31:26Z
- Removed Web::Scraper from runtime dependencies.
https://github.com/tokuhirom/HTML-TreeBuilder-LibXML/issues/9
0.23 2013-05-17T00:16:48Z
- fixed guts(), clone() and replace_with() to properly handle XML::LibXML::Dtd nodes
- guts() now includes the Dtd node in the returned document (unless it were implicitly created)
- clone() calls createInternalSubset() on the new document
- replace_with() calls createInternalSubset() if the replacement is a XML::LibXML::Dtd (can't import Dtd node)
(cafe01)
0.22 2013-05-13T00:04:09Z
- improved guts(), calling nonBlankChildNodes() instead of childNodes()
- improved HTML::TreeBuilder::LibXML::Node documentation
(cafe01)
0.21 2013-05-12T19:12:53Z
- fixed guts(),
- now returning nodes from <head> and <body> instead of just <body>
- now returning text and comment nodes instead of just element nodes
- returned nodes now belong to the same document
- fixed to_HTML to render valid html, not xml
(cafe01)
0.20 2013-05-10T20:44:16Z
- improved replace_with() on document node.
- fixed push_content() and unshift_content() to work with document mode.
(cafe01)
0.19 2013-05-10T01:03:58Z
- fixed replace_with() and parent(),
to avoid calling appendChild() on a Document node, which is not supported by XML::LibXML.
(cafe01)
0.18 2013-05-09T20:49:04Z
- implemented all node methods needed for Web::Query::LibXML to work
- clone_list
- detach
- delete_content
- content_list
- replace_with
- push_content
- unshift_content
- postinsert
- preinsert
- disembowel (HTML::TreeBuilder::LibXML)
(cafe01)
- modified parse_file() to read file content, then call parse_content()
- thats because parse_content() will detect (heuristically) when the parser will add implict <html><body> tags, so guts() can work properly.
(cafe01)
0.18 2013-05-09T01:27:46Z
- implemented matches(), parent(), guts() node method
(Carlos Fernando Avila Gratz)
0.17
handle /(de)?objectify_text/ for <script> extraction
(Stanislaw Pusep)
0.16
commit 07b40205fd03564d476eff7675e9f19196939f2f
Author: Oleg G <verdrehung@gmail.com>
Date: Sat Mar 31 13:26:11 2012 +0700
added few methods to support Web::Query
0.15
* Add additional methods to better match
HTML::TreeBuilder::XPath::Node API:
- exists($xpath)
- find($elem_name)
- findvalues($xpath)
- findnodes_as_string($xpath)
- findnodes_as_strings($xpath)
(genehack)
0.14
* added workaround for Web::Scraper 0.36
(tokuhirom)
0.13
* Added 'getValue' node's method as in HTML::TreeBuilder::XPath for
comment nodes in web-scraper module
* Added dummy method 'store_comments' as in HTML::TreeBuilder
for web-scraper module (for comment nodes) and for
HTML::TreeBuilder::XPath working in tests with comment nodes
* Now this module requires HTML::TreeBuilder::XPath v.0.14 (before 0.14
there getValue() didn't work for comment nodes)
0.12
* no Test::Base(tokuhirom)
* depend to latest libraries(tokuhirom)
ref. http://d.hatena.ne.jp/mikeda/20100622/1277229060
* fixed typo in pod(tokuhirom)
0.11 Tue Oct 6 10:47:16 PDT 2009
- Fix parse when a content is truly an empty string
0.10
- added as_trimmed_text(chiba)
0.09 Sun Aug 2 06:02:01 PDT 2009
- Fixed parse_html method when parsing whitespace strings so it won't break and consistent with HTML::TB
(Reported by otsune)
0.08 Mon Jul 20 10:00:19 PDT 2009
- Updated POD document
0.07
- Implemented all_attr, all_attr_names, all_external_attr and all_external_attr_names (miyagawa)
0.06
- Added new_from_content and new_from_file to for more compat. with HTML::TreeBuilder(miyagawa)
0.05
- fixed deps
0.04
- support more look_down() params(miyagawa)
- added ->id support(tokuhirom)
- call ->eof automatically for compatibility.(tokuhirom)
- added ->findvalue support(tokuhirom)
- added ->eof(tokuhirom)
0.03
- [FEATURE] added to restore hacked constructor(suggested by miyagawa++).
0.02
- [MISC] $parser->no_network(1)
0.01_03
- [BUG] Node->delete is unbound()
- [FEATURE] added getFirstChild() method
- [FEATURE] added Node->as_HTML
- [MISC] added THANKS secion in the pod(感謝の意)
0.01_02
- more loose check(libxml is too strict)
- added benchmark script
- added benchmark result into pod
0.01_01
- initial dev. release
0.01 Tue Mar 24 21:26:10 2009
- original version
|