******************************************************************************* * Developers * * * * Do not update this change log manually. * * Instead, make your commit messages informative, and these will be harvested * * at integration time by the cvs2cl.pl script and added here. * * For details on this script see: * * http://www.red-bean.com/cvs2cl/ * * * * For guidelines on what to put in commit messages see: * * http://www.red-bean.com/cvs2cl/changelogs.html * * * * This file has been trimmed of changes prior to version 1.4. For access to * * earlier change histories, please consult htmlparser.cvs.sorceforge.net. * * * ******************************************************************************* Release Build 1.6 - 20060610 -------------------------------- 2006-06-10 10:39 derrickoswald * docs/faq.html: add faq to docs 2006-06-05 19:53 derrickoswald * src/org/htmlparser/tests/InstanceofPerformanceTest.java: Remove InstanceofPerformanceTest, no longer needed. 2006-06-04 15:17 derrickoswald * src/org/htmlparser/tests/AllTests.java, src/org/htmlparser/tests/ParserTest.java, src/org/htmlparser/tests/tagTests/BodyTagTest.java, src/org/htmlparser/tests/tagTests/FormTagTest.java, src/org/htmlparser/tests/tagTests/LabelTagTest.java, src/org/htmlparser/tests/tagTests/LinkTagTest.java, src/org/htmlparser/tests/tagTests/ObjectCollectionTest.java, build.xml, src/org/htmlparser/Parser.java, src/org/htmlparser/StringNodeFactory.java, src/org/htmlparser/Tag.java, src/org/htmlparser/tests/lexerTests/TagTests.java, src/org/htmlparser/tests/scannersTests/ScriptScannerTest.java, src/org/htmlparser/util/LinkProcessor.java, src/org/htmlparser/util/SpecialHashtable.java, src/org/htmlparser/util/Translate.java, src/org/htmlparser/nodes/TagNode.java, src/org/htmlparser/tags/LinkTag.java: Eliminate deprecated classes and methods. Removed nodeDecorator package, StringNodeFactory, LinkProcesor, SpecialHashTable, and methods for linkData, non-Ex Attributes and FindAllNodesThatAre. 2006-06-01 23:14 derrickoswald * src/org/htmlparser/: Parser.java, lexer/Lexer.java, util/NodeTreeWalker.java: Fix Javadoc warnings. 2006-06-01 22:43 derrickoswald * src/org/htmlparser/: http/ConnectionManager.java, lexer/Page.java: implement RFE #1394144 handle deflate encoding InflaterInputStream needed an additional Inflater argument. 2006-06-01 21:48 derrickoswald * src/org/htmlparser/: http/ConnectionManager.java, http/HttpHeader.java, Parser.java: implement RFE #1436082 Follow redirections with cookie processing Use ConnectionManager.setRedirectionProcessingEnabled(true). Probably only useful if combined with ConnectionManager.setCookieProcessingEnabled(true). 2006-05-30 22:10 derrickoswald * src/org/htmlparser/: tests/utilTests/NodeListTest.java, Node.java, nodes/AbstractNode.java, nodes/RemarkNode.java, nodes/TagNode.java, nodes/TextNode.java, tags/CompositeTag.java, tags/ScriptTag.java, util/NodeList.java: implement task #93148 toHtml(boolean verbatim) To avoid printing generated end tags use toHtml(true). 2006-05-29 23:11 derrickoswald * src/org/htmlparser/Parser.java: Update javadoc for new Parser constructor behaviour. 2006-05-29 22:53 derrickoswald * src/org/htmlparser/Parser.java: Allow passing HTML in the Parser constructor. So now it allows HTML, a URL or a file name. 2006-05-29 21:30 derrickoswald * src/org/htmlparser/http/ConnectionManager.java: Handle bad cookie names. Traps cookie name problems, but ignores any following cookies. 2006-05-29 21:07 derrickoswald * src/org/htmlparser/: beans/StringBean.java, tests/utilTests/BeanTest.java: fix bug#1496863 StringBean collapse() adds extra whitespace Keep collapsing state machine state as member variable. Integration Build 1.6 - 20060527 -------------------------------- 2006-05-27 10:36 derrickoswald * src/org/htmlparser/: scanners/ScriptScanner.java, tests/scannersTests/ScriptScannerTest.java: fix bug #1457371 Script tag consumes too much from document being parsed The default for ScriptScanner.STRICT was set to true. If you want the older, more lax, script parsing, set it to false. 2006-05-27 10:03 derrickoswald * src/org/htmlparser/: nodes/RemarkNode.java, tests/tagTests/FormTagTest.java: fix bug #1488951 RemarkNode.toPlainTextString() incorrect behaviour RemarkNode.toPlainTextString() now always returns an empty string if you want the remark text use getText() 2006-05-27 10:02 derrickoswald * src/org/htmlparser/: lexer/Lexer.java, tests/FunctionalTests.java, tests/lexerTests/LexerTests.java, tests/parserHelperTests/RemarkNodeParserTest.java: fix bug #1345049 HTMLParser should not terminate a comment with ---> add static STRICT_REMARKS to Lexer class, which when true follows the specification for remarks 2006-05-16 05:14 ian_macfarlane * src/org/htmlparser/filters/: AndFilter.java, OrFilter.java: Incorrect grammar in javadoc. Changed [it's] to [its]. 2006-05-16 05:11 ian_macfarlane * src/org/htmlparser/filters/XorFilter.java: New class that does XOR logic (to round out our NOT, AND and OR filters). 2006-05-16 03:58 ian_macfarlane * src/org/htmlparser/filters/: AndFilter.java, OrFilter.java: Added constructors to OrFilter/AndFilter that take an array of NodeFilter's. 2006-04-24 18:12 derrickoswald * src/org/htmlparser/Parser.java: Fix incorrect example. 2006-04-23 07:59 derrickoswald * src/org/htmlparser/tags/TableHeader.java: Change copyright as per request by P.I.M. Schrama 2006-04-17 20:08 derrickoswald * src/org/htmlparser/tests/: lexerTests/KitTest.java, PerformanceTest.java: Move non-junit test code to Request For Enhancement (RFE) as attachments. 2006-04-17 19:45 derrickoswald * src/org/htmlparser/tests/: ParserTestCase.java, PerformanceTest.java: Fix unit tests. 2006-04-17 09:53 derrickoswald * src/org/htmlparser/tests/: ParserTest.java, lexerTests/LexerTests.java, tagTests/InputTagTest.java, tagTests/TableTagTest.java, utilTests/CharacterTranslationTest.java: Fix unit tests. Move failing test cases to downloads on corresponding RFE artifacts. 2006-04-17 09:51 derrickoswald * bin/: translate.cmd, beanybaby.cmd, filterbuilder.cmd, lexer.cmd, linkextractor.cmd, parser.cmd, sitecapturer.cmd, stringextractor.cmd, thumbelina.cmd: Allow execution from directory name containing spaces on Windows. 2006-04-14 18:18 derrickoswald * build.xml, src/org/htmlparser/Parser.java, src/org/htmlparser/http/ConnectionManager.java, src/org/htmlparser/lexer/Lexer.java, src/org/htmlparser/util/NodeList.java: Cleanup to isolate htmllexer jar build. 2006-04-11 08:03 derrickoswald * src/org/htmlparser/tests/: AllTests.java, MemoryTest.java: Move failing unit test to RFE as a download. 2006-04-10 17:38 derrickoswald * src/org/htmlparser/lexer/Page.java: Fix Bug #1467712 Page#getCharset never works Use Content-Type header field instead of connection's getContentType method. 2006-04-08 09:33 derrickoswald * src/org/htmlparser/tests/utilTests/CharacterTranslationTest.java: Typo. 2006-04-06 20:58 derrickoswald * src/org/htmlparser/: lexer/Page.java, tests/lexerTests/PageTests.java: Fix Bug #1461473 Relative links starting with ? Added overloaded methods taking boolean 'strict' flag on URL manipulators. Default is loose interpretation like most browsers. Integration Build 1.6 - 20060319 -------------------------------- 2006-03-19 19:02 derrickoswald * src/org/htmlparser/tests/tagTests/BodyTagTest.java: Fix unit test for body tag. 2006-03-19 17:13 derrickoswald * docs/panel.html: Fix name of current build. 2006-03-19 17:03 derrickoswald * build.xml, docs/bug.html, docs/panel.html: Fix bug #1363500 http://htmlparser.sourceforge.net/bug.html is wrong Take down the wiki. 2006-03-19 16:26 derrickoswald * src/org/htmlparser/: lexer/Lexer.java, tags/BodyTag.java: Fix bug #1375230 some javascript breaks stringbean Retrace non-conforming end of remark. 2006-03-19 15:14 derrickoswald * src/org/htmlparser/http/: ConnectionManager.java, Cookie.java: Fix bug #1376851 Null-valued cookies cause exception Add handling for namewless cookies. 2006-03-19 13:40 derrickoswald * src/org/htmlparser/http/ConnectionManager.java: Remove deflate option from default request properties. See RFE #1394144 handle deflate encoding. 2006-03-19 12:09 derrickoswald * src/org/htmlparser/lexer/Page.java: Typo. 2006-03-19 11:11 derrickoswald * src/org/htmlparser/lexer/Lexer.java: Fix bug #1445795 return as TextNode when processing jsp Handle single and double line comments within jsp nodes. Suggested alteration to handle jsp tags within tag attributes wasn't implemented. 2006-03-19 10:01 derrickoswald * docs/contributors.html, src/org/htmlparser/PrototypicalNodeFactory.java, src/org/htmlparser/lexer/Lexer.java, src/org/htmlparser/tags/ProcessingInstructionTag.java, src/org/htmlparser/tests/lexerTests/LexerTests.java: Incorporated patch #1450095 Fix for Bug 1445309 from Trejkaz Xaoza. Addition of code to parse XML processing instructions. 2006-02-13 09:50 ian_macfarlane * src/org/htmlparser/util/NodeTreeWalker.java: A utility class to traverse a tree of Node objects using either depth-first or breadth-first tree traversal. Kind of like a NodeIterator for DOM-type trees of Nodes instead of linear sequences of Nodes. Post to the dev mailing list about this on the way. 2005-11-14 21:09 derrickoswald * src/org/htmlparser/: Attribute.java, Node.java, Parser.java, PrototypicalNodeFactory.java, Remark.java, StringNodeFactory.java, Tag.java, Text.java: Fix warnings flagged by doccheck. 2005-11-12 11:44 derrickoswald * src/org/htmlparser/tests/: lexerTests/LexerTests.java, tagTests/FormTagTest.java, tagTests/LinkTagTest.java: Update tests for addition of Paragraph tag. Integration Build 1.6 - 20051112 -------------------------------- 2005-11-12 09:19 derrickoswald * src/org/htmlparser/http/ConnectionManager.java, docs/contributors.html, src/org/htmlparser/PrototypicalNodeFactory.java: Add cookie processing changes suggested by Marcus Mattern. 2005-11-04 10:49 ian_macfarlane * src/org/htmlparser/nodes/TextNode.java: Add method isWhiteSpace to TextNode that returns if the node is nothing but white space (or null) or if it contains some characters. 2005-11-01 03:55 ian_macfarlane * src/org/htmlparser/nodeDecorators/AbstractNodeDecorator.java: Add methods first/last child previous/next sibling added to AbstractNode. This is required to enable the project to compile. 2005-10-31 11:26 ian_macfarlane * src/org/htmlparser/: PrototypicalNodeFactory.java, tags/DefinitionList.java, tags/DefinitionListBullet.java, tags/HeadingTag.java, tags/ParagraphTag.java, tags/TableColumn.java, tags/TableHeader.java, tags/TableRow.java: Added support for P and h1-h6 tags. Added support for definition list tags (dl, dt, dd). Let table row/column tags know when to close if encounter TBODY/TFOOT/THEAD. 2005-10-26 18:01 derrickoswald * docs/contributors.html, src/org/htmlparser/Node.java, src/org/htmlparser/nodes/AbstractNode.java: Incorporate patch #1338534 Support get first/last child, previous/next sibling from Ian Macfarlane. No unit tests. 2005-10-24 22:06 derrickoswald * src/org/htmlparser/: lexer/Page.java, tags/MetaTag.java: Fix bug 1322686 when illegal charset specified Use current source charset as the default if there is already a source. 2005-10-24 21:26 derrickoswald * src/org/htmlparser/lexer/InputStreamSource.java: Fixed bug #1334408 Exception occurs based on string length Changed >= test to > to avoid off-by-one error. 2005-09-25 21:01 derrickoswald * build.xml: Fix htmlparser target. Integration Build 1.6 - 20050925 -------------------------------- 2005-09-18 22:35 derrickoswald * docs/contributors.html, src/org/htmlparser/lexer/InputStreamSource.java, src/org/htmlparser/lexer/Lexer.java, src/org/htmlparser/tests/lexerTests/LexerTests.java: Apply patch #1247128 Bug Fix: #1227213 Particular SCRIPT tags close too late from Keiron McCammon. 2005-09-18 19:40 derrickoswald * docs/contributors.html, src/org/htmlparser/beans/FilterBean.java, src/org/htmlparser/filters/LinkRegexFilter.java: Add recursive property on FilterBean suggested by Martin Hudson. 2005-09-18 19:00 derrickoswald * docs/contributors.html, src/org/htmlparser/tests/utilTests/NodeListTest.java, src/org/htmlparser/util/NodeList.java: Add remove(Node) method and code suggested by Matthew Buckett. 2005-06-19 21:56 derrickoswald * src/org/htmlparser/: http/ConnectionManager.java, lexer/Lexer.java, lexer/Page.java, lexer/Stream.java, tags/CompositeTag.java: Optimizations suggested by profiling. Correction to previous drop: -- use extractAllNodesThatmatch to replace searchFor(cls) -- 2005-06-19 08:01 derrickoswald * build.xml, src/org/htmlparser/Parser.java, src/org/htmlparser/http/ConnectionManager.java, src/org/htmlparser/http/HttpHeader.java, src/org/htmlparser/http/package.html, src/org/htmlparser/util/NodeList.java: Changes to allow compilation of htmllexer.jar by gcj. Move non-JDK1.1 functionality to HttpHeader class. Unhook NodeList from filters by removing searchFor(cls) - use keepAllNodesThatMatch(new NodeClassFilter(cls)) instead. Include classes missing from closure set. Release Build 1.5 - 20050614 -------------------------------- 2005-05-15 07:49 derrickoswald * resources/htmlparser_checks.xml, src/org/htmlparser/Attribute.java, src/org/htmlparser/NodeFactory.java, src/org/htmlparser/NodeFilter.java, src/org/htmlparser/Remark.java, src/org/htmlparser/Tag.java, src/org/htmlparser/Text.java, src/org/htmlparser/beans/BeanyBaby.java, src/org/htmlparser/beans/FilterBean.java, src/org/htmlparser/beans/HTMLLinkBean.java, src/org/htmlparser/beans/HTMLTextBean.java, src/org/htmlparser/beans/LinkBean.java, src/org/htmlparser/beans/StringBean.java, src/org/htmlparser/filters/CssSelectorNodeFilter.java, src/org/htmlparser/filters/HasAttributeFilter.java, src/org/htmlparser/filters/HasChildFilter.java, src/org/htmlparser/filters/HasParentFilter.java, src/org/htmlparser/filters/HasSiblingFilter.java, src/org/htmlparser/filters/IsEqualFilter.java, src/org/htmlparser/filters/LinkRegexFilter.java, src/org/htmlparser/filters/LinkStringFilter.java, src/org/htmlparser/filters/NodeClassFilter.java, src/org/htmlparser/filters/NotFilter.java, src/org/htmlparser/filters/OrFilter.java, src/org/htmlparser/filters/RegexFilter.java, src/org/htmlparser/filters/StringFilter.java, src/org/htmlparser/filters/TagNameFilter.java, src/org/htmlparser/http/ConnectionManager.java, src/org/htmlparser/http/Cookie.java, src/org/htmlparser/lexer/Cursor.java, src/org/htmlparser/lexer/InputStreamSource.java, src/org/htmlparser/lexer/Lexer.java, src/org/htmlparser/lexer/Page.java, src/org/htmlparser/lexer/PageAttribute.java, src/org/htmlparser/lexer/PageIndex.java, src/org/htmlparser/lexer/Source.java, src/org/htmlparser/lexer/Stream.java, src/org/htmlparser/lexer/StringSource.java, src/org/htmlparser/scanners/ScriptDecoder.java, src/org/htmlparser/tests/lexerTests/KitTest.java, src/org/htmlparser/tests/lexerTests/LexerTests.java, src/org/htmlparser/tests/lexerTests/PageTests.java, src/org/htmlparser/tests/lexerTests/TagTests.java, src/org/htmlparser/tests/tagTests/InputTagTest.java, src/org/htmlparser/tests/utilTests/SortTest.java, src/org/htmlparser/util/ParserUtils.java: Documentation revamp part four. Remove some checkstyle warnings. 2005-05-13 06:44 derrickoswald * docs/contributors.html, src/org/htmlparser/sax/XMLReader.java: Add parse(InputSource) suggested by Jamie McCrindle. 2005-05-10 18:11 derrickoswald * src/org/htmlparser/tests/tagTests/SelectTagTest.java: Remove Shamil's email address. 2005-04-24 13:48 derrickoswald * build.xml, docs/main.html, lib/checkstyle-all-3.1.jar, lib/fit.jar, resources/htmlparser_checks.xml, src/doc-files/building.html, src/doc-files/overview.html, src/doc-files/using.html, src/org/htmlparser/Node.java, src/org/htmlparser/Parser.java, src/org/htmlparser/PrototypicalNodeFactory.java, src/org/htmlparser/tags/package.html, src/org/htmlparser/tests/ParserTest.java, src/org/htmlparser/visitors/NodeVisitor.java: Documentation revamp part three. Reworked some JavaDoc descriptions. Added "HTML Parser for dummies" introductory text. Removed checkstyle.jar and fit.jar (and it's cruft). 2005-04-12 07:27 derrickoswald * src/org/htmlparser/: Attribute.java, beans/package.html, lexer/Cursor.java, lexer/InputStreamSource.java, lexer/Lexer.java, lexer/Page.java, lexer/PageAttribute.java, lexer/Source.java, lexer/Stream.java, lexer/StringSource.java, lexer/package.html, lexerapplications/thumbelina/PicturePanel.java, parserapplications/LinkExtractor.java, parserapplications/SiteCapturer.java, parserapplications/StringExtractor.java, parserapplications/WikiCapturer.java, parserapplications/package.html, parserapplications/filterbuilder/Filter.java, parserapplications/filterbuilder/FilterBuilder.java, parserapplications/filterbuilder/HtmlTreeCellRenderer.java, parserapplications/filterbuilder/HtmlTreeModel.java, parserapplications/filterbuilder/SubFilterList.java, parserapplications/filterbuilder/layouts/NullLayoutManager.java, parserapplications/filterbuilder/layouts/VerticalLayoutManager.java, parserapplications/filterbuilder/wrappers/AndFilterWrapper.java, parserapplications/filterbuilder/wrappers/HasAttributeFilterWrapper.java, parserapplications/filterbuilder/wrappers/HasChildFilterWrapper.java, parserapplications/filterbuilder/wrappers/HasParentFilterWrapper.java, parserapplications/filterbuilder/wrappers/HasSiblingFilterWrapper.java, parserapplications/filterbuilder/wrappers/NodeClassFilterWrapper.java, parserapplications/filterbuilder/wrappers/NotFilterWrapper.java, parserapplications/filterbuilder/wrappers/OrFilterWrapper.java, parserapplications/filterbuilder/wrappers/RegexFilterWrapper.java, parserapplications/filterbuilder/wrappers/StringFilterWrapper.java, parserapplications/filterbuilder/wrappers/TagNameFilterWrapper.java, sax/Feedback.java, sax/XMLReader.java: Documentation revamp part two. 2005-04-10 19:20 derrickoswald * bin/beanybaby.bat, bin/beanybaby.cmd, bin/filterbuilder.bat, bin/filterbuilder.cmd, bin/lexer.bat, bin/lexer.cmd, bin/linkextractor.bat, bin/linkextractor.cmd, bin/parser.bat, bin/parser.cmd, bin/sitecapturer, bin/sitecapturer.cmd, bin/stringextractor.bat, bin/stringextractor.cmd, bin/thumbelina.bat, bin/thumbelina.cmd, bin/translate.bat, bin/translate.cmd, src/org/htmlparser/Attribute.java, src/org/htmlparser/Node.java, src/org/htmlparser/NodeFactory.java, src/org/htmlparser/PrototypicalNodeFactory.java, src/org/htmlparser/Remark.java, src/org/htmlparser/StringNodeFactory.java, src/org/htmlparser/Tag.java, src/org/htmlparser/Text.java, src/org/htmlparser/beans/BeanyBaby.java, src/org/htmlparser/beans/FilterBean.java, src/org/htmlparser/beans/HTMLLinkBean.java, src/org/htmlparser/beans/HTMLTextBean.java, src/org/htmlparser/beans/LinkBean.java, src/org/htmlparser/beans/StringBean.java, src/org/htmlparser/beans/package.html, src/org/htmlparser/filters/AndFilter.java, src/org/htmlparser/filters/CssSelectorNodeFilter.java, src/org/htmlparser/filters/HasAttributeFilter.java, src/org/htmlparser/filters/HasChildFilter.java, src/org/htmlparser/filters/HasParentFilter.java, src/org/htmlparser/filters/HasSiblingFilter.java, src/org/htmlparser/filters/LinkRegexFilter.java, src/org/htmlparser/filters/LinkStringFilter.java, src/org/htmlparser/filters/NodeClassFilter.java, src/org/htmlparser/filters/NotFilter.java, src/org/htmlparser/filters/OrFilter.java, src/org/htmlparser/filters/RegexFilter.java, src/org/htmlparser/filters/TagNameFilter.java, src/org/htmlparser/http/ConnectionManager.java, src/org/htmlparser/http/ConnectionMonitor.java, src/org/htmlparser/http/Cookie.java, src/org/htmlparser/http/package.html, src/org/htmlparser/nodeDecorators/AbstractNodeDecorator.java, src/org/htmlparser/nodeDecorators/DecodingNode.java, src/org/htmlparser/nodeDecorators/EscapeCharacterRemovingNode.java, src/org/htmlparser/nodeDecorators/NonBreakingSpaceConvertingNode.java, src/org/htmlparser/nodeDecorators/package.html, src/org/htmlparser/nodes/AbstractNode.java, src/org/htmlparser/nodes/RemarkNode.java, src/org/htmlparser/nodes/TagNode.java, src/org/htmlparser/nodes/TextNode.java, src/org/htmlparser/nodes/package.html, src/org/htmlparser/parserapplications/filterbuilder/FilterBuilder.java, src/org/htmlparser/scanners/CompositeTagScanner.java, src/org/htmlparser/tags/BaseHrefTag.java, src/org/htmlparser/tags/BodyTag.java, src/org/htmlparser/tags/CompositeTag.java, src/org/htmlparser/tags/DoctypeTag.java, src/org/htmlparser/tags/FormTag.java, src/org/htmlparser/tags/FrameSetTag.java, src/org/htmlparser/tags/FrameTag.java, src/org/htmlparser/tags/HeadTag.java, src/org/htmlparser/tags/ImageTag.java, src/org/htmlparser/tags/JspTag.java, src/org/htmlparser/tags/LabelTag.java, src/org/htmlparser/tags/LinkTag.java, src/org/htmlparser/tags/MetaTag.java, src/org/htmlparser/tags/OptionTag.java, src/org/htmlparser/tags/ScriptTag.java, src/org/htmlparser/tags/SelectTag.java, src/org/htmlparser/tags/TableRow.java, src/org/htmlparser/tags/TableTag.java, src/org/htmlparser/tags/TextareaTag.java, src/org/htmlparser/tags/TitleTag.java, src/org/htmlparser/tags/package.html, src/org/htmlparser/tests/lexerTests/KitTest.java, src/org/htmlparser/tests/lexerTests/LexerTests.java: Documentation revamp part one. Deprecated node decorators. Added doSemanticAction for Text and Comment nodes. Added missing sitecapturer scripts. Fixed DOS batch files to work when called from any location. 2005-04-06 06:27 derrickoswald * build.xml, docs/release.txt, docs/samples.html: End user experience issues: remove multiple wiki files in zip fix sample application links change readme.txt to use Windows line endings change copyright date 2005-04-06 06:20 derrickoswald * docs/contributors.html, src/org/htmlparser/filters/LinkRegexFilter.java, src/org/htmlparser/filters/LinkStringFilter.java: Add link pattern filters submitted by John Derrick. 2005-04-04 20:48 derrickoswald * src/org/htmlparser/: NodeFilter.java, Parser.java, package.html, parserapplications/SiteCapturer.java: Update javadocs. Enable SiteCapturer to handle resource names containing spaces. Integration Build 1.5 - 20050313 -------------------------------- 2005-03-13 09:51 derrickoswald * src/org/htmlparser/: lexer/Lexer.java, lexer/Page.java, lexer/Source.java, lexerapplications/tabby/Tabby.java, scanners/ScriptDecoder.java, tests/lexerTests/TagTests.java, util/IteratorImpl.java: Bug #1121401 No Parsing with yahoo! By default nio.charset.CharsetDecoder replaces characters it cannot represent in the current encoding with zero, which was the value returned by the page when the Stream reached EOF. This changes the Page return value to (char)Source.EOF (-1) when the end of stream is encountered. 2005-03-12 16:39 derrickoswald * src/org/htmlparser/beans/: BeanyBaby.java, LinkBean.java: Fix bean example, stop sharing connections. 2005-03-12 15:27 derrickoswald * build.xml, lib/commons-logging.jar: Bug #1018884 'compile' ant task from build.xml messes up ./src directory Added optional "classes" property to build.xml. This directory is where class files are put. It defaults to src. To use: build -Dclasses=classdir where classdir is a peer directory to src. Removed unused commons-logging.jar while I was in there. 2005-03-12 12:53 derrickoswald * src/org/htmlparser/: lexer/Lexer.java, scanners/ScriptScanner.java, tests/scannersTests/ScriptScannerTest.java: Add STRICT flag to ScriptScanner to revert to legacy handling of broken ETAGO (