1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135
|
Revision history for Perl module URI::Find.
20100505 Wed May 5 18:48:44 PDT 2010
Test Fixes
* Fixed t/urifind/find.t on Windows
20100504.1039 Tue May 4 10:39:23 PDT 2010
Doc Fixes
* Forgot to mention that we ship with urifind now.
20100504 Tue May 4 10:29:52 PDT 2010
New Features
* Added a urifind program. (Darren Chamberlain)
Bug Fixes
* The final semi-colon was being strippped form URLs found in HTML
that ended with HTML entities. (Michael Peters)
Example: http://google.com/search?q=<html>
* URLs with leading dots, pluses and minuses are now found.
[rt.cpan.org 57032]
Example: stuff...http://example.com
20100211 Thu Feb 11 04:02:26 PST 2010
Bug Fixes
* Finding URIs inside brackets was pretty badly broken by
the last release. (Michael Peters)
20090319 Thu Mar 19 12:17:53 PDT 2009
Bug Fixes
* Schemeless now ignores the case of the TLD.
New Features
* Updated the list of accepted domains for finding schemeless URIs
from the latest ICANN list.
Docs
* Add LICENSE section
* Remove wildly out of date CAVEATS
* Added an example of how to get a list of all URIs.
* Updated INSTALL section to reflect new dependencies and Module::Build
installation process
* Regenerated the README file
20090316 Mon Mar 16 16:18:10 PDT 2009
New Features
* Added optional replacement function to find(). Now you
can not only replace URLs found, but also the rest of the text around
them in one fell swoop. (Mike Schilli) [rt.cpan.org 20486]
* Whitespace inside <...> is now ignored as per the suggestion of
RFC 3986 appendix C. [rt.cpan.org 20483]
Other
* Michael G Schwern is now primary maintainer again. Thanks for all your
work, Roderick!
* Repository moved to http://github.com/schwern/uri-find
* Now requires Test::More
* Verisoning scheme changed to ISO date integers
* Minimum Perl version is now 5.6.0.
0.16 Fri Jul 22 06:00:24 EDT 2005
- Oops, make the URI::Find::Schemeless->top_level_domain_re case
insensitive, as it should be and the docs claimed it was. Thanks
to Todd Eigenschink.
0.15 Tue Mar 22 07:23:17 EST 2005
- Have all functions croak if invoked with the wrong number of
arguments. Add URI::Find->badinvo.
https://rt.cpan.org/NoAuth/Bug.html?id=1845
- Mention DARREN's urifind script in the man page.
- Oops, URI::URL::strict was turned on and left on. Put it back the
way you found it. Thanks to Chris Nandor.
https://rt.cpan.org/NoAuth/Bug.html?id=11906
- Schemeless.pm:
- Find '<x>intag.com</x>'.
- Get $tldRe from a new class method, ->top_level_domain_re.
- Update top level domain list.
0.14 Sat Oct 9 08:20:04 EDT 2004
- Add copyright notice.
- Add ] to main $cruftSet, } to schemeless $cruftSet, for
[http://square.com] and {brace.com}.
- quotemeta() $cruftSet.
0.13 Mon Jul 1 10:37:54 EDT 2002
- Don't find any schemeless URIs with a plain URI::Find. Previously
it'd find ones which started with "ftp." and "www.", but it was
more prone to false positives than URI::Find::Schemeless.
- Have schemeless_to_schemed use http:// except in the specific case
in which it uses ftp://. Remove URI::Find::Schemeless's version.
0.12 Wed Mar 20 14:39:21 EST 2002
- Improve the "wrap each URI found in an HTML anchor" example.
- Release a new version so CPAN sees the maintainer change.
0.11 Thu Jul 26 14:43:49 EDT 2001
- Michael passed the module to Roderick for maintenance.
- Improve test suite.
- Tweak URI::Find::Schemeless not to find Foo.p[ml].
0.10 Mon Jul 10 20:14:08 EDT 2000
- Rearchitected the internals to allow simple subclassing
- Added URI::Find::Schemeless (thanks Roderick)
0.04 Sat Feb 26 09:05:11 GMT 2000
- Added # to the uric set of characters so HTML anchors are caught.
0.03 Tue Feb 1 16:15:22 EST 2000
- Added some heuristic discussion to the docs.
- Added some heuristics to avoid picking up perl module names
- Improved schemeless URI heuristic to avoid picking up
usenet board names.
- Handling the <URL:...> case better as suggested in RFC 2396 Apdx E
- Added ; to the cruft heuristic
0.02 Tue Feb 1 13:11:56 EST 2000
- Added heuristic to handle 'URL:http://www.foo.com'
- Added heuristic to handle trailing quotes.
0.01 Mon Jan 31 19:12:23 EST 2000
- First working version released to CPAN.
|