<!-- ettext.default -->
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN"> <html> <head> <title></title></head>
<body bgcolor="#ffffff" text="#000000" link="#3300cc" vlink="#660066">
<h1>1. CONTRIBUTING</h1>
<p>There's a mailing list for discussion of sitescooper. To join, send
a mail to <sitescooper-request /at/ netnoteinc.com> with the single word
subscribe in the message body. To leave the list,
send a mail to <sitescooper-request /at/ netnoteinc.com> with the word
unsubscribe in the message body. Note: the mail addresses
above are "spam-protected", so you need to change the " /at/ " parts
to an @ sign to send a mail to them!
</p>
<p>If you have a site you think others will like, mail the .site file to
the list, and I'll stick it in the distribution -- and list your name
in the CREDITS section. Same goes for bug patches!
</p>
<h1>2. BUGS</h1>
<p>If you find one, send a bug report to the list (or to me) and I'll
try to get around to fixing it. It could take a while though, as I don't
get paid for this stuff. BTW, I really like bugfix patches, if you feel
like submitting one after finding a bug ;)
</p>
<h1>3. COPYRIGHT AND CREDITS</h1>
<p>Some of the post-processing and HTML cleanup code includes ideas and code
shamelessly stolen from <tt><a
href="http://pilot.screwdriver.net/">http://pilot.screwdriver.net/</a></tt>,
Christopher Heschong's <chris at screwdriver.net> webpage-to-pilot
conversion tool.</p>
<p>
Included in the distribution is a copy of Algorithm::Diff, an implementation of
the Longest Common Subsequence algorithm, Copyright 1998, 1999 M-J. Dominus
(mjd-perl-diff /at/ plover.com).
</p><p>
Also Robb Canfield (robbc /at/ canfield.com) has kindly provided Table.pm, "a
general purpose HTML table converter that tries, usually successfully, to
convert wide tables to long lists. In general it copies the table headers and
rotates them down for each row." It's used when you set "TableRender: list".
</p><p>
James Brown (jbrown /at/ burgoyne.com) has also contributed NewsHound.pm, which
"adds what I call story profiles to sitescooper. Basically you tell
sitescooper what sort of stories you are interested in by describing them in
one or more profiles. Then the system only scoops stories that interest you.
Obviously this works better on either 2 or 3 level sites where stories are
encapsulated in a single file. You can also disable the profiles for a
particular site (for example, a headlines page where you want everything)."
It's used when you pass the <b>-grep</b> command-line argument.
</p><p>
Both are free software; you can redistribute them and/or modify them under the same
terms as Perl itself. If you've downloaded the "full" version of sitescooper,
also included under the Artistic license are:
<br>
HTML-Parser 2.23, by Gisle Aas; Copyright 1995-1999 Gisle Aas. All rights reserved.
<br>
Libwww-perl 5.45, Copyright 1995-1999 Gisle Aas. All rights reserved, and Copyright 1995 Martijn Koster. All rights reserved.
<br>
MIME-Base64 2.11, Copyright 1995-1999 Gisle Aas <gisle /at/ aas.no>.
<br>
URI 1.04, Copyright 1998-1999 Gisle Aas, Copyright 1998 Graham Barr.
</p>
<p>
These are included to ease the task of installation.
</p>
<p>Here's a list of people who've contributed to sitescooper, whether with
.site files, patches, or suggested fixes and functionality:
</p>
<blockquote>
Carsten Clasohm <cc /at/ clasohm.com>: fix for diffing sites with newlines
in the href tags, regional_germany sites.
</blockquote>
<blockquote>
michael d. ivey <ivey /at/ gweezlebur.com>: packaging sitescooper
as a .deb, and general Debian compliance -- thanks!
</blockquote>
<blockquote>Stefan Schwingeler <stefan /at/ schwingeler.de>: fix for ContentsSkipURL,
regional_germany sites. Stefan and Carsten are responsible, between
them, for all the sites in the regional_germany category -- thanks
guys!
</blockquote>
<blockquote>Pierre-Yves Letournel <e-py.letournel /at/ wanadoo.fr>: regional_francais:
afp.site, le_monde.site, 01_informatique.site, lmi_hebdo.site,
lmi_quotidien.site.
</blockquote>
<blockquote>Jacques Turbé <jturbe /at/ cybercable.fr>: regional_francais: lemondecomplet.site,
nouvelobs.site, libe_portrait_du_jour.site, libe_rebonds.site, libe_q.site,
journaldunet_dossiers.site, echos_infos.site, and journaldunet.site.
Jacques and Pierre-Yves have, between them, provided all the sites
in regional_francais, which is great!
</blockquote>
<blockquote>Jason Simpson <jason /at/ xio.com>: contributed seattletimes.site
</blockquote>
<blockquote>Joe Pfeiffer <pfeiffer /at/ cs.nmsu.edu>: HTML rendering fixes, lots
of sites
</blockquote>
<blockquote>Mike Miller <mmiller /at/ mediageneral.com>: several sites
</blockquote>
<blockquote>dLux <dlux /at/ dlux.hu>: sites for Debian Weekly News, Freshmeat,
Hirnet, Linux.Hu, Palmcentral, updated Linux Today
</blockquote>
<blockquote>Andrew Fletcher <fletch /at/ computer.org>: MacOS support
</blockquote>
<blockquote><spacehog /at/ knowfear.knowfear.net>: yahoo_top_stories.site
</blockquote>
<blockquote>Jason C. Axley <jason /at/ axley.net>: installation instructions
update for RedHat 6.0, and SRPM for the URI module.
</blockquote>
<blockquote>Kennis Koldewyn <kennis.koldewyn /at/ wcom.com>: NY Times sites.
</blockquote>
<blockquote>Michael Lapsley <mlapsley /at/ ndirect.co.uk>: fixed bug with "-refresh
-fromcache".
</blockquote>
<blockquote>Jason Yanowitz <yanowitz /at/ poboxes.com>: site file for The Guardian.
</blockquote>
<blockquote>
Kevin Olson <kevolson /at/ visi.com>: fixed bug with RichReader
command-line.
</blockquote>
<blockquote>
Vince <reverso /at/ club-internet.fr>: contributed le_temps.site.
</blockquote>
<blockquote>
Dave Collins <Dave.Collins /at/ tiuk.ti.com>: fix for (no text to write)
when text started with a quote char.
</blockquote>
<blockquote>
Albert K T Hui <avatar /at/ deva.net>: lots of regional_hk site files,
fixes to allow more 8-bit text, and a workaround for HTML abuse by
Sing Tao Daily.
</blockquote>
<blockquote>
Alastair Rankine <arankine /at/ lucent.com>: fairfax_it.site
</blockquote>
<blockquote>
Kevin L. Dupree <kdupree /at/ flash.net>: image-only site support.
</blockquote>
<blockquote>
Andy Rabagliati <andyr /at/ wizzy.com>: csmonitor.site and KPilot support.
</blockquote>
<blockquote>
Memeteau, Michael <Michael.Memeteau /at/ autoeuropa.pt>: site files.
</blockquote>
<blockquote>
Derek Glidden <dglidden /at/ illusionary.com>:
fixed lots of delinquent site files, added science_daily.site, spaceref.site.
</blockquote>
<blockquote>
Justin Henry <jhenry /at/ fjicl.com>:
A fine selection of sites: updated salon.site; gist_tv.site;
cats_cradle.site; clark_howard.site; morbid_fact_du_jour.site;
news_observer.site; ny_times_handheld.site; roger_ebert.site;
usa_today.site; weather24.site; wral_tv.site; and movietickets.site.
</blockquote>
<blockquote>
Sergi Puso Gallart <sergi /at/ iAgora.net>:
elmundo_* and marca_* sites, creating the new regional_spain category.
</blockquote>
<blockquote>
Lim Swee Tat <st_lim /at/ 3ui.com>:
samba_traffic.site, wine_traffic.site, techweb.site added;
webmonkey.site, javaworld.site fixed; AnywhereYouGo sites
contributed.
</blockquote>
<blockquote>
Marko Bozikovic <redbyron /at/ fly.srk.fer.hr>:
All of regional_croatia; a lot of comics, and several science sites.
</blockquote>
<blockquote>
Thean Yoon Fui <yoonfui /at/ bigfoot.com>:
Lots of comics sites, and updates to the visorcentral site.
</blockquote>
<blockquote>
Peter Marschall <peter.marschall /at/ mayn.de>:
updates to de_sz, de_heise, the_register and de_zeit sites;
pointed out that .pdb is the correct extension for iSilo output;
support for multiple site choices files and FHS conformance.
</blockquote>
<blockquote>
David Czerwinski:
Chicago Tribune sites
</blockquote>
<blockquote>
Wari Wahab:
concept and implementation of the index page for HTML
and M-HTML output
</blockquote>
<blockquote>
David A. Desrosiers <hacker /at/ gnu-designs.com>:
Lots of new "Palm version" site file URLs
</blockquote>
<blockquote>
Robert Edmonds <stu /at/ brainfood.com>:
Several "humor" sites: BOFH, ditherati, pigdog.site.
</blockquote>
</body></html>