File: example.cfg

package info (click to toggle)
w3mir 1.0pre4-2
  • links: PTS
  • area: main
  • in suites: hamm
  • size: 312 kB
  • ctags: 47
  • sloc: perl: 1,260; makefile: 37
file content (115 lines) | stat: -rw-r--r-- 3,849 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
# This is a example w3mir config file.  The file and options are fully
# documented in the w3mir man page.  This file might not be completely
# updated, but the man page should be.
#
# Comment lines start with '#'.  Comments are not allowed at end of
# lines.  The keywords are not casesensitive, but some settings are,
# notably fetch/ignore/apply settings, as well as the urls given.
# - janl
#
# Available options: recurse, no-date-check, only-nonexistent, list-urls,
# lowercase.  It is not default to recurse.  Options must be the
# first directive to appear in the config file.
Options: recurse
# This directive was previously named 'Fetch-options'
#
# Start here:
URL: http://www.ifi.uio.no/~janl/
#
#
# How much noise to make, one of quiet, brief or chatty.  Default is brief
#Verbosity: brief
#
# Initial referer.  Some servers are very picky....
#Initial-Referer: http://foo.bar/
#
# Headers to disable for anonymitys sake: 
#Disable-headers: referer, user
#
# Nicenes, robotrules can be true or false
#Robot-Rules: true
# Default pause between http connections is now 0 seconds 
# Pause: 0
# Pause between retries, in seconds.  If w3mir runs as a once-a-day thing
# this can and should be high.  I suggest at least one hour (3600 seconds).
#Retry-Pause: 600
# How many times a failed retrival is retried.
#Retries: 3
#
# URL selection.  Order is significant, first Fetch/Ignore rule overrides
# later rules.  Supply as many as you want.  If not matched by ignore rules
# a doc is gotten.  
#Fetch: *.html
#
# Ignore urls we can't copy anyway
Ignore: *.cgi
Ignore: *-cgi
Ignore: *.map
#
# RE variants are full perl REs.  Mix freely with the others.  These
# must be legal perl match res, including the starting m and any
# options you want, like 'i'.
#
#Fetch-RE: m/jpeg/
#Ignore-RE: m/\.gif$/
#
# URL manipulation.  _All_ rules are applied in order.  If lowercasing
# options is given that is done first.  This must be a legal perl
# substitution RE.  Including the starting s and any options you want,
# like 'g' and 'i'.
#
# This changes the name of all .gif files to .fif
#Apply: s/\.gif$/.fif/
#
# HTTP Proxy server:
#HTTP-Proxy: www-cache.uio.no:3128
#
# HTTP Proxy options: 
# - 'revalidate': Force proxy to revalidate cached documents.
# - 'refresh': Force proxy to get cached documents from server _again_.
# - 'no-store': Use if the contents of the documents is sensitive.
# - 'no-pragma': Force use of HTTP/1.1 'Cache-Control:' header.
# If only 'refresh' is specified the default is to use the 'Pragma: no-cache'
# HTTP/1.0 header, this can be overruled with the 'no-pragma' option.
# You can be fairly sure any proxy will understand 'refresh', ONLY HTTP/1.1
# proxies will understand 'revalidate' and 'no-store'.
#proxy-options: refresh
#
# HTTP Proxy authentication.  The mechanism  is simpleminded, we either need
# to authenticate ourselves with the proxy or not, and there can only be
# one user/password tuple, domain/realm information is ignored...
# HTTP-Proxy-user: foo
# HTTP-Proxy-passwd: bar
#
# What to do with files, one of save, stdout or forget.
#File-disposition: save
#
# Debug level
#Debug: 0|1|2
#
# Umask, if not the default
#Umask: 077
#
# HTML document editing options
#Header: Mirrored from http://www.ifi.uio.no/~janl/ts/
#Remove-nomirror: true
#
# Authentication:
#Auth-domain: server:port/realm
#Auth-user: me
#Auth-passwd: my_password
#
# You can use '*' in place of server:port and/or realm:
#Auth-domain: */*
#Auth-user: otherme
#Auth-passwd: otherpassword
#

# 'Fixup' is done by the external w3mfix program.  If you want a
# mirror that's browseable in a filesystem (as opposed to via a http
# server) just put this in:
Fixup: run

# Related is this, what is the name of the file the http server serves
# when a directory URL is requested?  w3mir default is index.html.
#Index-name: welcome.html