File: parsero.1

package info (click to toggle)
parsero 0.0%2Bgit20140929.e5b585a-4
  • links: PTS, VCS
  • area: main
  • in suites: bullseye
  • size: 132 kB
  • sloc: python: 216; sh: 8; makefile: 6
file content (67 lines) | stat: -rw-r--r-- 1,682 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
.\" Text automatically generated by txt2man
.TH parsero 1 "18 Aug 2020" "parsero-0.0+git20140929.e5b585a" "Audit tool for robots.txt of a site"
.SH NAME
\fBparsero \fP- Audit tool for robots.txt of a site
\fB
.SH SYNOPSIS
.nf
.fam C
\fBparsero\fP [\fB-h\fP] [\fB-u\fP \fIURL\fP] [\fB-o\fP] [\fB-sb\fP] [\fB-f\fP \fIFILE\fP]

.fam T
.fi
.fam T
.fi
.SH DESCRIPTION
Parsero is a free script written in Python which reads the Robots.txt file
of a web server through the network and looks at the Disallow entries. The
Disallow entries tell the search engines what directories or files hosted
on a web server mustn't be indexed. For example, "Disallow: /portal/login"
means that the content on www.example.com/portal/login it's not allowed to
be indexed by crawlers like Google, Bing, Yahoo\.\.\. This is the way the
administrator have to not share sensitive or private information with the
search engines.
.SH OPTIONS
.TP
.B
\fB-h\fP, \fB--help\fP
Show help message and exit.
.TP
.B
\fB-u\fP \fIURL\fP
Type the \fIURL\fP which will be analyzed.
.TP
.B
\fB-o\fP
Show only the "HTTP 200" status code.
.TP
.B
\fB-sb\fP
Search in Bing indexed Disallows.
.TP
.B
\fB-f\fP \fIFILE\fP
Scan a list of domains from a list.
.SH EXAMPLE
Common usage:
.PP
.nf
.fam C
    $ parsero -u www.example.com

.fam T
.fi
Using a list of domains from a list:
.PP
.nf
.fam C
    $ parsero -f /tmp/list-of-domains.txt

.fam T
.fi
.SH SEE ALSO
\fBlinkchecker\fP(1), \fBproxychains4\fP(1).
.SH AUTHOR
\fBparsero\fP was written by Javier Nieto <javier.nieto@behindthefirewalls.com>.
.PP
This manual page was written by Thiago Andrade Marques <andrade@debian.org> for the Debian project (but may be used by others).