File: README.rst

package info (click to toggle)
python-publicsuffix 1.1.0-2
  • links: PTS, VCS
  • area: main
  • in suites: buster, stretch
  • size: 352 kB
  • ctags: 68
  • sloc: python: 241; makefile: 6
file content (87 lines) | stat: -rw-r--r-- 2,726 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
Public Suffix List module for Python
====================================

This module allows you to get the public suffix of a domain name using the
Public Suffix List from http://publicsuffix.org

A public suffix is one under which Internet users can directly register
names. Some examples of public suffixes are .com, .co.uk and pvt.k12.wy.us.
Accurately knowing the public suffix of a domain is useful when handling
web browser cookies, highlighting the most important part of a domain name
in a user interface or sorting URLs by web site.

It's up the user to provide this module with an appropriate version of the
Public Suffix List. A convenience function is provided that downloads the most
recent version available on http://publicsuffix.org


Module content
--------------

The `fetch()` function downloads the most recent version of the list and
returns a file object. You should cache the data as appropriate for your
application::

    >>> from publicsuffix import fetch
    >>> psl_file = fetch()

Alternatively, if you have the list already downloaded by other means::

    >>> import codecs
    >>> psl_file = codecs.open('publicsuffix/public_suffix_list.dat', encoding='utf8')

The `PublicSuffixList` class parses the Public Suffix List and allows queries
for individual domain names::

    >>> from publicsuffix import PublicSuffixList
    >>> psl = PublicSuffixList(psl_file)
    >>> psl.get_public_suffix("www.example.com")
    'example.com'

Please note that the ``host`` part of an URL can contain strings that are
not plain DNS domain names (IP addresses, Punycode-encoded names, name in
combination with a port number or an username, etc.). It is up to the
caller to ensure only domain names are passed to the get_public_suffix()
method.



Installation
------------

To install from the source distribution and run unit tests, use these
commands::

    $ python setup.py install
    $ python setup.py test


Source
------

You can get a local copy of the development repository with::

    git clone http://www.tablix.org/~avian/git/publicsuffix.git


Notes on backwards compatibility
--------------------------------

This module ships with an out-dated copy of the Public Suffix List for
compatiblity with older versions. Use of this list is deprecated since it
encouraged applications that never updated the Public Suffix List data. The
built-in list will be removed in a future version.

If you are using the `PublicSuffixList` constructor without any arguments,
please use the `fetch()` function and implement an appropriate cache, as shown
above.


Support
-------

Questions, bug reports, useful code bits, and suggestions should be sent to
tomaz.solc@tablix.org

..
    vim: set filetype=rst: