File: uncommon-rss.rst

package info (click to toggle)
feedparser 6.0.12-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 10,540 kB
  • sloc: xml: 11,459; python: 4,575; makefile: 7
file content (93 lines) | stat: -rw-r--r-- 2,979 bytes parent folder | download | duplicates (7)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
Uncommon :abbr:`RSS (Rich Site Summary)` Elements
=================================================

These elements are less common, but are useful for niche applications and may
be present in any :abbr:`RSS (Rich Site Summary)` feed.

An :abbr:`RSS (Rich Site Summary)` feed can specify a small image which some
aggregators display as a logo.


Accessing feed image
--------------------

::

    >>> import feedparser
    >>> d = feedparser.parse('http://feedparser.org/docs/examples/rss20.xml')
    >>> d.feed.image
    {'title': u'Example banner',
    'href': u'http://example.org/banner.png',
    'width': 80,
    'height': 15,
    'link': u'http://example.org/'}

Feeds and entries can be assigned to multiple categories, and in some versions
of :abbr:`RSS (Rich Site Summary)`, categories can be associated with a
"domain".  Both are free-form strings.  For historical reasons,
:program:`Universal Feed Parser` makes multiple categories available as a list
of tuples, rather than a list of dictionaries.


Accessing multiple categories
-----------------------------

::

    >>> import feedparser
    >>> d = feedparser.parse('http://feedparser.org/docs/examples/rss20.xml')
    >>> d.feed.categories
    [(u'Syndic8', u'1024'),
    (u'dmoz', 'Top/Society/People/Personal_Homepages/P/')]

Each item in an :abbr:`RSS (Rich Site Summary)` feed can have an "enclosure", a
delightful misnomer that is simply a link to an external file (usually a music
or video file, but any type of file can be "enclosed").  Once rare, this
element has recently gained popularity due to the rise of
`podcasting <http://en.wikipedia.org/wiki/Podcasting>`_.  Some clients (such
as Apple's :program:`iTunes`) may automatically download enclosures; others
(such as the web-based Bloglines) may simply render each enclosure as a link.

The :abbr:`RSS (Rich Site Summary)` specification states that there can be at
most one enclosure per item.  However, Atom entries may contain more than one
enclosure per entry, so :program:`Universal Feed Parser` captures all of them
and makes them available as a list.


Accessing enclosures
--------------------

::

    >>> import feedparser
    >>> d = feedparser.parse('http://feedparser.org/docs/examples/rss20.xml')
    >>> e = d.entries[0]
    >>> len(e.enclosures)
    1
    >>> e.enclosures[0]
    {'type': u'audio/mpeg',
    'length': u'1069871',
    'href': u'http://example.org/audio/demo.mp3'}


Accessing feed cloud
--------------------

No one is quite sure what a cloud is.

::

    >>> import feedparser
    >>> d = feedparser.parse('http://feedparser.org/docs/examples/rss20.xml')
    >>> d.feed.cloud
    {'domain': u'rpc.example.com',
    'port': u'80',
    'path': u'/RPC2', 
    'registerprocedure': u'pingMe',
    'protocol': u'soap'}

.. note::

    For more examples of accessing :abbr:`RSS (Rich Site Summary)` elements,
    see the annotated examples: :ref:`annotated.rss10`, :ref:`annotated.rss20`,
    and :ref:`annotated.rss20dc`.