File: README

package info (click to toggle)
libxrd-parser-perl 0.201-3
  • links: PTS, VCS
  • area: main
  • in suites: buster, jessie, jessie-kfreebsd, stretch
  • size: 348 kB
  • ctags: 270
  • sloc: perl: 4,041; makefile: 16
file content (241 lines) | stat: -rw-r--r-- 8,591 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
NAME
    XRD::Parser - parse XRD and host-meta files into RDF::Trine models

SYNOPSIS
      use RDF::Query;
      use XRD::Parser;
  
      my $parser = XRD::Parser->new(undef, "http://example.com/foo.xrd");
      my $results = RDF::Query->new(
        "SELECT * WHERE {?who <http://spec.example.net/auth/1.0> ?auth.}")
        ->execute($parser->graph);
        
      while (my $result = $results->next)
      {
        print $result->{'auth'}->uri . "\n";
      }

    or maybe:

      my $data = XRD::Parser->hostmeta('gmail.com')
                              ->graph
                                ->as_hashref;

DESCRIPTION
    While XRD has a rather different history, it turns out it can mostly be
    thought of as a serialisation format for a limited subset of RDF.

    This package ignores the order of <Link> elements, as RDF is a graph
    format with no concept of statements coming in an "order". The XRD spec
    says that grokking the order of <Link> elements is only a SHOULD. That
    said, if you're concerned about the order of <Link> elements, the
    callback routines allowed by this package may be of use.

    This package aims to be roughly compatible with RDF::RDFa::Parser's
    interface.

  Constructors
    "$p = XRD::Parser->new($content, $uri, [\%options], [$store])"
        This method creates a new XRD::Parser object and returns it.

        The $content variable may contain an XML string, or a
        XML::LibXML::Document. If a string, the document is parsed using
        XML::LibXML::Parser, which may throw an exception. XRD::Parser does
        not catch the exception.

        $uri the base URI of the content; it is used to resolve any relative
        URIs found in the XRD document.

        Options [default in brackets]:

        *       default_subject - If no <Subject> element. [undef]

        *       link_prop - How to handle <Property> in <Link>? 0=skip,
                1=reify, 2=subproperty, 3=both. [0]

        *       loose_mime - Accept text/plain, text/html and
                application/octet-stream media types. [0]

        *       tdb_service - Use thing-described-by.org when possible. [0]

        $storage is an RDF::Trine::Storage object. If undef, then a new
        temporary store is created.

    "$p = XRD::Parser->new_from_url($url, [\%options], [$storage])"
        $url is a URL to fetch and parse.

        This function can also be called as "new_from_uri". Same thing.

    "$p = XRD::Parser->hostmeta($uri)"
        This method creates a new XRD::Parser object and returns it.

        The parameter may be a URI (from which the hostname will be
        extracted) or just a bare host name (e.g. "example.com"). The
        resource "/.well-known/host-meta" will then be fetched from that
        host using an appropriate HTTP Accept header, and the parser object
        returned.

  Public Methods
    "$p->uri($uri)"
        Returns the base URI of the document being parsed. This will usually
        be the same as the base URI provided to the constructor.

        Optionally it may be passed a parameter - an absolute or relative
        URI - in which case it returns the same URI which it was passed as a
        parameter, but as an absolute URI, resolved relative to the
        document's base URI.

        This seems like two unrelated functions, but if you consider the
        consequence of passing a relative URI consisting of a zero-length
        string, it in fact makes sense.

    "$p->dom"
        Returns the parsed XML::LibXML::Document.

    "$p->graph"
        This method will return an RDF::Trine::Model object with all
        statements of the full graph.

        This method will automatically call "consume" first, if it has not
        already been called.

    $p->set_callbacks(\%callbacks)
        Set callback functions for the parser to call on certain events.
        These are only necessary if you want to do something especially
        unusual.

          $p->set_callbacks({
            'pretriple_resource' => sub { ... } ,
            'pretriple_literal'  => sub { ... } ,
            'ontriple'           => undef ,
            });

        Either of the two pretriple callbacks can be set to the string
        'print' instead of a coderef. This enables built-in callbacks for
        printing Turtle to STDOUT.

        For details of the callback functions, see the section CALLBACKS.
        "set_callbacks" must be used *before* "consume". "set_callbacks"
        itself returns a reference to the parser object itself.

        *NOTE:* the behaviour of this function was changed in version 0.05.

    "$p->consume"
        This method processes the input DOM and sends the resulting triples
        to the callback functions (if any).

        It called again, does nothing.

        Returns the parser object itself.

  Utility Functions
    "$host_uri = XRD::Parser::host_uri($uri)"
        Returns a URI representing the host. These crop up often in graphs
        gleaned from host-meta files.

        $uri can be an absolute URI like 'http://example.net/foo#bar' or a
        host name like 'example.com'.

    "$uri = XRD::Parser::template_uri($relationship_uri)"
        Returns a URI representing not a normal relationship, but the
        relationship between a host and a template URI literal.

    "$hostmeta_uri = XRD::Parser::hostmeta_location($host)"
        The parameter may be a URI (from which the hostname will be
        extracted) or just a bare host name (e.g. "example.com"). The
        location for a host-meta file relevant to the host of that URI will
        be calculated.

        If called in list context, returns an 'https' URI and an 'http' URI
        as a list.

CALLBACKS
    Several callback functions are provided. These may be set using the
    "set_callbacks" function, which taskes a hashref of keys pointing to
    coderefs. The keys are named for the event to fire the callback on.

  pretriple_resource
    This is called when a triple has been found, but before preparing the
    triple for adding to the model. It is only called for triples with a
    non-literal object value.

    The parameters passed to the callback function are:

    *   A reference to the "XRD::Parser" object

    *   A reference to the "XML::LibXML::Element" being parsed

    *   Subject URI or bnode (string)

    *   Predicate URI (string)

    *   Object URI or bnode (string)

    The callback should return 1 to tell the parser to skip this triple (not
    add it to the graph); return 0 otherwise.

  pretriple_literal
    This is the equivalent of pretriple_resource, but is only called for
    triples with a literal object value.

    The parameters passed to the callback function are:

    *   A reference to the "XRD::Parser" object

    *   A reference to the "XML::LibXML::Element" being parsed

    *   Subject URI or bnode (string)

    *   Predicate URI (string)

    *   Object literal (string)

    *   Datatype URI (string or undef)

    *   Language (string or undef)

    The callback should return 1 to tell the parser to skip this triple (not
    add it to the graph); return 0 otherwise.

  ontriple
    This is called once a triple is ready to be added to the graph. (After
    the pretriple callbacks.) The parameters passed to the callback function
    are:

    *   A reference to the "XRD::Parser" object

    *   A reference to the "XML::LibXML::Element" being parsed

    *   An RDF::Trine::Statement object.

    The callback should return 1 to tell the parser to skip this triple (not
    add it to the graph); return 0 otherwise. The callback may modify the
    RDF::Trine::Statement object.

WHY RDF?
    It abstracts away the structure of the XRD file, exposing just the
    meaning of its contents. Two XRD files with the same meaning should end
    up producing more or less the same RDF data, even if they differ
    significantly at the syntactic level.

    If you care about the syntax of an XRD file, then use XML::LibXML.

SEE ALSO
    RDF::Trine, RDF::Query, RDF::RDFa::Parser.

    <http://www.perlrdf.org/>.

AUTHOR
    Toby Inkster, <tobyink@cpan.org>

COPYRIGHT AND LICENCE
    Copyright (C) 2009-2012 by Toby Inkster

    This library is free software; you can redistribute it and/or modify it
    under the same terms as Perl itself.

DISCLAIMER OF WARRANTIES
    THIS PACKAGE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED
    WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF
    MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE.