File: record-model-domxml.html

package info (click to toggle)
idzebra 2.2.10-1
  • links: PTS, VCS
  • area: main
  • in suites:
  • size: 10,644 kB
  • sloc: ansic: 54,389; xml: 27,054; sh: 6,211; makefile: 1,099; perl: 210; tcl: 64
file content (36 lines) | stat: -rw-r--r-- 9,951 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
<html><head><meta charset="ISO-8859-1"><title>Chapter7.DOM XML Record Model and Filter Module</title><meta name="generator" content="DocBook XSL Stylesheets Vsnapshot"><link rel="home" href="index.html" title="Zebra - User's Guide and Reference"><link rel="up" href="index.html" title="Zebra - User's Guide and Reference"><link rel="prev" href="administration-extended-services.html" title="10.Extended Services: Remote Insert, Update and Delete"><link rel="next" href="record-model-domxml-pipeline.html" title="2.DOM XML filter pipeline configuration"></head><body><link rel="stylesheet" type="text/css" href="common/style1.css"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter7.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> Record Model and Filter Module</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="administration-extended-services.html">Prev</a></td><th width="60%" align="center"></th><td width="20%" align="right"><a accesskey="n" href="record-model-domxml-pipeline.html">Next</a></td></tr></table><hr></div><div class="chapter"><div class="titlepage"><div><div><h1 class="title"><a name="record-model-domxml"></a>Chapter7.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> Record Model and Filter Module</h1></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl class="toc"><dt><span class="section"><a href="record-model-domxml.html#record-model-domxml-filter">1. <acronym class="acronym">DOM</acronym> Record Filter Architecture</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html">2. <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipeline configuration</a></span></dt><dd><dl><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-input">2.1. Input pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-extract">2.2. Extract pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-store">2.3. Store pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-retrieve">2.4. Retrieve pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index">2.5. Canonical Indexing Format</a></span></dt><dd><dl><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index-pi">2.5.1. Processing-instruction governed indexing format</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index-element">2.5.2. Magic element governed indexing format</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index-semantics">2.5.3. Semantics of the indexing formats</a></span></dt></dl></dd></dl></dd><dt><span class="section"><a href="record-model-domxml-conf.html">3. <acronym class="acronym">DOM</acronym> Record Model Configuration</a></span></dt><dd><dl><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-index">3.1. <acronym class="acronym">DOM</acronym> Indexing Configuration</a></span></dt><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-index-marc">3.2. <acronym class="acronym">DOM</acronym> Indexing <acronym class="acronym">MARCXML</acronym></a></span></dt><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-index-wizzard">3.3. <acronym class="acronym">DOM</acronym> Indexing Wizardry</a></span></dt><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-debug">3.4. Debuggig <acronym class="acronym">DOM</acronym> Filter Configurations</a></span></dt></dl></dd></dl></div><p>
   The record model described in this chapter applies to the fundamental,
   structured <acronym class="acronym">XML</acronym>
   record type <code class="literal"><acronym class="acronym">DOM</acronym></code>, introduced in
   <a class="xref" href="architecture-maincomponents.html#componentmodulesdom" title="2.5.1.DOM XML Record Model and Filter Module">Section2.5.1, &#8220;<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> Record Model and Filter Module&#8221;</a>. The <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> record model
   is experimental, and its inner workings might change in future
   releases of the <span class="application">Zebra</span> Information Server.
  </p><div class="section"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="record-model-domxml-filter"></a>1.<acronym class="acronym">DOM</acronym> Record Filter Architecture</h2></div></div></div><p>
    The <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter uses a standard <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> structure as
    internal data model, and can therefore parse, index, and display
    any <acronym class="acronym">XML</acronym> document type. It is well suited to work on
    standardized <acronym class="acronym">XML</acronym>-based formats such as Dublin Core, MODS, METS,
    MARCXML, OAI-PMH, RSS, and performs equally  well on any other
    non-standard <acronym class="acronym">XML</acronym> format.
   </p><p>
    A parser for binary <acronym class="acronym">MARC</acronym> records based on the ISO2709 library
    standard is provided, it transforms these to the internal
    <acronym class="acronym">MARCXML</acronym> <acronym class="acronym">DOM</acronym> representation. Other binary document parsers
    are planned to follow.
   </p><p>
    The <acronym class="acronym">DOM</acronym> filter architecture consists of four
    different pipelines, each being a chain of arbitrarily many successive
    <acronym class="acronym">XSLT</acronym> transformations of the internal <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym>
    representations of documents.
   </p><div class="figure"><a name="record-model-domxml-architecture-fig"></a><p class="title"><b>Figure7.1.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter architecture</b></p><div class="figure-contents"><div class="mediaobject"><img src="domfilter.png" alt="[Here there should be a diagram showing the DOM XML filter architecture, but is seems that your tool chain has not been able to include the diagram in this document.]"></div></div></div><br class="figure-break"><div class="table"><a name="record-model-domxml-architecture-table"></a><p class="title"><b>Table7.1.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipelines overview</b></p><div class="table-contents"><table class="table" summary="DOM XML filter pipelines overview" border="1"><colgroup><col><col><col><col><col></colgroup><thead><tr><th>Name</th><th>When</th><th>Description</th><th>Input</th><th>Output</th></tr></thead><tbody><tr><td><code class="literal">input</code></td><td>first</td><td>input parsing and initial
	transformations to common <acronym class="acronym">XML</acronym> format</td><td>Input raw <acronym class="acronym">XML</acronym> record buffers, <acronym class="acronym">XML</acronym>  streams and
	binary <acronym class="acronym">MARC</acronym> buffers</td><td>Common <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td></tr><tr><td><code class="literal">extract</code></td><td>second</td><td>indexing term extraction
	transformations</td><td>Common <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td><td>Indexing <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td></tr><tr><td><code class="literal">store</code></td><td>second</td><td> transformations before internal document
	storage</td><td>Common <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td><td>Storage <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td></tr><tr><td><code class="literal">retrieve</code></td><td>third</td><td>multiple document retrieve transformations from
	storage to different output
	formats are possible</td><td>Storage <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td><td>Output <acronym class="acronym">XML</acronym> syntax in requested formats</td></tr></tbody></table></div></div><br class="table-break"><p>
    The <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipelines use <acronym class="acronym">XSLT</acronym> (and if  supported on
    your platform, even <acronym class="acronym">EXSLT</acronym>), it brings thus full <acronym class="acronym">XPATH</acronym>
    support to the indexing, storage and display rules of not only
    <acronym class="acronym">XML</acronym> documents, but also binary <acronym class="acronym">MARC</acronym> records.
   </p></div></div><div class="navfooter"><hr><table width="100%" summary="Navigation footer"><tr><td width="40%" align="left"><a accesskey="p" href="administration-extended-services.html">Prev</a></td><td width="20%" align="center"></td><td width="40%" align="right"><a accesskey="n" href="record-model-domxml-pipeline.html">Next</a></td></tr><tr><td width="40%" align="left" valign="top">10.Extended Services: Remote Insert, Update and Delete</td><td width="20%" align="center"><a accesskey="h" href="index.html">Home</a></td><td width="40%" align="right" valign="top">2.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipeline configuration</td></tr></table></div></body></html>