1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36
|
<html><head><meta charset="ISO-8859-1"><title>Chapter7.DOM XML Record Model and Filter Module</title><meta name="generator" content="DocBook XSL Stylesheets Vsnapshot"><link rel="home" href="index.html" title="Zebra - User's Guide and Reference"><link rel="up" href="index.html" title="Zebra - User's Guide and Reference"><link rel="prev" href="administration-extended-services.html" title="10.Extended Services: Remote Insert, Update and Delete"><link rel="next" href="record-model-domxml-pipeline.html" title="2.DOM XML filter pipeline configuration"></head><body><link rel="stylesheet" type="text/css" href="common/style1.css"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Chapter7.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> Record Model and Filter Module</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="administration-extended-services.html">Prev</a></td><th width="60%" align="center"></th><td width="20%" align="right"><a accesskey="n" href="record-model-domxml-pipeline.html">Next</a></td></tr></table><hr></div><div class="chapter"><div class="titlepage"><div><div><h1 class="title"><a name="record-model-domxml"></a>Chapter7.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> Record Model and Filter Module</h1></div></div></div><div class="toc"><p><b>Table of Contents</b></p><dl class="toc"><dt><span class="section"><a href="record-model-domxml.html#record-model-domxml-filter">1. <acronym class="acronym">DOM</acronym> Record Filter Architecture</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html">2. <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipeline configuration</a></span></dt><dd><dl><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-input">2.1. Input pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-extract">2.2. Extract pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-store">2.3. Store pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-pipeline-retrieve">2.4. Retrieve pipeline</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index">2.5. Canonical Indexing Format</a></span></dt><dd><dl><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index-pi">2.5.1. Processing-instruction governed indexing format</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index-element">2.5.2. Magic element governed indexing format</a></span></dt><dt><span class="section"><a href="record-model-domxml-pipeline.html#record-model-domxml-canonical-index-semantics">2.5.3. Semantics of the indexing formats</a></span></dt></dl></dd></dl></dd><dt><span class="section"><a href="record-model-domxml-conf.html">3. <acronym class="acronym">DOM</acronym> Record Model Configuration</a></span></dt><dd><dl><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-index">3.1. <acronym class="acronym">DOM</acronym> Indexing Configuration</a></span></dt><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-index-marc">3.2. <acronym class="acronym">DOM</acronym> Indexing <acronym class="acronym">MARCXML</acronym></a></span></dt><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-index-wizzard">3.3. <acronym class="acronym">DOM</acronym> Indexing Wizardry</a></span></dt><dt><span class="section"><a href="record-model-domxml-conf.html#record-model-domxml-debug">3.4. Debuggig <acronym class="acronym">DOM</acronym> Filter Configurations</a></span></dt></dl></dd></dl></div><p>
The record model described in this chapter applies to the fundamental,
structured <acronym class="acronym">XML</acronym>
record type <code class="literal"><acronym class="acronym">DOM</acronym></code>, introduced in
<a class="xref" href="architecture-maincomponents.html#componentmodulesdom" title="2.5.1.DOM XML Record Model and Filter Module">Section2.5.1, “<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> Record Model and Filter Module”</a>. The <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> record model
is experimental, and its inner workings might change in future
releases of the <span class="application">Zebra</span> Information Server.
</p><div class="section"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="record-model-domxml-filter"></a>1.<acronym class="acronym">DOM</acronym> Record Filter Architecture</h2></div></div></div><p>
The <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter uses a standard <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> structure as
internal data model, and can therefore parse, index, and display
any <acronym class="acronym">XML</acronym> document type. It is well suited to work on
standardized <acronym class="acronym">XML</acronym>-based formats such as Dublin Core, MODS, METS,
MARCXML, OAI-PMH, RSS, and performs equally well on any other
non-standard <acronym class="acronym">XML</acronym> format.
</p><p>
A parser for binary <acronym class="acronym">MARC</acronym> records based on the ISO2709 library
standard is provided, it transforms these to the internal
<acronym class="acronym">MARCXML</acronym> <acronym class="acronym">DOM</acronym> representation. Other binary document parsers
are planned to follow.
</p><p>
The <acronym class="acronym">DOM</acronym> filter architecture consists of four
different pipelines, each being a chain of arbitrarily many successive
<acronym class="acronym">XSLT</acronym> transformations of the internal <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym>
representations of documents.
</p><div class="figure"><a name="record-model-domxml-architecture-fig"></a><p class="title"><b>Figure7.1.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter architecture</b></p><div class="figure-contents"><div class="mediaobject"><img src="domfilter.png" alt="[Here there should be a diagram showing the DOM XML filter architecture, but is seems that your tool chain has not been able to include the diagram in this document.]"></div></div></div><br class="figure-break"><div class="table"><a name="record-model-domxml-architecture-table"></a><p class="title"><b>Table7.1.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipelines overview</b></p><div class="table-contents"><table class="table" summary="DOM XML filter pipelines overview" border="1"><colgroup><col><col><col><col><col></colgroup><thead><tr><th>Name</th><th>When</th><th>Description</th><th>Input</th><th>Output</th></tr></thead><tbody><tr><td><code class="literal">input</code></td><td>first</td><td>input parsing and initial
transformations to common <acronym class="acronym">XML</acronym> format</td><td>Input raw <acronym class="acronym">XML</acronym> record buffers, <acronym class="acronym">XML</acronym> streams and
binary <acronym class="acronym">MARC</acronym> buffers</td><td>Common <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td></tr><tr><td><code class="literal">extract</code></td><td>second</td><td>indexing term extraction
transformations</td><td>Common <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td><td>Indexing <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td></tr><tr><td><code class="literal">store</code></td><td>second</td><td> transformations before internal document
storage</td><td>Common <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td><td>Storage <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td></tr><tr><td><code class="literal">retrieve</code></td><td>third</td><td>multiple document retrieve transformations from
storage to different output
formats are possible</td><td>Storage <acronym class="acronym">XML</acronym> <acronym class="acronym">DOM</acronym></td><td>Output <acronym class="acronym">XML</acronym> syntax in requested formats</td></tr></tbody></table></div></div><br class="table-break"><p>
The <acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipelines use <acronym class="acronym">XSLT</acronym> (and if supported on
your platform, even <acronym class="acronym">EXSLT</acronym>), it brings thus full <acronym class="acronym">XPATH</acronym>
support to the indexing, storage and display rules of not only
<acronym class="acronym">XML</acronym> documents, but also binary <acronym class="acronym">MARC</acronym> records.
</p></div></div><div class="navfooter"><hr><table width="100%" summary="Navigation footer"><tr><td width="40%" align="left"><a accesskey="p" href="administration-extended-services.html">Prev</a></td><td width="20%" align="center"></td><td width="40%" align="right"><a accesskey="n" href="record-model-domxml-pipeline.html">Next</a></td></tr><tr><td width="40%" align="left" valign="top">10.Extended Services: Remote Insert, Update and Delete</td><td width="20%" align="center"><a accesskey="h" href="index.html">Home</a></td><td width="40%" align="right" valign="top">2.<acronym class="acronym">DOM</acronym> <acronym class="acronym">XML</acronym> filter pipeline configuration</td></tr></table></div></body></html>
|