File: usecases.html

package info (click to toggle)
hsc 0.916-2
links: PTS
area: main
in suites: hamm, slink
size: 2,584 kB
ctags: 2,277
sloc: ansic: 17,375; makefile: 396
file content (271 lines) | stat: -rw-r--r-- 14,426 bytes
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<TITLE>hsc - Some Use-Cases</TITLE>
<LINK REV="owns" TITLE="Thomas Aglassinger" HREF="mailto:agi@giga.or.at">
<LINK REL="Next" HREF="questions.html">
<LINK REL="Copyright" HREF="copy.html">
<LINK REL="Previous" HREF="project/hscpaltrow.html">
<META name="ROBOTS" content="NOINDEX, NOFOLLOW">
</HEAD>
<BODY>
<A HREF="index.html"><IMG SRC="image/main.gif" ALT="Contents" ALIGN="middle" WIDTH="70" HEIGHT="16"></A>
<IMG SRC="image/noindex.gif" ALT="-----" ALIGN="middle" WIDTH="70" HEIGHT="16">
<A HREF="copy.html"><IMG SRC="image/copy.gif" ALT="Copyright" ALIGN="middle" WIDTH="70" HEIGHT="16"></A>
<A HREF="index.html"><IMG SRC="image/back.gif" ALT="Up" ALIGN="middle" WIDTH="70" HEIGHT="16"></A>
<A HREF="project/hscpaltrow.html"><IMG SRC="image/prev.gif" ALT="Previous" ALIGN="middle" WIDTH="70" HEIGHT="16"></A>
<A HREF="questions.html"><IMG SRC="image/next.gif" ALT="Next" ALIGN="middle" WIDTH="70" HEIGHT="16"></A>
<HR>
<H1>Some Use-Cases</H1>
This chapter gives some hints how you can do certain things you might
be interested in. It is probably only interesting for experienced
users.
<H2>Multi-lingual Documents</H2>
This will shortly outline how to create multiple versions of one
source file, for example to support more than one language. For
example, you want to have a document with a picture being the same in
all flavours, but a text depending on the language. Maybe you have a
source file <I>hugo.hsc</I> with the interesting part looking
like

<PRE>
&lt;img src="hugo.gif" alt="hugo&gt;
&lt;english&gt;This is Hugo.&lt;/english&gt;
&lt;suomi&gt;T&auml;m&auml; on Hugo.&lt;/suomi&gt;
</PRE>
and as result, you want two documents: an English one

<PRE>
&lt;img src="hugo.gif" alt="hugo&gt;
This is Hugo.
</PRE>
and a Finnish one

<PRE>
&lt;img src="hugo.gif" alt="hugo&gt;
T&auml;m&auml; on Hugo.
</PRE>
This can easily be achieved by defined two macro set, one
begin stored as <I>english.hsc</I>

<PRE>
&lt;$macro english /close&gt;&lt;$content&gt;&lt;/$english&gt;
&lt;$macro suomi /close&gt;&lt;/$suomi&gt;
</PRE>
and another one stored as <I>suomi.hsc</I>

<PRE>
&lt;$macro english /close&gt;&lt;/$english&gt;
&lt;$macro suomi /close&gt;&lt;$content&gt;&lt;/$suomi&gt;
</PRE>
<P>The first one defines two container macros, with <CODE>&lt;english&gt;</CODE>
simply every time inserting the whole content passed to it, and
<CODE>&lt;suomi&gt;</CODE> always removing any content enclosed in it.</P>
If you now invoke <KBD>hsc</KBD> with a call like

<PRE>hsc english.hsc hugo.hsc to en-hugo.html</PRE>
it will look like the first output document described above. To gain
a result looking like the second one, you only have to use

<PRE>hsc suomi.hsc hugo.hsc to fi-hugo.html</PRE>
<P>This is simply because the macros declared in
<I>suomi.hsc</I> work just the other way round like those in
<I>english.hsc</I>: everything enclosed in <CODE>&lt;english&gt;</CODE> will
be ignored, and everything being part of <CODE>&lt;suomi&gt;</CODE> remains.</P>
Of course you can have mutiple occurrences of both macros, and of
course you can define similar macros for other languages.
<H2>Extended Character Encodings</H2>
<P>This version of <KBD>hsc</KBD> officially only supports Latin-1 as input
character set. The exact definition of that is a bit messy, but
basically it refers to most of those 255 characters you can input with
your Amiga-Keyboard.</P>
<P>For this character set, all functions described herein should work,
especially the CLI-option <A HREF="options.html#rplcent"><KBD>RPLCENT</KBD></A>.</P>
<P>Although Latin-1 is widely used within most decadent western
countries, it does not provide all characters some people might need.
For instance those from China and Japan, as their writing systems work
completely different.</P>
<P>As the trivial idea if Latin was to use 8 bit instead of the rotten
7 bit of ASCII (note that the ``A'' in ASCII, stands for
American), the trivial idea of popular encodings like JIS, Shift-JIS or
EUC is to use 8 to 24 bit to encode one character.</P>
<P>Now what does <KBD>hsc</KBD> say if you feed such a document to it?</P>
<P>Unless you do not specify <KBD>RPLCENT</KBD>, it should work
without much bothering about it. However, you will need a w3-browser
that also can display these encodings, and some fiddling with
<CODE>&lt;META&gt;</CODE> and related tags.</P>
<P>If you think you are funny and enable <KBD>RPLCENT</KBD>, <KBD>hsc</KBD>
will still not mind your input. But with great pleasure it will cut
all your nice multi-byte characters into decadent western 8-bit
``cripplets'' (note the pun). And your browser will display loads
of funny western characters - but not a single funny Japanese one.</P>
<P>Recently an old western approach to these encodings problems has
gained popularity: Unicode - that's the name of the beast - was
created as some waste product of the Taligent project around 1998 or
so, as far as I recall.</P>
<P>Initially created as an unpopular gadget not supported by anything,
it is now in everybody's mouth, because Java, the language-of-hype,
several MS-DOS based operating systems and now - finally - the rotten
hypertext language-of-hype support it. At least to some limited
extent. (Technical note: Usually you only read of UCS-2 instead of
UCS-4 in all those specifications, and maybe some blurred proposals to
use UTF-16 later.)</P>
<P>As <KBD>hsc</KBD> is written in the rotten C-language (an American product,
by the way), it can not cope with zero-bytes in its input data, and
therefore is unable to read data encoded in UCS-4, UTF-16 or (w&uuml;rg,
kotz, reiha) UCS-2; it simply will stop after the first zero in the
input.</P>
<P>Because the rotten C-language is so widely used, there are some
zero-byte work-around formats for Unicode, most remarkably UTF-8 and
UTF-7. These work together with <KBD>hsc</KBD>, although with the same
limitations you have to care for when using the eastern encodings
mentioned earlier.</P>
<P>Note that it needs at least five encodings to make Unicode work
with most software - again in alphabetical order: UCS-2, UCS-4,
UTF-16, UTF-7 and UTF-8. I wonder what the "Uni" stands for...</P>
Anyway, as conclusion: you can use several extended character sets,
but you must not enable <KBD>RPLCENT</KBD>.
<H2>Html 4.0</H2>
<P>Recently, html-4.0 was released, and it sucks surprisingly less (as
far as "sucks less" is applicable at all to html). Of course there
currently is no browser capable of displaying all these things, but
nevertheless you can use <KBD>hsc</KBD> to author for it - with some
limitations. This will shortly outline how.</P>
<P>As already mentioned, html now supports those extended character
encodings. See above how to deal with input files using such an
encoding, and which to avoid.</P>
<P>If your system does not allow you to input funny characters (for
instance one can easily spend ATS 500.000 on a Workstation just for
being absolutely unable to enter a simple ``&auml;''), you can use
numeric entities, both in their decimal or hexadecimal representation:
for example, to insert a Greek Alpha, you can use
<CODE>&amp;#913</CODE> or <CODE>&amp;#x391</CODE>, <KBD>hsc</KBD> will accept
both. However, you still can not define entities beyond 8-bit range
using <A HREF="features/prefs.html#defent"><CODE>&lt;$defent&gt;</CODE></A>.</P>
<P>Some highlights are that the <CODE>ALT</CODE> attribute of <CODE>&lt;IMG&gt;</CODE>
is no required and that there are now loads of ``URIs'' instead of
``URLs'' around. Nothing new for old <KBD>hsc</KBD>-users... he he he.</P>
<P>Another interesting thing is that the DTD now contains some
meta-information that was not part of earlier DTDs so it maybe can
make sense to use the DTD as a base for <I>hsc.prefs</I>.</P>
<P>Therefor, you can download some shabby ARexx-scripts from the
support-w3-page for <KBD>hsc</KBD> that try to do exactly that task. The
problem is that they are extremely lousy and the sgml-inventors might
suggest the death penalty for my pseudo-sgml-parser, a pathetic
``masterpiece'' of reverse engineering. That's why you can not
find it on Aminet or somewhere else.</P>
<P>But most likely you will not need this at all as I already included
the output of these scripts in the main archive. Look for
<I>hsc-html-40.prefs</I> and use it instead of <I>hsc.prefs</I>.
Remember to create a backup of your old <I>hsc.prefs</I> before.</P>
Viewing it, you will stumble across a currently nowhere documented tag
called <CODE>&lt;$varlist&gt;</CODE>. This is introduced to make it easier to
refer to groups of often used attributes with one name. For example,
you can use

<PRE>&lt;$varlist coreattrs id:id class:string style:string title:string&gt;</PRE>
to daclare such a group with the name <CODE>coreattrs</CODE>,
consisting of the attributes <CODE>id</CODE>, <CODE>style</CODE> and
<CODE>title</CODE>. Later on, in a declaration of some tag, you can
write this name enclosed into square brackets, instead of listening
all these attributes , like

<PRE>&lt;$deftag A .. [coreattrs] ..&gt;</PRE>
instead of

<PRE>&lt;$deftag A .. id:id class:string..&gt;</PRE>
Although this also works within macro declarations, it is officially
undocumented and therefor discouraged to be used outside <I>hsc.prefs</I>.
<P>The main reason for these scripts: They should point out how this
principle could work. The current implementation is near to useless,
but maybe someone writes a reasonable version. Definitely not me, as
html-1.0 plus tables is all I ever will need. It's up to you, all you
self-satisfied, passive and degenerated users.</P>
<H2>Creating A Postscript Version</H2>
<P>As you can now optionally read this manual in a Postscript version,
there might be some interest how it was done.</P>
<P>The rudimentarily bearable application used for conversion is (very
originally) called <KBD>html2ps</KBD> and can be obtained from <A
HREF="http://www.tdb.uu.se/~jan/html2ps.html">http://www.tdb.uu.se/~jan/html2ps.html</A>. As
common with such tools, "it started out as a small hack" and "what
really needs to be done is a complete rewriting of the code", but "it
is quite unlikely that this [...] will take place". The usual standard
disclaimer of every public Perl-script. All quotes taken from the
manual to <KBD>html2ps</KBD>.</P>
<P>Basically the html- and the Postscript-version contain the same
words. However, there are still some differences, for example the
printed version does not need the toolbar for navigation provided at
the top of every html-page.</P>
<P>Therefore, I wrote two macros, <CODE>&lt;html-only&gt;</CODE> and
<CODE>&lt;postscript-only&gt;</CODE>. The principle works exactly like the one
described for <CODE>&lt;english&gt;</CODE> and <CODE>&lt;suomi&gt;</CODE> earlier in this
chapter, and you can find them in <A HREF="../docs-source/inc/html.hsc"><I>docs-source/inc/html.hsc</I></A> and <A HREF="../docs-source/inc/ps.hsc"><I>docs-source/inc/ps.hsc</I></A>.</P>
<P>However, there is a small difference to the multi-lingual examples,
as I do not really want to create two versions all the time. Instead,
I prefer to create either create a fully hypertext featured version or
a crippled Postscript-prepared html-document in the same location.</P>
<P>You can inspect <A HREF="../docs-source/Makefile"><I>docs-source/Makefile</I></A> how this
is done: if <KBD>make</KBD> is invoked without any special options, the
hypertext version is created. But if you instead use <KBD>make
PS=1</KBD> and therefor define a symbol named <CODE>PS</CODE>, the
pattern rule responsible for creating the html-documents acts
differently and produces a reduced, Postscript-prepared document
without toolbar.</P>
Basically, the rule looks like this:

<PRE>
$(DESTDIR)%.html : %.hsc
ifdef PS
        @$(HSC) inc/ps.hsc   $(HSCFLAGS) $&lt;
else
        @$(HSC) inc/html.hsc $(HSCFLAGS) $&lt;
endif
</PRE>
<P>Needless to say that the conditional in the <I>Makefile</I> does not
work with every <KBD>make</KBD> - I used <KBD>GNUmake</KBD> for that, your
<KBD>make</KBD>-tool maybe has a slightly different syntax.</P>
<P>For my convenience, there are two rules called <CODE>rebuild</CODE>
and <CODE>rebuild_ps</CODE> with their meanings being obvious: they
rebuild the whole manual in the desired flavour.</P>
<P>So after a successful <CODE>make rebuild_ps</CODE>, everything only
waits for <KBD>html2ps</KBD>. Maybe you want to have a look at the
<A HREF="../docs-source/html2ps.config"><I>docs-source/html2ps.config</I></A> used, although it is
strait forward and does not contain anything special. This should not
need any further comments, as there is a quite useful manual supplied
with it.</P>
<P>However, making <KBD>html2ps</KBD> work with an Amiga deserves some
remarks. As you might already have guessed, you will need the
Perl-archives of GG/ADE - no comments on that, everybody interested
should know what and where GG is.</P>
I suppose you can try the full Unix-alike approach with <KBD>hsc</KBD> compiled
for AmigaOS/ixemul and GG more or less taking over your machine, and
therefor directly invoke <KBD>perl</KBD>. This will require a rule
like

<PRE>
ps :
        html2ps -W l -f html2ps.config -o ../../hsc.ps ../docs/index.html
</PRE>
<P>As I am a dedicated hater of this, I used the AmigaOS-binary, a
SAS-compiled <KBD>GNUmake</KBD> and the standard CLI. A usually
quite successful way to make such things work is with the help of
<KBD>ksh</KBD>, which, for your confusion, is in a archive at GG
called something like <I>pdksh-xxx.tgz</I> (for ``Public
Domain ksh''). Invoking <KBD>ksh</KBD> with no arguments will
start a whole shell-session (w&uuml;rg!), but you can use the switch
<KBD>-c</KBD> to pass a single command to be executed. After that,
<KBD>ksh</KBD> will automatically exit, and you are back in your
cosy CLI, just as if nothing evil has had happened seconds before.</P>
<P>So finally the rule to convert all those html-files into one
huge Postscrip file on my machine is:

<PRE>
ps :
        ksh -c "perl /bin/html2ps -W l -f html2ps.config -o ../../hsc.ps ../docs/index.html"
</PRE>
<P>Note that <KBD>html2ps</KBD> is smart enough to follow those
(normally invisible) <CODE>&lt;LINK REL="next" ..&gt;</CODE> tags being part of
the html-documents, so only the first file is provided as argument,
and it will automatically convert the other ones.</P>.
<P>Well, it least you see it can be done.</P>
</BODY></HTML>