<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<!--Converted with LaTeX2HTML 96.1-h (September 30, 1996) by Nikos Drakos (nikos@cbl.leeds.ac.uk), CBLU, University of Leeds -->
<HTML>
<HEAD>
<TITLE>Choosing a Faster BLAS Library</TITLE>
<META NAME="description" CONTENT="Choosing a Faster BLAS Library">
<META NAME="keywords" CONTENT="slug">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">
<LINK REL=STYLESHEET HREF="slug.css">
</HEAD>
<BODY LANG="EN" >
<A NAME="tex2html3822" HREF="node130.html"><IMG WIDTH=37 HEIGHT=24 ALIGN=BOTTOM ALT="next" SRC="http://www.netlib.org/utk/icons/next_motif.gif"></A> <A NAME="tex2html3820" HREF="node127.html"><IMG WIDTH=26 HEIGHT=24 ALIGN=BOTTOM ALT="up" SRC="http://www.netlib.org/utk/icons/up_motif.gif"></A> <A NAME="tex2html3814" HREF="node128.html"><IMG WIDTH=63 HEIGHT=24 ALIGN=BOTTOM ALT="previous" SRC="http://www.netlib.org/utk/icons/previous_motif.gif"></A> <A NAME="tex2html3824" HREF="node1.html"><IMG WIDTH=65 HEIGHT=24 ALIGN=BOTTOM ALT="contents" SRC="http://www.netlib.org/utk/icons/contents_motif.gif"></A> <A NAME="tex2html3825" HREF="node190.html"><IMG WIDTH=43 HEIGHT=24 ALIGN=BOTTOM ALT="index" SRC="http://www.netlib.org/utk/icons/index_motif.gif"></A> <BR>
<B> Next:</B> <A NAME="tex2html3823" HREF="node130.html">Tuning the Distribution Parameters </A>
<B>Up:</B> <A NAME="tex2html3821" HREF="node127.html">Performance Improvement</A>
<B> Previous:</B> <A NAME="tex2html3815" HREF="node128.html">Choosing a Faster BLACS </A>
<BR> <P>
<H2><A NAME="SECTION04542000000000000000">Choosing a Faster BLAS Library</A></H2>
<P>
Highly efficient machine-specific implementations of the BLAS are
available for many modern high-performance computers.
Users who cannot obtain an efficient BLAS for
their architecture may be able to
create one by using a set of BLAS that
requires only an efficient implementation of the matrix-matrix multiply
BLAS routine xGEMM [<A HREF="node189.html#dayde94a">35</A>, <A HREF="node189.html#kagstrom95b">90</A>], combined
with an automatically generated, machine-specific,
efficient implementation
of xGEMM [<A HREF="node189.html#lawn111">16</A>].
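<P>
The idea of building the remaining Level 3 BLAS on top of xGEMM can be
illustrated with a minimal sketch. It is not taken from the cited
packages: the names <TT>gemm_update</TT> and <TT>trsm_lower_blocked</TT>
and the block size <TT>nb</TT> are hypothetical, and plain Python loops
stand in for a tuned xGEMM. The sketch solves a lower triangular system
with several right-hand sides so that only the small diagonal blocks are
handled by unblocked code, while the bulk of the arithmetic is done in
matrix-matrix multiply updates.

```python
def gemm_update(C, A, B, rows, inner, cols, alpha=1.0):
    """Add alpha * A[rows, inner] * B[inner, cols] to C[rows, cols] in place.

    Assumption: this loop nest stands in for a call to a tuned xGEMM
    applied to sub-blocks of the matrices.
    """
    for i in rows:
        for j in cols:
            s = 0.0
            for k in inner:
                s += A[i][k] * B[k][j]
            C[i][j] += alpha * s


def trsm_lower_blocked(L, B, nb=2):
    """Solve L X = B for X, where L is lower triangular with nonzero diagonal.

    L is n-by-n and B is n-by-m, both stored as lists of rows.
    Returns X as a new list of rows; L and B are not modified.
    """
    n = len(L)
    m = len(B[0])
    X = [row[:] for row in B]
    cols = range(m)
    for k0 in range(0, n, nb):
        k1 = min(k0 + nb, n)
        # Small diagonal block: unblocked forward substitution.
        for i in range(k0, k1):
            for j in cols:
                s = X[i][j]
                for p in range(k0, i):
                    s -= L[i][p] * X[p][j]
                X[i][j] = s / L[i][i]
        # Trailing rows: a single GEMM-style update,
        # X[k1:n, :] -= L[k1:n, k0:k1] * X[k0:k1, :].
        gemm_update(X, L, X, range(k1, n), range(k0, k1), cols, alpha=-1.0)
    return X


if __name__ == "__main__":
    L = [[2.0, 0.0, 0.0],
         [1.0, 1.0, 0.0],
         [4.0, 2.0, 2.0]]
    B = [[2.0, 4.0],
         [4.0, 6.0],
         [20.0, 28.0]]
    print(trsm_lower_blocked(L, B, nb=2))
```

As the block size grows relative to the matrix order, a larger share of
the floating-point work moves into the unblocked diagonal solves, so in
this sketch <TT>nb</TT> would be chosen small enough that the
<TT>gemm_update</TT> calls dominate.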
<P>
Users of one of the computers
listed in this chapter should refer to Tables <A HREF="node113.html#tabnode">5.2</A>
and <A HREF="node113.html#tabnode2">5.3</A>
to see which BLAS library we used for timing.
Otherwise,
the computer vendor may be able to provide information
about an optimized BLAS for a specific computer.
<P>
A reference Fortran 77 implementation of the BLAS is available
from the <EM>blas</EM> directory on <EM>netlib</EM>.
<BLOCKQUOTE> <TT>http://www.netlib.org/blas/blas.shar</TT>
</BLOCKQUOTE><BR> <HR>
<P><ADDRESS>
<I>Susan Blackford <BR>
Tue May 13 09:21:01 EDT 1997</I>
</ADDRESS>
</BODY>
</HTML>