File: node129.html

package info (click to toggle)
scalapack-doc 1.5-11
  • links: PTS
  • area: main
  • in suites: bullseye, buster, stretch
  • size: 10,336 kB
  • ctags: 4,931
  • sloc: makefile: 47; sh: 18
file content (49 lines) | stat: -rw-r--r-- 2,839 bytes parent folder | download | duplicates (4)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<!--Converted with LaTeX2HTML 96.1-h (September 30, 1996) by Nikos Drakos (nikos@cbl.leeds.ac.uk), CBLU, University of Leeds -->
<HTML>
<HEAD>
<TITLE>Choosing a Faster BLAS Library</TITLE>
<META NAME="description" CONTENT="Choosing a Faster BLAS Library">
<META NAME="keywords" CONTENT="slug">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">
<LINK REL=STYLESHEET HREF="slug.css">
</HEAD>
<BODY LANG="EN" >
 <A NAME="tex2html3822" HREF="node130.html"><IMG WIDTH=37 HEIGHT=24 ALIGN=BOTTOM ALT="next" SRC="http://www.netlib.org/utk/icons/next_motif.gif"></A> <A NAME="tex2html3820" HREF="node127.html"><IMG WIDTH=26 HEIGHT=24 ALIGN=BOTTOM ALT="up" SRC="http://www.netlib.org/utk/icons/up_motif.gif"></A> <A NAME="tex2html3814" HREF="node128.html"><IMG WIDTH=63 HEIGHT=24 ALIGN=BOTTOM ALT="previous" SRC="http://www.netlib.org/utk/icons/previous_motif.gif"></A> <A NAME="tex2html3824" HREF="node1.html"><IMG WIDTH=65 HEIGHT=24 ALIGN=BOTTOM ALT="contents" SRC="http://www.netlib.org/utk/icons/contents_motif.gif"></A> <A NAME="tex2html3825" HREF="node190.html"><IMG WIDTH=43 HEIGHT=24 ALIGN=BOTTOM ALT="index" SRC="http://www.netlib.org/utk/icons/index_motif.gif"></A> <BR>
<B> Next:</B> <A NAME="tex2html3823" HREF="node130.html">Tuning the Distribution Parameters </A>
<B>Up:</B> <A NAME="tex2html3821" HREF="node127.html">Performance Improvement</A>
<B> Previous:</B> <A NAME="tex2html3815" HREF="node128.html">Choosing a Faster BLACS </A>
<BR> <P>
<H2><A NAME="SECTION04542000000000000000">Choosing a Faster BLAS Library</A></H2>
<P>
Highly efficient machine-specific implementations of the BLAS are
available for many modern high-performance computers.
Users who cannot obtain an efficient BLAS for
your architecture may be able to 
create one from by using a set of BLAS that 
requires only an efficient implementation of the matrix-matrix multiply
BLAS routine xGEMM&nbsp;[<A HREF="node189.html#dayde94a">35</A>, <A HREF="node189.html#kagstrom95b">90</A>], combined 
with an automatically generated machine-specific and
efficient implementation
of xGEMM&nbsp;[<A HREF="node189.html#lawn111">16</A>].
<P>
Users who are
using one of the computers
listed in this chapter should refer to Tables&nbsp;<A HREF="node113.html#tabnode">5.2</A>
and&nbsp;<A HREF="node113.html#tabnode2">5.3</A>
to see which library we used for timing.
Otherwise, 
the computer vendor may be able to provide information
about optimized BLAS for a specific  computer.
<P>
A reference Fortran&nbsp;77 implementation of the BLAS is available
from the <EM>blas</EM> directory on <EM>netlib</EM>.
<BLOCKQUOTE> <TT>http://www.netlib.org/blas/blas.shar</TT>
</BLOCKQUOTE><BR> <HR>
<P><ADDRESS>
<I>Susan Blackford <BR>
Tue May 13 09:21:01 EDT 1997</I>
</ADDRESS>
</BODY>
</HTML>