<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<!--Converted with LaTeX2HTML 96.1-h (September 30, 1996) by Nikos Drakos (nikos@cbl.leeds.ac.uk), CBLU, University of Leeds -->
<HTML>
<HEAD>
<TITLE>Performance of ScaLAPACK</TITLE>
<META NAME="description" CONTENT="Performance of ScaLAPACK">
<META NAME="keywords" CONTENT="slug">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">
<LINK REL=STYLESHEET HREF="slug.css">
</HEAD>
<BODY LANG="EN" >
<BR> <P>
<H1><A NAME="SECTION04500000000000000000">Performance of ScaLAPACK</A></H1>
<A NAME="chapperformance"> </A><A NAME="3536"> </A>
<P>
This chapter presents performance numbers for ScaLAPACK
routines. The numbers are provided <EM>for illustration only</EM>
and <EM>should not</EM> be regarded as a definitive, up-to-date statement of
performance. They were selected from measurements
obtained in 1996-1997 during the development of version 1.4 of
ScaLAPACK. To obtain up-to-date performance figures, users should run
the timing programs provided with ScaLAPACK.
<P>
<BR> <HR>
<UL><A NAME="CHILD_LINKS"> </A>
<LI> <A NAME="tex2html3484" HREF="node105.html#SECTION04510000000000000000">Achieving High Performance with ScaLAPACK</A>
<UL>
<LI> <A NAME="tex2html3485" HREF="node106.html#SECTION04511000000000000000">Achieving High Performance on a Distributed Memory
Computer</A>
<LI> <A NAME="tex2html3486" HREF="node107.html#SECTION04512000000000000000">Achieving High Performance on a Network of Workstations</A>
</UL>
<LI> <A NAME="tex2html3487" HREF="node108.html#SECTION04520000000000000000">Performance, Portability and Scalability</A>
<UL>
<LI> <A NAME="tex2html3488" HREF="node109.html#SECTION04521000000000000000">The BLAS as the Key to (Trans)portable Efficiency</A>
<LI> <A NAME="tex2html3489" HREF="node110.html#SECTION04522000000000000000">Two-Dimensional Block Cyclic Data Distribution as a Key
to Load Balancing and Software Reuse</A>
<LI> <A NAME="tex2html3490" HREF="node111.html#SECTION04523000000000000000">BLACS as an Efficient, Portable and Adequate
Message-Passing Interface</A>
<UL>
<LI> <A NAME="tex2html3491" HREF="node112.html#SECTION04523100000000000000">Parallel Efficiency</A>
</UL>
<LI> <A NAME="tex2html3492" HREF="node113.html#SECTION04524000000000000000">ScaLAPACK Performance</A>
<LI> <A NAME="tex2html3493" HREF="node114.html#SECTION04525000000000000000">Performance of Selected BLACS and Level 3 BLAS Routines</A>
<UL>
<LI> <A NAME="tex2html3494" HREF="node115.html#SECTION04525100000000000000">Performance of Selected PBLAS routines</A>
<LI> <A NAME="tex2html3495" HREF="node116.html#SECTION04525200000000000000">Solution of Common Numerical Linear Algebra Problems</A>
</UL>
<LI> <A NAME="tex2html3496" HREF="node117.html#SECTION04526000000000000000">Solving Linear Systems of Equations</A>
<UL>
<LI> <A NAME="tex2html3497" HREF="node118.html#SECTION04526100000000000000">Solving Linear Least Squares Problems</A>
<LI> <A NAME="tex2html3498" HREF="node119.html#SECTION04526200000000000000">Eigenvalue Problems</A>
</UL>
</UL>
<LI> <A NAME="tex2html3499" HREF="node120.html#SECTION04530000000000000000">Performance Evaluation</A>
<UL>
<LI> <A NAME="tex2html3500" HREF="node121.html#SECTION04531000000000000000">Obtaining High Performance with ScaLAPACK Codes</A>
<LI> <A NAME="tex2html3501" HREF="node122.html#SECTION04532000000000000000">Checking the BLAS and BLACS Libraries</A>
<LI> <A NAME="tex2html3502" HREF="node123.html#SECTION04533000000000000000">Estimate Execution Time</A>
<LI> <A NAME="tex2html3503" HREF="node124.html#SECTION04534000000000000000">Determine Whether Reasonable Performance Is Achieved</A>
<LI> <A NAME="tex2html3504" HREF="node125.html#SECTION04535000000000000000">Identify Performance Bottlenecks</A>
<LI> <A NAME="tex2html3505" HREF="node126.html#SECTION04536000000000000000">Performance Bottlenecks in the Expert Symmetric
Eigenproblem Drivers</A>
</UL>
<LI> <A NAME="tex2html3506" HREF="node127.html#SECTION04540000000000000000">Performance Improvement</A>
<UL>
<LI> <A NAME="tex2html3507" HREF="node128.html#SECTION04541000000000000000">Choosing a Faster BLACS Library</A>
<LI> <A NAME="tex2html3508" HREF="node129.html#SECTION04542000000000000000">Choosing a Faster BLAS Library</A>
<LI> <A NAME="tex2html3509" HREF="node130.html#SECTION04543000000000000000">Tuning the Distribution Parameters for Better Performance</A>
</UL>
<LI> <A NAME="tex2html3510" HREF="node131.html#SECTION04550000000000000000">Performance of Banded and Out-of-Core Drivers</A>
</UL>
<BR> <HR>
<P><ADDRESS>
<I>Susan Blackford <BR>
Tue May 13 09:21:01 EDT 1997</I>
</ADDRESS>
</BODY>
</HTML>