1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268
|
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<!--Converted with LaTeX2HTML 96.1-h (September 30, 1996) by Nikos Drakos (nikos@cbl.leeds.ac.uk), CBLU, University of Leeds -->
<HTML>
<HEAD>
<TITLE>Preface </TITLE>
<META NAME="description" CONTENT="Preface ">
<META NAME="keywords" CONTENT="slug">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">
<LINK REL=STYLESHEET HREF="slug.css">
</HEAD>
<BODY LANG="EN" >
<A NAME="tex2html1984" HREF="node5.html"><IMG WIDTH=37 HEIGHT=24 ALIGN=BOTTOM ALT="next" SRC="http://www.netlib.org/utk/icons/next_motif.gif"></A> <A NAME="tex2html1982" HREF="node3.html"><IMG WIDTH=26 HEIGHT=24 ALIGN=BOTTOM ALT="up" SRC="http://www.netlib.org/utk/icons/up_motif.gif"></A> <A NAME="tex2html1976" HREF="node3.html"><IMG WIDTH=63 HEIGHT=24 ALIGN=BOTTOM ALT="previous" SRC="http://www.netlib.org/utk/icons/previous_motif.gif"></A> <A NAME="tex2html1986" HREF="node1.html"><IMG WIDTH=65 HEIGHT=24 ALIGN=BOTTOM ALT="contents" SRC="http://www.netlib.org/utk/icons/contents_motif.gif"></A> <A NAME="tex2html1987" HREF="node190.html"><IMG WIDTH=43 HEIGHT=24 ALIGN=BOTTOM ALT="index" SRC="http://www.netlib.org/utk/icons/index_motif.gif"></A> <BR>
<B> Next:</B> <A NAME="tex2html1985" HREF="node5.html">Suggestions for Reading</A>
<B>Up:</B> <A NAME="tex2html1983" HREF="node3.html">List of Tables</A>
<B> Previous:</B> <A NAME="tex2html1977" HREF="node3.html">List of Tables</A>
<BR> <P>
<H1><A NAME="SECTION03100000000000000000">Preface </A></H1>
<P>
Following the initial release of LAPACK and the emerging importance
of distributed memory computing<A NAME="213"> </A>, work began on
adapting LAPACK to distributed-memory architectures. Since porting software
efficiently from one distributed-memory architecture to another is a
challenging task, this work is an effort to establish standards for library
development in the varied world of distributed-memory computing.
<P>
ScaLAPACK is an acronym for Scalable Linear Algebra PACKage, or Scalable LAPACK.
As in LAPACK, the ScaLAPACK routines are based on block-partitioned algorithms
in
order to minimize the frequency of data movement between different levels of the
memory hierarchy. (For distributed-memory machines, the memory hierarchy
includes the off-processor memory of other processors, in addition to the
hierarchy of registers, cache, and local memory on each processor.) The
fundamental building block of the ScaLAPACK library is a distributed-memory
version of the Level 1, 2, and 3 BLAS, called the PBLAS (Parallel BLAS). The
PBLAS are in turn built on the BLAS for computation on single nodes and on a set
of Basic Linear Algebra Communication Subprograms (BLACS) for communication
tasks that arise frequently in parallel linear algebra computations. For optimal
performance, it is necessary, first, that the BLAS be implemented efficiently
on the target machine, and second, that an efficient version of the BLACS be
available.
<P>
Versions of the BLACS exist for both MPI and PVM, as well as versions for
the Intel series (NX), IBM SP series (MPL), and Thinking Machines CM-5 (CMMD).
A vendor-optimized version of the BLACS is available for the Cray T3 series.
Thus, ScaLAPACK is portable on any computer or network of computers
that supports MPI or PVM (as well as the aforementioned native message-passing
protocols).
<P>
Most of the ScaLAPACK code is written in standard Fortran 77; the
PBLAS and the BLACS are written in C, but with Fortran 77 interfaces.
<P>
The first ScaLAPACK software was written in 1989-1990, and the appearance of
the code has undergone many changes since then in our pursuit to
resemble and enable code reuse from LAPACK.
<P>
The first public release (version 1.0) of ScaLAPACK occurred on
February 28, 1995, and subsequent releases occurred in 1996.
<P>
The ScaLAPACK library is only one facet of the ``ScaLAPACK Project,''
which is a collaborative effort involving several institutions:
<UL>
<LI> Oak Ridge National Laboratory
<LI> Rice University
<LI> University of California, Berkeley
<LI> University of California, Los Angeles
<LI> University of Illinois, Champaign-Urbana
<LI> University of Tennessee, Knoxville
</UL>
and comprises four components:
<UL>
<LI> dense and band matrix software (ScaLAPACK)
<LI> large sparse eigenvalue software (P_ARPACK)
<LI> sparse direct systems software (CAPSS)
<LI> preconditioners for large sparse iterative solvers (ParPre)
</UL>
<P>
For further information on any of the related ScaLAPACK
projects, please refer to the scalapack index on <EM>netlib</EM>:
<BLOCKQUOTE> <TT>http://www.netlib.org/scalapack/index.html</TT>
</BLOCKQUOTE>
<P>
This users guide describes version 1.5 of the dense and band matrix
software package (ScaLAPACK).
<P>
The University of Tennessee, Knoxville, provided the routines
for the solution of dense, band, and tridiagonal linear systems of
equations, condition estimation and iterative refinement, for LU and
Cholesky factorization, matrix inversion, full-rank linear least
squares problems, orthogonal and generalized orthogonal factorizations,
orthogonal transformation routines, reductions to upper Hessenberg,
bidiagonal and tridiagonal form, and reduction of a symmetric-definite
generalized eigenproblem to standard form. And finally, the BLACS,
the PBLAS, and the HPF wrappers were also written at the University
of Tennessee, Knoxville.
<P>
The University of California, Berkeley, provided the routines for the
symmetric and generalized symmetric eigenproblem and the singular value
decomposition.
<P>
Greg Henry at Intel Corporation provided the routines for the
nonsymmetric eigenproblem.
<P>
Oak Ridge National Laboratory provided the out-of-core linear solvers
for LU, Cholesky, and QR factorizations.
<P>
ScaLAPACK has been incorporated into several commercial packages,
including the NAG Parallel Library, IBM Parallel ESSL, and Cray LIBSCI,
and is being integrated into the VNI IMSL Numerical
Library, as well as software libraries for
Fujitsu, Hewlett-Packard/Convex, Hitachi, and NEC. Additional
information can be found on the respective Web pages:
<BLOCKQUOTE> <TT>http://www.nag.co.uk:80/numeric/FM.html </TT> <BR>
<TT>http://www.rs6000.ibm.com/software/sp_products/esslpara.html </TT> <BR>
<TT>http://www.cray.com/PUBLIC/product-info/sw/PE/LibSci.html</TT> <BR>
<TT>http://www.sgi.com/Products/hardware/Power/ch_complib.html</TT><BR>
<TT>http://www.vni.com/products/imsl/index.html</TT>
</BLOCKQUOTE>
<P>
A number of technical reports have been written during the development of
ScaLAPACK and published as LAPACK Working Notes by the University of
Tennessee. Refer to the following URL for a complete set of
working notes:
<BLOCKQUOTE> <TT>http://www.netlib.org/lapack/lawns/index.html</TT>
</BLOCKQUOTE>
Many of these reports subsequently appeared as journal
articles. The Bibliography gives the most recent published reference.
<P>
As the distributed-memory<A NAME="233"> </A> version of LAPACK,
ScaLAPACK has drawn
heavily on the software and documentation standards set by LAPACK.
The test and timing software for the Level 2 and 3 BLAS was
used as a model for the PBLAS test and timing software, and the ScaLAPACK
test suite was patterned after the LAPACK test suite.
Because of the large amount of software, all BLACS, PBLAS, and ScaLAPACK
routines are maintained in basefiles whereby the codes can be re-extracted
as needed. Final formatting of the software
was done using Toolpack/1 [<A HREF="node189.html#Toolpack">105</A>].
<P>
We have tried to be consistent with our documentation and
coding style throughout ScaLAPACK in the hope that it will serve
as a model for other distributed-memory software development efforts.
ScaLAPACK has been designed as a source of building blocks for larger
parallel applications.
<P>
The development of ScaLAPACK was supported
in part by National Science Foundation Grant ASC-9005933; by
the Defense Advanced Research Projects Agency under contract
DAAH04-95-1-0077, administered by the Army Research Office;
by the Division of Mathematical, Information, and Computational
Sciences, of the U.S. Department of Energy, under Contract
DE-AC05-96OR22464; and
by the National Science
Foundation Science and Technology Center Cooperative
Agreement CCR-8809615.
<P>
The performance results presented in this book were obtained using
computer resources at various sites:
<UL>
<LI>
Cray T3E, located at Lawrence Berkeley National Laboratory, National
Energy Research Scientific Computing Center (NERSC), supported by
the Director, Office of Computational and Technology
Research, Division of Mathematical, Information, and Computational
Sciences of the U.S. Department of Energy under contract number 76SF00098.
<LI>
IBM SP-2, located at the Cornell Theory Center, which receives major
funding from the National Science Foundation (NSF) and New York State,
with additional support from the Defense Advanced Research Projects
Agency (DARPA),
the National Center for Research Resources at the National Institutes
of Health (NIH), IBM Corporation, and other members of the center's
Corporate Partnership Program.
<LI>
Intel MP Paragon XPS/35, located at Intel Corporation, Portland,
Oregon.
<LI>
Intel ASCI Option Red Supercomputer Technology located in Beaverton,
Oregon.
<LI>
Network of Sun Ultra Enterprise 2 (Model 2170s) workstations, located
in the Department of Computer Science at the University of Tennessee,
funded by National Science Foundation Grant CDA-9529459, the Center of
Excellence - Science Alliance, UT Networking Services, and the UT
Computer Science Department.
<LI>
Network of Sun UltraSPARC-1 workstations, located in the Department
of Computer Science at the University of California, Berkeley
supported by DARPA Grant F30602-95-C-0014, NSF Grants CCR-9257974
and PFF-CCR-9253705, as well as California MICRO Grants. Corporate
sponsors are: the AT&T Foundation, Digital Equipment Corporation,
Exabyte Corporation, Hewlett-Packard Company, Informix Software Inc,
Intel Corporation, International Business Machines, Internet Archive,
Microsoft Corporation, Mitsubishi Electric Research Laboratories,
Myricom Inc, Siemens Corporation, Sun Microsystems, Synoptics
Corporation, Tandem Corporation, and TIBCO Inc.
</UL>
<P>
The cover of this book was designed by Andy Cleary at Lawrence Livermore
National Laboratory.
<P>
We acknowledge with gratitude the support that we have received from the
following organizations, and the help of individual members of their staff:
Cornell Theory Center,
Cray Research, a Silicon Graphics Company,
IBM (Parallel ESSL Development and Research),
Lawrence Berkeley National Laboratory, National Energy Research Scientific
Computing Center (NERSC),
Maui High Performance Computer Center,
Minnesota Supercomputing Center,
NAG Ltd.,
and Oak Ridge National Laboratory Center for Computational Sciences (CCS).
<P>
We also thank the many, many people who have contributed code,
criticism, ideas and encouragement. We especially acknowledge
the contributions of
Mark Adams,
Peter Arbenz,
Scott Betts,
Shirley Browne,
Henri Casanova,
Soumen Chakrabarti,
Mishi Derakhshan,
Frederic Desprez,
Brett Ellis,
Ray Fellers,
Markus Hegland,
Nick Higham,
Adolfy Hoisie,
Velvel Kahan,
Xiaoye Li,
Bill Magro,
Osni Marques,
Paul McMahan,
Caroline Papadopoulos,
Beresford Parlett,
Loic Prylli,
Yves Robert,
Howard Robinson,
Tom Rowan,
Shilpa Singhal,
Françoise Tisseur,
Bernard Tourancheau,
Anne Trefethen,
Robert van de Geijn,
and
Andrey Zege.
<P>
We express appreciation to all those who helped in the preparation
of this work, in particular to Gail Pieper for her tireless efforts in
proofreading the draft and improving the quality of the presentation.
<P>
Finally, we thank all the test sites that received several test releases
of the ScaLAPACK software and that ran an extensive series of
test programs for us.
<P>
<IMG WIDTH=759 HEIGHT=69 ALIGN=BOTTOM ALT="tex2html_wrap12059" SRC="img1.gif">
<P>
<HR><A NAME="tex2html1984" HREF="node5.html"><IMG WIDTH=37 HEIGHT=24 ALIGN=BOTTOM ALT="next" SRC="http://www.netlib.org/utk/icons/next_motif.gif"></A> <A NAME="tex2html1982" HREF="node3.html"><IMG WIDTH=26 HEIGHT=24 ALIGN=BOTTOM ALT="up" SRC="http://www.netlib.org/utk/icons/up_motif.gif"></A> <A NAME="tex2html1976" HREF="node3.html"><IMG WIDTH=63 HEIGHT=24 ALIGN=BOTTOM ALT="previous" SRC="http://www.netlib.org/utk/icons/previous_motif.gif"></A> <A NAME="tex2html1986" HREF="node1.html"><IMG WIDTH=65 HEIGHT=24 ALIGN=BOTTOM ALT="contents" SRC="http://www.netlib.org/utk/icons/contents_motif.gif"></A> <A NAME="tex2html1987" HREF="node190.html"><IMG WIDTH=43 HEIGHT=24 ALIGN=BOTTOM ALT="index" SRC="http://www.netlib.org/utk/icons/index_motif.gif"></A> <BR>
<B> Next:</B> <A NAME="tex2html1985" HREF="node5.html">Suggestions for Reading</A>
<B>Up:</B> <A NAME="tex2html1983" HREF="node3.html">List of Tables</A>
<B> Previous:</B> <A NAME="tex2html1977" HREF="node3.html">List of Tables</A>
<P><ADDRESS>
<I>Susan Blackford <BR>
Tue May 13 09:21:01 EDT 1997</I>
</ADDRESS>
</BODY>
</HTML>
|