1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64
|
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<!--Converted with LaTeX2HTML 96.1-h (September 30, 1996) by Nikos Drakos (nikos@cbl.leeds.ac.uk), CBLU, University of Leeds -->
<HTML>
<HEAD>
<TITLE>Identify Performance Bottlenecks</TITLE>
<META NAME="description" CONTENT="Identify Performance Bottlenecks">
<META NAME="keywords" CONTENT="slug">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">
<LINK REL=STYLESHEET HREF="slug.css">
</HEAD>
<BODY LANG="EN" >
<A NAME="tex2html3773" HREF="node126.html"><IMG WIDTH=37 HEIGHT=24 ALIGN=BOTTOM ALT="next" SRC="http://www.netlib.org/utk/icons/next_motif.gif"></A> <A NAME="tex2html3771" HREF="node120.html"><IMG WIDTH=26 HEIGHT=24 ALIGN=BOTTOM ALT="up" SRC="http://www.netlib.org/utk/icons/up_motif.gif"></A> <A NAME="tex2html3765" HREF="node124.html"><IMG WIDTH=63 HEIGHT=24 ALIGN=BOTTOM ALT="previous" SRC="http://www.netlib.org/utk/icons/previous_motif.gif"></A> <A NAME="tex2html3775" HREF="node1.html"><IMG WIDTH=65 HEIGHT=24 ALIGN=BOTTOM ALT="contents" SRC="http://www.netlib.org/utk/icons/contents_motif.gif"></A> <A NAME="tex2html3776" HREF="node190.html"><IMG WIDTH=43 HEIGHT=24 ALIGN=BOTTOM ALT="index" SRC="http://www.netlib.org/utk/icons/index_motif.gif"></A> <BR>
<B> Next:</B> <A NAME="tex2html3774" HREF="node126.html">Performance Bottlenecks in the </A>
<B>Up:</B> <A NAME="tex2html3772" HREF="node120.html">Performance Evaluation</A>
<B> Previous:</B> <A NAME="tex2html3766" HREF="node124.html">Determine Whether Reasonable Performance </A>
<BR> <P>
<H2><A NAME="SECTION04535000000000000000">Identify Performance Bottlenecks</A></H2>
<A NAME="subsecperfid"> </A>
<A NAME="4226"> </A><A NAME="4227"> </A>
<P>
The formulas mentioned in section <A HREF="node123.html#subsecestim">5.3.3</A>, in addition to
providing an estimate of performance, can help one identify whether
the performance is limited by computation, by the number
of messages, or by the volume of communication. Even if the estimate
is far from correct, the user may get some information about the
performance bottleneck by studying the computation and communication
estimates provided by those formulas.
<P>
Comparing the execution times of a problem of size <I>N</I> and one of size <I>N</I>/2
may also provide insight into the performance of the
ScaLAPACK routine being used. Let <IMG WIDTH=18 HEIGHT=23 ALIGN=MIDDLE ALT="tex2html_wrap_inline12212" SRC="img31.gif"> and <IMG WIDTH=30 HEIGHT=23 ALIGN=MIDDLE ALT="tex2html_wrap_inline12220" SRC="img33.gif"> be the time required
for a problem of size <I>N</I> and size <I>N</I>/2, respectively, on <I>P</I> processors.
<P>
<UL>
<LI> If <IMG WIDTH=87 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17099" SRC="img419.gif">,
the physical memory of each node may be exceeded.
<LI> If <IMG WIDTH=84 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17101" SRC="img420.gif">, the performance
may be limited by the rate at which flops
are performed. If the flop rate
is significantly less than expected,
the user should check the data distribution (try the standard data
distribution suggested in section <A HREF="node106.html#distmemcomp">5.1.1</A>)
and the underlying BLAS.
<LI> If <IMG WIDTH=84 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17103" SRC="img421.gif">, the major performance
factor may be bandwidth (<IMG WIDTH=28 HEIGHT=27 ALIGN=MIDDLE ALT="tex2html_wrap_inline12064" SRC="img2.gif">). This is
what one should obtain for medium values of <I>N</I>.
<LI> If <IMG WIDTH=84 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17109" SRC="img422.gif">, the major performance
factor may be latency (<IMG WIDTH=18 HEIGHT=23 ALIGN=MIDDLE ALT="tex2html_wrap_inline12208" SRC="img30.gif">). This is what
one should obtain for small values of <I>N</I>.
</UL>
This performance
analysis suggests which computer characteristic
is most likely limiting the performance. It
cannot say whether
one is getting good performance.
<P>
<BR> <HR>
<P><ADDRESS>
<I>Susan Blackford <BR>
Tue May 13 09:21:01 EDT 1997</I>
</ADDRESS>
</BODY>
</HTML>
|