File: node125.html

package info (click to toggle)
scalapack-doc 1.5-11
  • links: PTS
  • area: main
  • in suites: bullseye, buster, stretch
  • size: 10,336 kB
  • ctags: 4,931
  • sloc: makefile: 47; sh: 18
file content (64 lines) | stat: -rw-r--r-- 4,133 bytes parent folder | download | duplicates (4)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<!--Converted with LaTeX2HTML 96.1-h (September 30, 1996) by Nikos Drakos (nikos@cbl.leeds.ac.uk), CBLU, University of Leeds -->
<HTML>
<HEAD>
<TITLE>Identify Performance Bottlenecks</TITLE>
<META NAME="description" CONTENT="Identify Performance Bottlenecks">
<META NAME="keywords" CONTENT="slug">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">
<LINK REL=STYLESHEET HREF="slug.css">
</HEAD>
<BODY LANG="EN" >
 <A NAME="tex2html3773" HREF="node126.html"><IMG WIDTH=37 HEIGHT=24 ALIGN=BOTTOM ALT="next" SRC="http://www.netlib.org/utk/icons/next_motif.gif"></A> <A NAME="tex2html3771" HREF="node120.html"><IMG WIDTH=26 HEIGHT=24 ALIGN=BOTTOM ALT="up" SRC="http://www.netlib.org/utk/icons/up_motif.gif"></A> <A NAME="tex2html3765" HREF="node124.html"><IMG WIDTH=63 HEIGHT=24 ALIGN=BOTTOM ALT="previous" SRC="http://www.netlib.org/utk/icons/previous_motif.gif"></A> <A NAME="tex2html3775" HREF="node1.html"><IMG WIDTH=65 HEIGHT=24 ALIGN=BOTTOM ALT="contents" SRC="http://www.netlib.org/utk/icons/contents_motif.gif"></A> <A NAME="tex2html3776" HREF="node190.html"><IMG WIDTH=43 HEIGHT=24 ALIGN=BOTTOM ALT="index" SRC="http://www.netlib.org/utk/icons/index_motif.gif"></A> <BR>
<B> Next:</B> <A NAME="tex2html3774" HREF="node126.html">Performance Bottlenecks in the </A>
<B>Up:</B> <A NAME="tex2html3772" HREF="node120.html">Performance Evaluation</A>
<B> Previous:</B> <A NAME="tex2html3766" HREF="node124.html">Determine Whether Reasonable Performance </A>
<BR> <P>
<H2><A NAME="SECTION04535000000000000000">Identify Performance Bottlenecks</A></H2>
        <A NAME="subsecperfid">&#160;</A>
<A NAME="4226">&#160;</A><A NAME="4227">&#160;</A>
<P>
The formulas mentioned in section&nbsp;<A HREF="node123.html#subsecestim">5.3.3</A>, in addition to
providing an estimate of performance, can help one identify whether
the performance is limited by computation, by the number 
of messages, or by the volume of communication.  Even if the estimate
is far from correct, the user  may get some information about the
performance bottleneck by studying the computation and communication
estimates provided by those formulas.
<P>
Comparing the execution times of a problem of size <I>N</I> and one of size <I>N</I>/2
may also provide insight into the performance of the
ScaLAPACK routine being used.  Let <IMG WIDTH=18 HEIGHT=23 ALIGN=MIDDLE ALT="tex2html_wrap_inline12212" SRC="img31.gif"> and <IMG WIDTH=30 HEIGHT=23 ALIGN=MIDDLE ALT="tex2html_wrap_inline12220" SRC="img33.gif"> be the time required
for a problem of size <I>N</I> and size <I>N</I>/2, respectively, on <I>P</I> processors.
<P>
<UL>
<LI> If <IMG WIDTH=87 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17099" SRC="img419.gif">, 
	the physical memory of each node may be exceeded.
<LI> If <IMG WIDTH=84 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17101" SRC="img420.gif">, the performance 
	may be limited by the rate at which flops 
	are performed.  If the flop rate 
	is significantly less than expected, 
	the user should check the data distribution (try the standard data
	distribution suggested in section&nbsp;<A HREF="node106.html#distmemcomp">5.1.1</A>) 
	and the underlying BLAS.
<LI> If <IMG WIDTH=84 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17103" SRC="img421.gif">, the major performance
       factor may be bandwidth (<IMG WIDTH=28 HEIGHT=27 ALIGN=MIDDLE ALT="tex2html_wrap_inline12064" SRC="img2.gif">). This is
       what one should obtain for medium values of <I>N</I>.
<LI> If <IMG WIDTH=84 HEIGHT=24 ALIGN=MIDDLE ALT="tex2html_wrap_inline17109" SRC="img422.gif">, the major performance
       factor may be latency (<IMG WIDTH=18 HEIGHT=23 ALIGN=MIDDLE ALT="tex2html_wrap_inline12208" SRC="img30.gif">). This is what
       one should obtain for small values of <I>N</I>.
</UL>
This performance
analysis suggests which computer characteristic
is most likely limiting the performance.  It 
cannot say whether 
one is getting good performance.
<P>
<BR> <HR>
<P><ADDRESS>
<I>Susan Blackford <BR>
Tue May 13 09:21:01 EDT 1997</I>
</ADDRESS>
</BODY>
</HTML>