<html lang="en">
<head>
<title>Advanced distributed-transpose interface - FFTW 3.3.2</title>
<meta http-equiv="Content-Type" content="text/html">
<meta name="description" content="FFTW 3.3.2">
<meta name="generator" content="makeinfo 4.13">
<link title="Top" rel="start" href="index.html#Top">
<link rel="up" href="FFTW-MPI-Transposes.html#FFTW-MPI-Transposes" title="FFTW MPI Transposes">
<link rel="prev" href="Basic-distributed_002dtranspose-interface.html#Basic-distributed_002dtranspose-interface" title="Basic distributed-transpose interface">
<link rel="next" href="An-improved-replacement-for-MPI_005fAlltoall.html#An-improved-replacement-for-MPI_005fAlltoall" title="An improved replacement for MPI_Alltoall">
<link href="http://www.gnu.org/software/texinfo/" rel="generator-home" title="Texinfo Homepage">
<!--
This manual is for FFTW
(version 3.3.2, 28 April 2012).
Copyright (C) 2003 Matteo Frigo.
Copyright (C) 2003 Massachusetts Institute of Technology.
Permission is granted to make and distribute verbatim copies of
this manual provided the copyright notice and this permission
notice are preserved on all copies.
Permission is granted to copy and distribute modified versions of
this manual under the conditions for verbatim copying, provided
that the entire resulting derived work is distributed under the
terms of a permission notice identical to this one.
Permission is granted to copy and distribute translations of this
manual into another language, under the above conditions for
modified versions, except that this permission notice may be
stated in a translation approved by the Free Software Foundation.
-->
<meta http-equiv="Content-Style-Type" content="text/css">
<style type="text/css"><!--
pre.display { font-family:inherit }
pre.format { font-family:inherit }
pre.smalldisplay { font-family:inherit; font-size:smaller }
pre.smallformat { font-family:inherit; font-size:smaller }
pre.smallexample { font-size:smaller }
pre.smalllisp { font-size:smaller }
span.sc { font-variant:small-caps }
span.roman { font-family:serif; font-weight:normal; }
span.sansserif { font-family:sans-serif; font-weight:normal; }
--></style>
</head>
<body>
<div class="node">
<a name="Advanced-distributed-transpose-interface"></a>
<a name="Advanced-distributed_002dtranspose-interface"></a>
<p>
Next: <a rel="next" accesskey="n" href="An-improved-replacement-for-MPI_005fAlltoall.html#An-improved-replacement-for-MPI_005fAlltoall">An improved replacement for MPI_Alltoall</a>,
Previous: <a rel="previous" accesskey="p" href="Basic-distributed_002dtranspose-interface.html#Basic-distributed_002dtranspose-interface">Basic distributed-transpose interface</a>,
Up: <a rel="up" accesskey="u" href="FFTW-MPI-Transposes.html#FFTW-MPI-Transposes">FFTW MPI Transposes</a>
<hr>
</div>
<h4 class="subsection">6.7.2 Advanced distributed-transpose interface</h4>
<p>The basic transpose interface handles a matrix of numbers (of type
<code>double</code>), using FFTW's default block sizes. More generally, one
can perform transposes of <em>tuples</em> of numbers, with
user-specified block sizes for the input and output:
<pre class="example">     fftw_plan fftw_mpi_plan_many_transpose
                     (ptrdiff_t n0, ptrdiff_t n1, ptrdiff_t howmany,
                      ptrdiff_t block0, ptrdiff_t block1,
                      double *in, double *out, MPI_Comm comm, unsigned flags);
</pre>
<p><a name="index-fftw_005fmpi_005fplan_005fmany_005ftranspose-403"></a>
In this case, one is transposing an <code>n0</code> by <code>n1</code> matrix of
<code>howmany</code>-tuples (e.g. <code>howmany = 2</code> for complex numbers).
The input is distributed along the <code>n0</code> dimension with block size
<code>block0</code>, and the <code>n1</code> by <code>n0</code> output is distributed
along the <code>n1</code> dimension with block size <code>block1</code>. If
<code>FFTW_MPI_DEFAULT_BLOCK</code> (0) is passed for a block size, then FFTW
uses its default block size for that dimension. To get the local size of the
data on each process, call <code>fftw_mpi_local_size_many_transposed</code>
with the same block sizes.
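<p>To make the tuple semantics concrete, here is a minimal serial sketch of
what the transpose does to the global array. It assumes row-major storage with
the <code>howmany</code> components of each tuple stored contiguously. The
function name <code>transpose_tuples_ref</code> is hypothetical and is not part
of FFTW; the sketch ignores the distribution across processes entirely.</p>

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical serial reference (not an FFTW routine).
 * Assumes row-major layout with each howmany-tuple stored contiguously:
 *   in  is n0 x n1 x howmany,
 *   out is n1 x n0 x howmany.
 * Out-of-place only: in and out must not overlap. */
void transpose_tuples_ref(ptrdiff_t n0, ptrdiff_t n1, ptrdiff_t howmany,
                          const double *in, double *out)
{
    ptrdiff_t i, j, k;

    assert(n0 >= 0 && n1 >= 0 && howmany >= 1);
    assert(in != out);

    for (i = 0; i < n0; ++i)
        for (j = 0; j < n1; ++j)
            for (k = 0; k < howmany; ++k)
                /* tuple (i, j) of the input becomes tuple (j, i) of the output;
                 * the order of components inside a tuple is unchanged */
                out[(j * n0 + i) * howmany + k] = in[(i * n1 + j) * howmany + k];
}
```

<p>For example, with <code>n0 = 2</code>, <code>n1 = 3</code>, and
<code>howmany = 2</code>, the pair stored at row 1, column 0 of the input ends
up at row 0, column 1 of the output, with its two components in the same
order.</p>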
<a name="index-FFTW_005fMPI_005fDEFAULT_005fBLOCK-404"></a><a name="index-fftw_005fmpi_005flocal_005fsize_005fmany_005ftransposed-405"></a>
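<p>The effect of a block size on the distribution can be sketched with a few
lines of plain C. The helpers below are hypothetical and are not part of FFTW.
They assume that each process owns one contiguous block of the distributed
dimension, and that the default block size is the dimension divided by the
number of processes, rounded up. In a real program the authoritative values
come from <code>fftw_mpi_local_size_many_transposed</code>.</p>

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not an FFTW routine).
 * Assumption: the default block size is ceil(n / nprocs). */
ptrdiff_t default_block(ptrdiff_t n, int nprocs)
{
    assert(n >= 0 && nprocs > 0);
    return (n + nprocs - 1) / nprocs;
}

/* Hypothetical helper (not an FFTW routine).
 * Returns how many indices of a dimension of length n belong to `rank`
 * when the dimension is cut into contiguous blocks of size `block`,
 * and stores the first owned global index in *local_start.
 * For a process that owns nothing, this sketch sets *local_start to n;
 * that choice is ours, not necessarily what FFTW reports. */
ptrdiff_t local_extent(ptrdiff_t n, ptrdiff_t block, int rank,
                       ptrdiff_t *local_start)
{
    ptrdiff_t start;

    assert(n >= 0 && block > 0 && rank >= 0);
    assert(local_start != NULL);

    start = block * (ptrdiff_t)rank;
    if (start >= n) {           /* ranks past the end of the data own nothing */
        *local_start = n;
        return 0;
    }
    *local_start = start;
    return (n - start < block) ? n - start : block;  /* last block may be short */
}
```

<p>Under these assumptions, a dimension of length 10 split over 4 processes
gets a block size of 3, so the processes own 3, 3, 3, and 1 indices
respectively. A larger user-specified block size concentrates the data on
fewer processes and can leave the highest ranks with no data at all.</p>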
</body></html>