File: mca.rst

package info (click to toggle)
openmpi 5.0.8-3
  • links: PTS, VCS
  • area: main
  • in suites:
  • size: 201,692 kB
  • sloc: ansic: 613,078; makefile: 42,353; sh: 11,194; javascript: 9,244; f90: 7,052; java: 6,404; perl: 5,179; python: 1,859; lex: 740; fortran: 61; cpp: 20; tcl: 12
file content (339 lines) | stat: -rw-r--r-- 13,501 bytes parent folder | download | duplicates (12)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
.. _label-mca:

The Modular Component Architecture (MCA)
========================================

PMIx is a highly-customizable system; it can be configured via
configuration files, command line parameters, and environment
variables.  The main functionality of PMIx's configuration system
is through the Modular Component Architecture (MCA).

* This section describes the MCA itself and how to set MCA parameters at
  run time.
* Later sections in this documentation describe different parts of
  PMIx's functionality, and the specific names and values of MCA
  parameters that can be used to affect PMIx's behavior.

/////////////////////////////////////////////////////////////////////////

Terminology
-----------

The Modular Component Architecture (MCA) is the backbone for much of
PMIx's functionality.  It is a series of *frameworks*,
*components*, and *modules* that are assembled at run-time to create
the PMIx implementation.

MCA *parameters* (also known as MCA *variables*) are used to customize
PMIx's behavior at run-time.

Each of these entities are described below.

Frameworks
^^^^^^^^^^

An MCA framework manages zero or more components at run-time and is
targeted at a specific task (e.g., providing support for network
functionality).  Although each MCA framework supports only a single
type of component, it may support multiple components of that type.

Some of the more common frameworks that users may want or need to
customize include the following:

* ``ptl``: PMIx Transport Layer; users may need to assist the
  messaging layer with selection of network interfaces and
  controls parameters
* ``pmdl``: Programming Model and Library; identify environmental
  variables to be forwarded to application processes

There are many frameworks within PMIx; the exact set varies
between different versions of PMIx.  You can use the
:ref:`pmix_info(1) <man1-pmix_info>` command to see the full list of
frameworks that are included in PMIx |opmix_ver|.

Components
^^^^^^^^^^

An MCA component is an implementation of a framework's formal
interface.  It is a standalone collection of code that can be bundled
into a plugin that can be inserted into the PMIx code base, either
at run-time and/or compile-time.

.. note:: Good synonyms for PMIx's "component" concept are
          "plugin", or "add-on".

The exact set of components varies between different versions of Open
MPI.  PMIx's code base includes support for many components, but
not all of them may be present or available on your system.  You can
use the :ref:`pmix_info(1) <man1-pmix_info>` command to see what
components are included in PMIx |opmix_ver| on your system.

Modules
^^^^^^^

An MCA module (frequently called a ``plugin``) is an instance of a
component.  While it is possible for a component to have multiple
active plugins, it would be a very rare component that did so.

Parameters (variables)
^^^^^^^^^^^^^^^^^^^^^^

MCA *parameters* (sometimes called MCA *variables*) are the basic unit
of run-time tuning for PMIx.  They are simple "key = value" pairs
that are used extensively throughout PMIx.  The general rules of
thumb that the developers use are:

#. Instead of using a constant for an important value, make it an MCA
   parameter so users can adjust it if necessary.
#. If a task can be implemented in multiple, user-discernible ways,
   implement as many as possible, and use an MCA parameter to
   choose between them at run-time.

For example, the PMIx server sets a maximum value on the number of
events it will cache. Events received after that point cause the
oldest events in the cache to be released. This allows the server
to provide a process with events it registers to receive, even if
the registration occurs after the event has been received by the
server - thereby reducing the likelihood of a process missing an
event due to a race condition. Applications that generate a large
volume of events (e.g., an application relying on the PMIx Group
invite/join method for creating collections of processes) might
require a significantly larger cache than the default size.

/////////////////////////////////////////////////////////////////////////

.. _label-running-setting-mca-param-values:

Setting MCA parameter values
----------------------------

MCA parameters may be set in several different ways.

.. admonition:: Rationale
   :class: tip

   Having multiple methods to set MCA parameters allows, for example,
   system administrators to fine-tune the PMIx installation for
   their hardware / environment such that normal users can simply use
   the default values (that were set by the system administrators).

   HPC environments |mdash| and the applications that run on them
   |mdash| tend to be unique.  Providing extensive run-time tuning
   capabilities through MCA parameters allows the customization of
   PMIx to each system's / user's / application's particular
   needs.

The following are the different methods to set MCA parameters, listed
in priority order:

#. Environment variables
#. Configuration files


Environment variables
^^^^^^^^^^^^^^^^^^^^^

Environment variables are given the highest priority.  Any environment variable
named ``PMIX_MCA_<param_name>`` will be examined for use. Note that
misspelling of names will cause the value to be ignored - i.e.,
PMIx does not report unrecognized MCA parameters, it simply ignores
them.

.. note:: Just like with command line values, setting environment
          variables to values with multiple words requires shell
          quoting, such as:

          .. code-block:: sh

             shell$ export OMPI_MCA_param="value with multiple words"


Configuration files
^^^^^^^^^^^^^^^^^^^

Simple configuration text files can also be used to set MCA
parameter values.  Parameters are set one per line (comments are
permitted).  For example:

.. code-block:: ini

   # This is a comment
   # Set an MCA parameter
   mca_component_show_load_errors = 1

Note that quotes are *not* necessary for setting multi-word values
in MCA parameter files.  Indeed, if you use quotes in the MCA
parameter file, they will be used as part of the value itself.  For
example:

.. code-block:: ini

   # The following two values are different:
   param1 = value with multiple words
   param2 = "value with multiple words"

By default, two files are searched (in order):

#. ``$HOME/.pmix/mca-params.conf``: The user-supplied set of
   values has the higher precedence.
#. ``$prefix/etc/pmix-mca-params.conf``: The system-supplied set
   of values has a lower precedence.

More specifically, the MCA parameter ``mca_param_files`` specifies a
colon-delimited path of files to search for MCA parameters.  Files to
the left have lower precedence; files to the right are higher
precedence.

.. note:: Keep in mind that, just like components, these parameter
          files are *only* relevant where they are "visible". Typically,
          these files are read by the host daemon responsible for
          launching an application and then forwarded to all
          daemons (and their child application processes) in their
          environment.

.. warning:: Setting PMIx MCA parameters via configuration files
             entails editing (by default) the following files:

             ``$HOME/.pmix/mca-params.conf`` or
             ``$prefix/etc/pmix-mca-params.conf``

/////////////////////////////////////////////////////////////////////////

.. _label-running-selecting-framework-components:

Selecting which PMIx components are used at run time
--------------------------------------------------------

Each MCA framework has a top-level MCA parameter that helps guide
which components are selected to be used at run-time.  Specifically,
every framework has an MCA parameter of the same name that can be used
to *include* or *exclude* components from a given run.

For example, the ``pmdl`` MCA parameter can used to control which PMDL
components are used.  It takes a comma-delimited list of component
names, and may be optionally prefixed with ``^``.  For example:

.. note:: The Programming Model ``PMDL`` framework provides support
  for a range of programming models and libraries, including collection
  of default parameters and environmental variables for forwarding and
  setting of library-specific environmental variables

.. code-block:: sh

   # Tell PMIx to include *only* the PMDL components listed here and
   # implicitly ignore all the rest:
   export PMIX_MCA_pmdl=ompi,oshmem ...

   # Tell PMIx to exclude the ompi and oshmem PMDL components
   # and implicitly include all the rest
   export PMIX_MCA_pmdl=^ompi,oshmem ...

Note that ``^`` can *only* be the prefix of the *entire*
comma-delimited list because the inclusive and exclusive behavior are
mutually exclusive.  Specifically, since the exclusive behavior means
"use all components *except* these", it does not make sense to mix it
with the inclusive behavior of not specifying it (i.e., "use all of
these components").  Hence, something like this:

.. code-block:: sh

   export PMIX_MCA_pmdl=ompi,^oshmem ...

does not make sense |mdash| and will cause an error |mdash| because it
says "use only the ``ompi`` component" but
also "use all components except ``oshmem``".  These two statements
clearly contradict each other.

/////////////////////////////////////////////////////////////////////////

Common MCA parameters
---------------------

PMIx has a *large* number of MCA parameters available.  Users can
use the :ref:`pmix_info(1) <man1-pmix_info>` command to see *all*
available MCA parameters.

The vast majority of these MCA parameters, however, are not useful to
most users.  Although the full list of MCA parameters can be found in the output of
``pmix_info(1)``, the following list of commonly-used parameters is
presented here so that they can easily be found via internet searches:

* Individual framework names with the ``_base_verbose`` suffix
  appended (e.g., ``ptl_base_verbose``, ``pmdl_base_verbose``, etc.)
  can be used to set the general verbosity level of all the components
  in that framework.

  * This can be helpful when troubleshooting why certain components
    are or are not being selected at run time.

* The PMIx Transport Layer supports "include" and "exclude"
  types of components (e.g., ``ptl_tcp_if_include`` and
  ``ptl_tcp_if_exclude``).  The "include" parameters specify an
  explicit set of network interfaces to use; the "exclude" parameters
  specify an explicit set of network interfaces to ignore.  Check the
  output from :ref:`pmix_info(1)'s <man1-pmix_info>` to see the full list
  of PTL-related parameters.

  .. important:: You can only use the "include" *or* the "exclude"
                 parameter |mdash| they are mutually exclusive from each
                 other.
* ``mca_base_component_show_load_errors``: By default, PMIx
  emits a warning message if it fails to open a DSO component at run
  time.  This typically happens when a shared library that the DSO
  requires is not available.

  .. admonition:: Rationale
     :class: tip

     In prior versions of PMIx, components defaulted to building
     as DSOs (vs. being included in the parent library,
     ``libpmix.so``).  On misconfigured systems, sometimes libraries
     required by various components would not be present, thereby causing
     those components to fail to open at run time.

     Having PMIx warn about such failures to load was useful
     because it alerted users to the misconfiguration.

  .. note:: By default, PMIx |opmix_ver| includes all components in
            its base libraries (e.g., on Linux, ``libpmix.so`` includes
            all the components that were built with PMIx, and
            therefore no component need to be opened dynamically), and
            does not build its components as DSOs.

            This MCA parameter *only* affects the behavior when a
            component DSO fails to open.

  This MCA parameter can take four general values:

  #. ``yes`` or a boolean "true" value (e.g., ``1``): PMIx will
     emit a warning about every component DSO that fails to load.

  #. ``no`` or a boolean "false" value (e.g., ``0``): PMIx will
     never emit warnings about component DSOs that fail to load.

  #. A comma-delimited list of frameworks and/or components: PMIx
     will emit a warning about any dynamic component that fails to
     open and matches a token in the list. "Match" is defined as:

     * If a token in the list is only a framework name, then any
       component in that framework will match.
     * If a token in the list specifies both a framework name and a
       component name (in the form ``framework/component``), then
       only the specified component in the specified framework will
       match.

     For example, if the value of this MCA parameter is
     ``pmdl,pnet/opa``, then PMIx will warn if any component in
     the PMDL framework or if the OPA PNET component fails to load at run
     time.

  #. The value can also be a ``^`` character followed by a
     comma-delimited list of ``framework[/component]`` values: This
     is similar to the comma-delimited list of tokens, except it will
     only emit warnings about dynamic components that fail to load
     and do *not* match a token in the list.

     For example, if the value of this MCA parameter is
     ``^pmdl,pnet/opa``, then PMIx will only warn about the
     failure to load DSOs that are neither in the PMDL
     framework nor are the OPA PNET component.