File: 100_modulefile_examples.rst

package info (click to toggle)
lmod 8.7.60-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid, trixie
  • size: 63,000 kB
  • sloc: sh: 6,266; makefile: 2,837; ansic: 1,513; tcl: 1,382; python: 1,050; csh: 112
file content (339 lines) | stat: -rw-r--r-- 14,280 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
Modulefile Examples from simple to complex
==========================================

Most of the time a modulefile is just a collection of setting
environment variables and prepending to PATH or other path like
variables. However, modulefiles are actually programs so you can
do a great deal if necessary.

Here we show some of the techniques that site's or user's might use in
their modulefiles.  In addition, there are many examples in the Lmod
source tree.  The ``rt`` directory contains the regression testing
suite.  Each subdirectory in ``rt`` is a separate test and below that
there will be many modulefiles.  For example see all the modulefile
associated with the load test can be found in ``rt/load/mf``.

Some simple modulefiles
~~~~~~~~~~~~~~~~~~~~~~~

An application modulefile might add to ``$PATH``, set a few other
environment variables and provide help message as well as ``whatis()``
strings.  For example the valgrind memory usage tester might look
like::

    help([[
    To use the valgrind utility on an executable called a.out:

    valgrind ./a.out
    ]])

    local version = "3.7.0"
    local base    = pathJoin("/apps/valgrind",version)
    prepend_path("PATH", pathJoin(base,"bin"))  -- /app/valgrind/3.7.0/bin
    setenv(      "SITE_VALGRIND_DIR", base)
    setenv(      "SITE_VALGRIND_INC", pathJoin(base,"include"))
    setenv(      "SITE_VALGRIND_LIB", pathJoin(base,"lib"))

    whatis("Name: ".. pkgName)
    whatis("Version: " .. fullVersion)
    whatis("Category: tools")
    whatis("URL: http://www.valgrind.org")
    whatis("Description: memory usage tester")


A library module might look like::

    help([[
    ... 
    ]])
    whatis("Name: boost")
    whatis("Version: 1.64")
    whatis("Category: Lmod/Modulefiles")
    whatis("Keywords: System, Library, C++")
    whatis("URL: http://www.boost.org")
    whatis("Description: Boost provides free peer-reviewed portable C++ source libraries.")

    setenv("SITE_BOOST_DIR","/apps/intel17/boost/1.64")
    setenv("SITE_BOOST_LIB","/apps/intel17/boost/1.64/lib")
    setenv("SITE_BOOST_INC","/apps/intel17/boost/1.64/include")
    setenv("SITE_BOOST_BIN","/apps/intel17/boost/1.64/bin")
    setenv("BOOST_ROOT","/apps/intel17/boost/1.64")
    prepend_path("LD_LIBRARY_PATH","/apps/intel17/boost/1.64/lib")
    prepend_path("PATH","/apps/intel17/boost/1.64/bin")


.. _generic_modules-label:

Introspection
~~~~~~~~~~~~~

Lmod provides inspection functions that describe the name
and version of a modulefile as well as the path to the modulefile.
These functions provide a way to write "generic" modulefiles,
i.e. modulefiles that can fill in its values based on the location of the
file itself.

These ideas work best in the software hierarchy style of modulefiles.
For example: suppose the following is a modulefile for Git.  Its
modulefile is located in the "/apps/mfiles/Core/git" directory and
software is installed in "/apps/git/<version>".  The following
modulefile would work for every version of git::

   local pkg = pathJoin("/apps",myModuleName(),myModuleVersion())
   local bin = pathJoin(pkg,"bin"))
   prepend_path("PATH",bin)

   whatis("Name:        ", myModuleName())
   whatis("Version:     ", myModuleVersion())
   whatis("Description: ", "Git is a fast distributive version control system")

The contents of this modulefile can be used for multiple versions of
the git software, because the local variable bin changes the location
of the bin directory to match the version of the used as the name of
the file.  So if the module file is in
`/apps/mfiles/Core/git/2.3.4.lua` then the local variable `bin` will
be `/apps/git/2.3.4`.


Relative Paths
~~~~~~~~~~~~~~

Suppose you are interested in modules where the module and application
location are relative. Suppose that you have an $APPS directory, and
below that you have modulefiles and packages, and you would like the
modulefiles to find the absolute path of the package location. This
can be done with the ``myFileName()`` function and some lua code::

     local fn      = myFileName()                      -- 1
     local full    = myModuleFullName()                -- 2
     local loc     = fn:find(full,1,true)-2            -- 3
     local mdir    = fn:sub(1,loc)                     -- 4
     local appsDir = mdir:gsub("(.*)/","%1")           -- 5
     local pkg     = pathJoin(appsDir, full)           -- 6


To make this example concrete, let's assume that applications are in
``/home/user/apps`` and the modulefiles are in ``/home/user/apps/mfiles``.
So if the modulefile is located at
``/home/user/apps/mfiles/git/1.2.lua``,
then that is the value of ``fn`` at line 1.  The ``full`` variable at
line 2 will have ``git/1.2``.  What we want is to remove the name of
the modulefile and find its parent directory.  So we use Lua string
member function on ``fn`` to find where ``full`` starts.  In most cases
``fn:find(full)`` would work to find where the "git" starts in ``fn``
The trouble is that the Lua find function is expecting a regular
expression and in particular ``.`` and ``-`` are regular expression
characters.  So here we are using ``fn:find(full,1,true)`` to tell Lua
to treat each character as is with no special meaning.

Line 3 also subtracts 2.  The find command reports the location of the
start of the string where the "g" in "git" is, We want the value of
``mdir`` to be ``/home/user/apps/mfiles`` so we need to subtract 2.
This makes ``mdir`` have the right value.  One note is that Lua is a
one based language, so locations in strings start at one.

It was important for the value of ``mdir`` to remove the trailing
``/`` so that line 5 will do its magic.  We want the parent directory
of ``mdir``, so the regular expressions says greedily grab every
character until the trailing ``/`` and the ``%1`` says to capture the
string found in and use that to set ``appsdir`` to
``/home/user/apps``.  Finally we wish to set ``pkg`` to the location
of the actual application so we combine the value of ``appsdir`` and
``full`` to set ``pkg`` to ``/home/user/apps/git/1.2``.

The nice thing about this Lua code is that it figures out the location
of the package no matter where it is, as long as the relation between
apps directories and modulefiles is consistent.

Creating modules like this can be complicated. See
:ref:`debugging_modulefiles-label` for helpful tips.


Generic Modules with the Hierarchy
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

This works great for Core modules. It is a little more complicated for
Compiler or MPI/Compiler dependent modules but quite useful. For a
concrete example, lets cover how to handle the boost C++ library.
This is obviously a compiler dependent module. Suppose you have the
gnu compiler collection (gcc) and the  intel compiler collection
(intel), which means that you'll have a gcc version and an intel
version for each version of booth.

In order to have generic modules for compiler dependent modules, there
must be some conventions to make this work.  A suggested way to do
this is the following:

#. Core modules are placed in `/apps/mfiles/Core`.  These are the
   compilers, programs like git and so on.
#. Core software goes in `/apps/<app-name>/<app-version>`.
   So git version 2.3.4 goes in  `/apps/git/2.3.4`
#. Compiler-dependent modulefiles go in
   `/apps/mfiles/Compiler/<compiler>/<compiler-version>/<app-name>/<app-version>`
   using the **two-digit** rule (discussed below).  So the Boost
   1.55.0 modulefile built with gcc/4.8.3 would be found in
   `/apps/mfiles/Compiler/gcc/4.8/boost/1.55.0.lua`
#. Compiler-dependent packages go in
   `/apps/<compiler-version>/<app-name>/<app-version>`.  So the same
   Boost 1.55.0 package built with gcc 4.8.3 would be placed in
   `/apps/gcc-4_8/boost/1.55.0`

The above convention depends on the **two-digit** rule.  For compilers
and mpi stack, we are making the assumption that compiler dependent
libraries built with gcc 4.8.1 can be used with gcc 4.8.3. This is not
always safe but it works well enough in practice.  The above
convention also assumes that the boost 1.55.0 package will be placed
in `/apps/gcc-4_8/boost/1.55.0`.  It couldn't go in
*/apps/gcc/4.8/...* because that is where the gcc 4.8 package would
be placed and it is not a good idea to co-mingle two different
packages in the same tree.  Another possible choice would be
*/apps/gcc-4.8/boost/1.55.0*.  It is my view that it looks too much
like the gcc version 4.8 package location where as *gcc-4_8* doesn't.

With all of the above assumptions, we can now create a generic module
file for compiler dependent modules such as Boost.  In order to make
this work, we will need to use the `hierarchyA` function.  This
function parses the path of the modulefile to return the pieces we
need to create a generic boost modulefile::

   hierA = hierarchyA(myModuleFullName(),1)

The `myModuleFullName()` function returns the full name of the
module.  So if the module is named **boost/1.55.0**, then that is what
it will return.  If your site uses module names like `lib/boost/1.55.0`
then it will return that correctly as well. The *1* tells Lmod to
return just one component from the path.  So if the modulefile is
located at `/apps/mfiles/Compiler/gcc/4.8/boost/1.55.0.lua`, then
`myModuleFullName()` returns **boost/1.55.0** and the `hierarchyA`
function returns an array with 1 entry.  In this case it returns::

   { "gcc/4.8" }

The rest of the module file then can make use to this result to form
the paths::

    local pkgName     = myModuleName()
    local fullVersion = myModuleVersion()
    local hierA       = hierarchyA(myModuleFullName(),1)
    local compilerD   = hierA[1]:gsub("/","-"):gsub("%.","_")
    local base        = pathJoin("/apps",compilerD,pkgName,fullVersion)

    whatis("Name: "..pkgName)
    whatis("Version "..fullVersion)
    whatis("Category: library")
    whatis("Description: Boost provides free peer-reviewed "..
                        " portable C++ source libraries.")
    whatis("URL: http://www.boost.org")
    whatis("Keyword: library, c++")

    setenv("TACC_BOOST_LIB", pathJoin(base,"lib"))
    setenv("TACC_BOOST_INC", pathJoin(base,"include"))

The important trick is the building of the `compilerD` variable.  It
converts the `gcc/4.8` into `gcc-4_8`.  This makes the `base` variable
be: `/apps/gcc-4_8/boost/1.55.0`.

Creating modules like this can be complicated. See
:ref:`debugging_modulefiles-label` for helpful tips.

A proposed directory structure of /apps/mfiles/Compiler would be::


    .base/    gcc/  intel/

    .base/
    boost/generic.lua

    gcc/4.8/boost/

    1.55.0.lua ->  ../../../.base/boost/generic.lua

    intel/15.0.2/boost/

    1.55.0.lua -> ../../../.base/boost/generic.lua

In this way the `.base/boost/generic.lua` file will be the source file
for all the boost version build with gcc and intel compilers.


The same technique can be applied for modulefiles for Compiler/MPI
dependent packages.  In this case, we will create the phdf5
modulefile.  This is a parallel I/O package that allows for Hierarchical
output.  The modulefile is::

    local pkgName    = myModuleName()
    local pkgVersion = myModuleVersion()
    local pkgNameVer = myModuleFullName()

    local hierA      = hierarchyA(pkgNameVer,2)
    local mpiD       = hierA[1]:gsub("/","-"):gsub("%.","_")
    local compilerD  = hierA[2]:gsub("/","-"):gsub("%.","_")
    local base       = pathJoin("/apps", compilerD, mpiD, pkgNameVer)

    setenv(      "TACC_HDF5_DIR",   base)
    setenv(      "TACC_HDF5_DOC",   pathJoin(base,"doc"))
    setenv(      "TACC_HDF5_INC",   pathJoin(base,"include"))
    setenv(      "TACC_HDF5_LIB",   pathJoin(base,"lib"))
    setenv(      "TACC_HDF5_BIN",   pathJoin(base,"bin"))
    prepend_path("PATH",            pathJoin(base,"bin"))
    prepend_path("LD_LIBRARY_PATH", pathJoin(base,"lib"))

    whatis("Name: Parallel HDF5")
    whatis("Version: " .. pkgVersion)
    whatis("Category: library, mathematics")
    whatis("URL: http://www.hdfgroup.org/HDF5")
    whatis("Description: General purpose library and file format for storing scientific data (parallel I/O version)")

We use the same tricks as before,  It is just that since the module
for phdf5 built by gcc/4.8.3 and mpich/3.1.2 will be found at
`/apps/mfiles/MPI/gcc/4.8./mpich/3.1/phdf5/1.8.14.lua`. The
results of `hierarchyA(pkgNameVer,2)` would be::

    { "mpich/3.1", "gcc/4.8" }

This is because the `hierarchyA` works back up the path two elements
at a time because the full name of this package is also two elements
(phdf5/1.8.14).  The `base` variable now becomes::

    /apps/gcc-4_8/mpich-3_1/phdf5/1.8.14

The last type of modulefile that needs to be discussed is an mpi stack
modulefile such as mpich/3.1.2.  This modulefile is more complicated
because it has to implement the two-digit rule, build the path to the
package and build the new entry to the **MODULEPATH**.  The modulefile
is::

    local pkgNameVer   = myModuleFullName()
    local pkgName      = myModuleName()
    local fullVersion  = myModuleVersion()
    local pkgV         = fullVersion:match('(%d+%.%d+)%.?')

    local hierA        = hierarchyA(pkgNameVer,1)
    local compilerV    = hierA[1]
    local compilerD    = compilerV:gsub("/","-"):gsub("%.","_")
    local base         = pathJoin("/apps",compilerD,pkgName,fullVersion)
    local mpath        = pathJoin("/apps/mfiles/MPI", compilerV, pkgName, pkgV)

    prepend_path("MODULEPATH", mpath)
    setenv(      "TACC_MPICH_DIR", base)
    setenv(      "TACC_MPICH_LIB", pathJoin(base,"lib"))
    setenv(      "TACC_MPICH_BIN", pathJoin(base,"bin"))
    setenv(      "TACC_MPICH_INC", pathJoin(base,"include"))

    whatis("Name: "..pkgName)
    whatis("Version "..fullVersion)
    whatis("Category: mpi")
    whatis("Description: High-Performance Portable MPI")
    whatis("URL: http://www.mpich.org")

The **Two Digit** rule implemented by forming the `pkgV` variable. The
`base` and `mpath` are::

    base  = "/apps/gcc-4_8/mpich-3_1/phdf5/1.8.14"
    mpath = "/apps/mfiles/MPI/gcc/4.8/mpich/3.1"

The *rt* directory contains all the regression test used by Lmod.  As
such they contain many examples of modulefiles.  To complement this
description, the *rt/hierarchy/mf* directory from the source tree
contains a complete hierarchy.