File: source-shell-script-in-modulefile.rst

package info (click to toggle)
modules 5.6.1-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 23,036 kB
  • sloc: exp: 79,659; sh: 6,142; tcl: 5,900; makefile: 1,492; ansic: 474; python: 265; csh: 202; perl: 47; ruby: 44; lisp: 13
file content (232 lines) | stat: -rw-r--r-- 13,755 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
.. _source-shell-script-in-modulefile:

Source shell script in modulefile
=================================

Configuration
-------------

- No specific configuration


Specification
-------------

- Execute a shell script and include the environment changes it performs in modulefile

  - Environment changes done by shell script are gathered and evaluated in modulefile context through corresponding modulefile commands
  - Goal is to get the same environment after loading a modulefile using a :mfcmd:`source-sh` than if shell script targeted by the ``source-sh`` where directly sourced in shell session

- Environment changes tracked are (all environment elements ``module`` can handle):

  - Value set to variable: transformed into ``setenv`` modulefile command
  - Variable unset: transformed into ``unsetenv`` modulefile command
  - Path element added to variable: transformed into ``append-path`` or ``prepend-path`` modulefile command
  - Path element removed from variable: transformed into ``remove-path`` modulefile command
  - Current working directory change: transformed into ``cd`` modulefile command
  - Shell alias definition: transformed into ``set-alias``
  - Shell alias unset: transformed into ``unset-alias``
  - Shell function definition: transformed into ``set-function``
  - Shell function unset: transformed into ``unset-function``
  - Shell completion definition: transformed into ``complete``
  - Shell completion unset: transformed into ``uncomplete``

- Depending on modulefile evaluation mode, ``source-sh`` has different behaviors:

  - ``load``, ``refresh`` and ``scan``: execute script to gather its environment changes, transform them in modulefile commands and evaluate them
  - ``unload``: undo environment changes made on load mode
  - ``display``: execute script to gather its environment changes and report resulting command name and arguments for display
  - ``help``, ``test`` and ``whatis``: no operation

- **Limitation**: code in modulefile cannot rely on the environment changes done in script targeted by a ``source-sh`` command

  - For instance an environment variable set in shell script cannot be used to define another variable later in modulefile
  - This will work on ``load``, ``unload`` and ``display`` modes, as script is run and/or analyzed and corresponding modulefile commands are evaluated in modulefile context
  - But it will not work on the other modes, as script is not run and analyzed there

    - To simplify processing as script need to be run and analyzed if not yet loaded, but if already loaded changes recorded in environment for tracking should be used instead
    - To avoid a negative impact on performances on the ``whatis`` global evaluation

- Result of ``source-sh`` command evaluation is stored into the environment when modulefile is loaded

  - To keep track of environment changes made by ``source-sh`` script evaluation
  - In order to undo these changes when unloading modulefile and report corresponding modulefile commands when displaying modulefile
  - Environment variable :envvar:`__MODULES_LMSOURCESH` is used for this need

    - Using following syntax: ``mod&shell script arg|cmd|cmd|...&shell script|cmd:mod&shell script arg|cmd``
    - Example value: ``foo/1&sh /tmp/source.sh|append-path PATH /tmp|cd /tmp``
    - Characters used to structure information in ``__MODULES_LMSOURCESH`` (``:``, ``&`` and ``|``) are escaped

      - Respectively to ``<EnvModEscPS>``, ``<EnvModEscS1>`` and ``<EnvModEscS2>``
      - If found in environment changes to record

    - Actual bodies of shell alias, shell functions and shell completions are not recorded in ``__MODULES_LMSOURCESH``, an empty body is recorded instead

      - Example value: ``foo/1&sh /tmp/source.sh|set-alias alfoo {}|set-function funcfoo {}|complete bash foocmd {}``

  - When unloading modulefile, content found for this modulefile in ``__MODULES_LMSOURCESH`` variable is evaluated to reverse environment changes

    - When reaching a ``source-sh`` modulefile command, recorded content is evaluated through a modulefile Tcl interpreter in unload mode, to get the reversed effect

  - When displaying modulefile

    - If it is loaded

      - the content found for this modulefile in ``__MODULES_LMSOURCESH`` variable is evaluated in display mode to report each resulting modulefile command
      - script is evaluated to fetch shell alias, function and completion definitions which are not recorded in ``__MODULES_LMSOURCESH``

    - If not loaded, script is evaluated to gather environment changes and report each resulting modulefile command

- Script targeted by a ``source-sh`` command has to be run and environment prior this run and after this run have to be compared to determine the environment changes the script performs

  - The shell to use to run script has to be specified to ``source-sh``
  - This shell will be run to execute the following sequence:

    1. output current environment state
    2. source the script with its args
    3. output environment state after script source

  - Script output is kept to return it in an error message in case its execution fails
  - This comparison determines the environment changes performed by script which are then translated into corresponding modulefile commands
  - Shell is run in a mode where neither its user nor its system-wide configuration is evaluated to avoid side effect
  - Shell needs current environment to correctly evaluate script

    - it seems desirable to run shell in an empty environment to get the full list of changes it does over the environment
    - but the script may need the environment to be defined to correctly operate

      - for instance it needs the ``PATH`` to be set to execute external commands

    - so shell run inherits current environment to ensure script will be properly evaluated
    - but as a consequence if an environment change performed by script is already done prior script run, this environment change will not be *seen*

- **Limitation**: a variable already set by another module will not be seen set sourced script

  - if this module is unloaded and if it does not trigger the unload of the modulefile using ``source-sh``, variable will be unset as source-sh script has not increased variable reference counter

- **Note**: environment change done through ``source-sh`` will not preserve change order occurring in sourced script, as all changes are analyzed after sourcing script

- Most shell will not get the full environment content when spawned as a sub-process

  - For instance shell aliases are not transmitted into sub-shells
  - As a result the environment prior script source will most of the time have no shell alias defined

- **Note**: shells may have specific behaviors

  - ``dash`` cannot pass arguments to the sourced script
  - ``ksh93`` reads the full script prior sourcing it which may cause issue if a shell alias is defined in script then used in script

- **Limitation**: implementation does not currently support:

  - the ``zsh`` shell when used as ``sh`` or as ``ksh``
  - the ``mksh`` shell
  - the BSD variant of the ``sh`` shell
  - the Windows ``cmd`` shell

- **Note**: if sourced script produces shell alias or function, these alias or function may not be compatible with the current shell of the user

- Shell completion is defined for the shell used to evaluate script

  - :mfcmd:`complete` commands are generated even if current user shell is not the same as the shell used to evaluate script
  - but the completion definition will produce a no-operation on current user shell when :mfcmd:`complete` command is evaluated

- **Note**: the mechanism described here only applies for shell script as to understand the impact the script execution over the user environment, this environment need to be compared prior and after execution

- ``source-sh`` modulefile command relies of the ``sh-to-mod`` procedure of ``modulecmd.tcl``

  - ``sh-to-mod`` procedure handles the evaluation of the targeted script and comparison of the environment prior and after this comparison
  - ``sh-to-mod`` returns as a result the list of modulefile commands describing the environment changes made by the analyzed script

- :subcmd:`sh-to-mod` is a module sub-command, calling the ``sh-to-mod`` procedure and outputting resulting modulefile commands

  - with a ``#%Module`` header
  - to enable users to convert the environment changes made by script in modulefiles

- There is no need to also make ``sh-to-mod`` a modulefile command and ``source-sh`` a module sub-command

- Shell to use to source targeted script may be specified as a command name or as a path name

  - When specified as a command name (e.g., ``bash``), command location will be determined based on currently set ``PATH``
  - When specified as a path name (e.g., ``/bin/bash``), this path name determines command location

- Prior executing shell and making it sourcing script:

  - Shell is checked to ensure an executable command matches it
  - Script is checked to ensure it exists
  - Those tests are done prior executing to avoid it if one check fails and provide a consistent error message whatever the shell used

- Environment changes to output as result should be enclosed and escaped

  - Enclosed if they contains space character (like white-space, tab and newline characters)
  - Escaped if they contains curly braces, as output is formatted as Tcl code, to avoid parsing issue

- If an error occurs during targeted script evaluation

  - Error is thrown which leads to either a modulefile evaluation error or a module sub-command error
  - Error and output messages reported by evaluated script will be reported along error by modulefile command or module sub-command

- To get prior and resulting environment state

  - ``env`` command is not used to gather environment state through exec source execution

    - it would simplify environment state parsing, as same command would be used for every shell
    - but it is an external command, so requires extra processing and an additional requirement
    - moreover it does not return shell functions in general, only exported Bash functions

  - Shell builtin commands are used to query existing environment variables, aliases, functions, completions and current working directory

    - which provides best processing efficiency
    - but leads to specific output parsing for each shell

  - A separator string ``%ModulesShToMod%`` is printed between each kind of environment item (variable, alias, etc) and also prior and after script evaluation

    - to separate each output kind and then be able to split them for separate analysis

- De-duplication of path entries is applied for changes on path-like environment variables

  - If the same path entry appears several times in the newly prepended entries for a variable, the first occurrence of this entry is kept others are dropped
  - If the same path entry appears several times in the newly appended entries for a variable, the first occurrence of this entry is kept others are dropped
  - De-duplication is not applied for path entries:

    - appearing in both the new prepended entries and newly appended entries
    - appearing in newly prepended entries or newly appended entries and in entries defined prior script evaluation

- An environment variable equaling to the path separator character (``:``) prior script evaluation is considered as undefined prior script evaluation to avoid misleading analysis

- Environment variables made for Modules private use are filtered-out from the environment changes produced

  - ``LOADEDMODULES``, ``_LMFILES_`` and any variable prefixed by ``__MODULES_`` are concerned
  - Changes relative to Modules state are ignored this way
  - If script loads in turn a modulefile, environment changes reported will not report the loaded module but only the environment changes it does
  - Modules configuration variable (prefixed by ``MODULES_``) are still taken into account

- There may be several shell completion changes found for the same command on particular shell like fish

  - It produces multiple ``complete`` modulefile commands
  - But in ``__MODULES_LMSOURCESH`` variable, a single entry is recorded as a single ``uncomplete`` command should be generated when unloading modulefile
  - If completion definition is partial for a command when loading modulefile, on fish shell previous completion definition is cleared right before setting those defined in modulefile
  - When a completion definition difference is spotted after script evaluation, new completion definition is set which means previous definition is cleared (either intrinsically by shell or for fish due to the definition clear specifically added)

- Fish shell allows nested definition of shell functions

  - A separator string ``%ModulesSubShToMod%`` is printed between each function definition not to get main and nested functions mixed up
  - Such sub-separator is only applied on fish shell

- ``bash-eval`` shell is added in version 5.4

  - Instead of sourcing a bash shell script, execution output of this script is evaluated

- ``--ignore`` option is added in version 5.4

  - This option accepts a list of shell element separated by ``:`` character among:

    - ``envvar``
    - ``alias``
    - ``function``
    - ``chdir``
    - ``complete``

  - Shell elements changed by sourced script and listed on ``--ignore`` option are ignored (no modulefile command relative to these changes are produced)

- **FUTURE**: this feature may be extended to translate environment changes made by tools like Spack, Lmod or pkg-config. It may provide this way bridges between different realms.

.. vim:set tabstop=2 shiftwidth=2 expandtab autoindent: