
*condor_off*
=============

Shut down HTCondor daemons
:index:`condor_off<single: condor_off; HTCondor commands>`\ :index:`condor_off command`

Synopsis
--------

**condor_off** [**-help | -version** ]

**condor_off** [**-graceful | -fast | -peaceful |
-force-graceful | -drain** ] [**-annex** *name*] [**-debug[:opts]** ]
[**-pool** *centralmanagerhostname[:portnumber]*] [
**-name** *hostname* | *hostname* | **-addr** *"<a.b.c.d:port>"*
| *"<a.b.c.d:port>"* | **-constraint** *expression* | **-all** ]
[**-daemon** *daemonname* | **-master**]
[**-exec** *name*]
[**-reason** *"reason-string"*]
[**-request-id** *id*]
[**-check** *expr*]
[**-start** *expr*]

Description
-----------

*condor_off* shuts down a set of the HTCondor daemons running on a set
of one or more machines.  By default, it does this cleanly, so that
jobs have time to shut down.

The command *condor_off* without any arguments will shut down all
daemons except *condor_master*, unless **-annex** *name* is
specified. The *condor_master* can then handle both local and remote
requests to restart the other HTCondor daemons if need be. To restart
HTCondor running on a machine, see the *condor_on* command.

When the **-drain** option is chosen, draining options can be specified
by using the optional **-reason**, **-request-id**, **-check**, and **-start**
arguments.
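
As a minimal sketch of how **-drain** combines with its optional arguments,
the shell function below wraps one such call. The function name
``drain_off`` and the example host and reason are hypothetical, chosen
for illustration only; the options themselves are those listed in the
synopsis.

.. code-block:: shell

    # drain_off HOST REASON (hypothetical helper, not part of HTCondor):
    # ask the condor_master on HOST to drain its condor_startd daemons,
    # recording REASON, and then shut down the remaining daemons.
    drain_off() {
        host=$1
        reason=$2
        condor_off -drain -reason "$reason" -name "$host"
    }

For instance, ``drain_off cae17 "kernel upgrade"`` would run
``condor_off -drain -reason "kernel upgrade" -name cae17``.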

With the **-daemon** *master* option, *condor_off* will shut down all
daemons including the *condor_master*. With any other daemon name, the
**-daemon** option shuts down only the specified daemon.

When shutting down all daemons including the *condor_master*, the **-exec**
argument can be used to tell the master to run a configured :macro:`MASTER_SHUTDOWN_<Name>`
script before it exits.

For security, the sender of this command must be authenticated and
authorized at the ``ADMINISTRATOR`` access level.

Options
-------

 **-help**
    Display usage information
 **-version**
    Display version information
 **-graceful**
    The default. If jobs are running, wait for up to the configured grace period for them to finish, then exit
 **-fast**
    Quickly shut down daemons, immediately evicting any running jobs. A minimum of the first two characters of
    this option must be specified, to distinguish it from the
    **-force-graceful** option.
 **-peaceful**
    Wait indefinitely for jobs to finish
 **-force-graceful**
    Force a graceful shutdown, even after issuing a **-peaceful**
    command. A minimum of the first two characters of this option must
    be specified, to distinguish it from the **-fast** option.
 **-drain**
    Send a *condor_drain* command with the *-exit-on-completion* option to all
    *condor_startd* daemons that are managed by this master. Then wait for all *condor_startd*
    daemons to exit before shutting down other daemons.
 **-reason** *"reason-string"*
    Use with **-drain** to set a **-reason** *"reason-string"* value for the *condor_drain* command.
 **-request-id** *id*
    Use with **-drain** to set a **-request-id** *id* value for the *condor_drain* command.
 **-check** *expr*
    Use with **-drain** to set a **-check** *expr* value for the *condor_drain* command.
 **-start** *expr*
    Use with **-drain** to set a **-start** *expr* value for the *condor_drain* command.
 **-annex** *name*
    Turn off master daemons in the specified annex. By default this will
    result in the corresponding instances shutting down.
 **-debug[:opts]**
    Causes debugging information to be sent to ``stderr``. The debug level can be set
    by specifying an optional *opts* value. Otherwise, the configuration variable :macro:`TOOL_DEBUG`
    sets the debug level.
 **-pool** *centralmanagerhostname[:portnumber]*
    Specify a pool by giving the central manager's host name and an
    optional port number
 **-name** *hostname*
    Send the command to a machine identified by *hostname*
 *hostname*
    Send the command to a machine identified by *hostname*
 **-addr** *"<a.b.c.d:port>"*
    Send the command to a machine's master located at *"<a.b.c.d:port>"*
 *"<a.b.c.d:port>"*
    Send the command to a machine located at *"<a.b.c.d:port>"*
 **-constraint** *expression*
    Apply this command only to machines matching the given ClassAd
    *expression*
 **-all**
    Send the command to all machines in the pool
 **-master**
    Shut down the *condor_master* after shutting down all other daemons.
 **-exec** *name*
    When used with **-master**, the *condor_master* will run the program configured as
    :macro:`MASTER_SHUTDOWN_<Name>` after shutting down all other daemons.
 **-daemon** *daemonname*
    Send the command to the named daemon. Without this option, the
    command is sent to the *condor_master* daemon.

Graceful vs. Peaceful vs. Fast
------------------------------

A "fast" shutdown causes the requested daemon to exit without waiting
for jobs to finish.  Jobs running under a startd that is shut down fast
will be evicted. Jobs running on a schedd that is shut down fast will be
left running for their job lease duration (20 minutes by default),
assuming the corresponding startd is not also being shut down. If that
schedd restarts before the job lease expires, it will reconnect to these
running jobs and continue to run them, as long as the schedd and startd
are running.

A "graceful" shutdown of a schedd is functionally the same as a "fast"
shutdown of a schedd.

A "graceful" shutdown of a startd that has jobs running under it causes
the startd to wait for the jobs to exit of their own accord, up to the
``MaxJobRetirementTime``.  Once that time has passed, the startd will evict
any remaining running jobs and exit.

A "peaceful" shutdown of a startd or schedd will cause that daemon to
wait indefinitely for all existing jobs to exit before shutting down.
During this time, no new jobs will start.
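
The three modes map directly onto the **-fast**, **-graceful**, and
**-peaceful** options. The following is a minimal sketch of a wrapper that
selects one of them by name; the function name ``off_mode`` and its return
code for an unknown mode are assumptions made for this example.

.. code-block:: shell

    # off_mode MODE [condor_off arguments...] (hypothetical helper):
    # MODE must be fast, graceful, or peaceful; any remaining arguments
    # are passed through to condor_off unchanged.
    off_mode() {
        mode=$1
        shift
        case $mode in
            fast|graceful|peaceful)
                condor_off "-$mode" "$@"
                ;;
            *)
                echo "unknown shutdown mode: $mode" >&2
                return 2
                ;;
        esac
    }

With this sketch, ``off_mode peaceful -daemon startd`` would run
``condor_off -peaceful -daemon startd``.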

Exit Status
-----------

*condor_off* will exit with a status value of 0 (zero) upon success,
and it will exit with the value 1 (one) upon failure.
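
A script can act on this exit status. The sketch below is one way to do
so, under the assumption that a single host is targeted with **-name**;
the function name ``off_or_report`` and the message wording are
hypothetical.

.. code-block:: shell

    # off_or_report HOST (hypothetical helper): shut down the daemons on
    # HOST and report whether condor_off succeeded.
    off_or_report() {
        if condor_off -name "$1"; then
            echo "condor_off succeeded for $1"
        else
            # In the else branch, $? still holds condor_off's exit status.
            status=$?
            echo "condor_off failed for $1 (exit status $status)" >&2
            return "$status"
        fi
    }

The function returns the exit status of *condor_off*, so a caller can
chain it with ``&&`` or ``||``.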

Examples
--------

To shut down all daemons (other than *condor_master*) on the local
host:

.. code-block:: console

    $ condor_off

To shut down only the *condor_collector* on three named machines:

.. code-block:: console

    $ condor_off cinnamon cloves vanilla -daemon collector

To shut down daemons within a pool of machines other than the local
pool, use the **-pool** option. The argument is the name of the central
manager for the pool. Note that one or more machines within the pool
must be specified as the targets for the command. This command shuts
down all daemons except the *condor_master* on the single machine named
**cae17** within the pool of machines that has **condor.cae.wisc.edu**
as its central manager:

.. code-block:: console

    $ condor_off -pool condor.cae.wisc.edu -name cae17