File: gfpconcat.1.html

package info (click to toggle)
gfarm 2.7.20%2Bdfsg-1
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 24,880 kB
  • sloc: ansic: 188,235; sh: 43,469; cpp: 9,442; java: 6,866; makefile: 3,519; perl: 2,412; python: 1,449; ruby: 965; php: 807; sql: 173; xml: 51; csh: 2
file content (166 lines) | stat: -rw-r--r-- 6,173 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<title>gfpconcat</title>
<meta name="generator" content="DocBook XSL Stylesheets V1.79.1">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="refentry">
<a name="gfpconcat.1"></a><div class="titlepage"></div>
<div class="refnamediv">
<a name="name"></a><h2>Name</h2>
<p>gfpconcat — concatenate files in parallel</p>
</div>
<div class="refsynopsisdiv">
<a name="synopsis"></a><h2>Synopsis</h2>
<div class="cmdsynopsis"><p><code class="command">gfpconcat</code>  [-fpqvd] [-h <em class="replaceable"><code>filesystem-node</code></em>] [-i <em class="replaceable"><code>input-listfile</code></em>] [-j <em class="replaceable"><code>num-of-processes</code></em>] [-m <em class="replaceable"><code>minimum-bytes</code></em>]  -o <em class="replaceable"><code>destination-file</code></em>   <em class="replaceable"><code>source-file</code></em>... </p></div>
</div>
<div class="refsect1">
<a name="description"></a><h2>DESCRIPTION</h2>
<p>
<span class="command"><strong>gfpconcat</strong></span>
copies a single file or multiple files to a single file in parallel.
</p>
<p>
<em class="parameter"><code>source-file</code></em> is a source file to
copy.  When multiple source files are specified, they will be
concatinated in that order.
</p>
<p>
<em class="parameter"><code>destination-file</code></em>
is a destination file to copy.
When a destination file exists, the file will not be overwritten.
</p>
</div>
<div class="refsect1">
<a name="source-file"></a><h2>SOURCE FILE</h2>
<p>
<em class="parameter"><code>source-file</code></em>
must be one of the following formats.
</p>
<div class="variablelist"><dl class="variablelist">
<dt><span class="term"><em class="parameter"><code>path-name</code></em></span></dt>
<dd><p>
is a relative path or an absolute path of a local file system. When
the path is a mount point on gfarm2fs, the file is read without
passing through the gfarm2fs.
</p></dd>
<dt><span class="term"><em class="parameter"><code>gfarm:...</code></em></span></dt>
<dd><p>
is a Gfarm URL.
</p></dd>
<dt><span class="term"><em class="parameter"><code>file:...</code></em></span></dt>
<dd><p>
is an URL of a local file system.
</p></dd>
</dl></div>
</div>
<div class="refsect1">
<a name="dest-file"></a><h2>DESTINATION FILE</h2>
<p>
<em class="parameter"><code>destination-file</code></em>
must be one of the following formats.
</p>
<div class="variablelist"><dl class="variablelist">
<dt><span class="term"><em class="parameter"><code>path-name</code></em></span></dt>
<dd><p>
is a relative path or an absolute path of a local file system. When
the path is a mount point on gfarm2fs, the file is written without
passing through the gfarm2fs.
</p></dd>
<dt><span class="term"><em class="parameter"><code>gfarm:...</code></em></span></dt>
<dd><p>
is a Gfarm URL.
</p></dd>
<dt><span class="term"><em class="parameter"><code>file:...</code></em></span></dt>
<dd><p>
is an URL of a local file system.
</p></dd>
</dl></div>
</div>
<div class="refsect1">
<a name="options"></a><h2>OPTIONS</h2>
<div class="variablelist"><dl class="variablelist">
<dt><span class="term"><code class="option">-f</code></span></dt>
<dd><p>
Overwrites existing file.
</p></dd>
<dt><span class="term"><code class="option">-h</code> <em class="parameter"><code>filesystem-node</code></em></span></dt>
<dd><p>
explicitly specifies the file system node to be stored.
</p></dd>
<dt><span class="term"><code class="option">-i</code> <em class="parameter"><code>input-listfile</code></em></span></dt>
<dd><p>
Specifies a file which describes input files instead
of <em class="parameter"><code>source-file</code></em> arguments.
The <em class="parameter"><code>input-listfile</code></em> consists of
an input URL on each line.
</p></dd>
<dt><span class="term"><code class="option">-j</code> <em class="parameter"><code>num-of-processes</code></em></span></dt>
<dd>
<p>
Specifies the number of processes to copy.
The source files are splited evenly, and they will be assigned to each process.
</p>
<p>
The default value is the parameter
of <em class="parameter"><code>client_parallel_copy</code></em> in
gfarm2.conf. (see man gfarm2.conf)
</p>
</dd>
<dt><span class="term"><code class="option">-m</code> <em class="parameter"><code>minimum-bytes</code></em></span></dt>
<dd><p>
Specifies the minimum size (byte) per a process to copy in parallel.
When the size assigned to a process is equal or smaller than -m,
single process is used to copy (not copied in parallel).

The default value is 0 byte (always copied in parallel).
</p></dd>
<dt><span class="term"><code class="option">-p</code></span></dt>
<dd><p>
Reports copy performance.
</p></dd>
<dt><span class="term"><code class="option">-q</code></span></dt>
<dd><p>
Suppresses non-error messages.
</p></dd>
<dt><span class="term"><code class="option">-v</code></span></dt>
<dd><p>
Displays verbose output.
</p></dd>
<dt><span class="term"><code class="option">-d</code></span></dt>
<dd><p>
Displays debug output.
</p></dd>
<dt><span class="term"><code class="option">-?</code></span></dt>
<dd><p>
Displays a list of command options.
</p></dd>
</dl></div>
</div>
<div class="refsect1">
<a name="notes"></a><h2>NOTES</h2>
<p>
When a file is created in Gfarm and the file is written in parallel or
randomly, checksum for the file will not be calculated and not added.

Therefore, when <span class="command"><strong>gfpconcat</strong></span> is used,
checksum will not be added to the file.

Once the file is read sequentially or replicas for the file are
created, checksum will be added to the file.

Or, when <em class="parameter"><code>write_verify</code></em> of
gfmd.conf (see man gfarm2.conf) is "enable", checksum is added to the
file after a while after copying
by <span class="command"><strong>gfpconcat</strong></span>.
</p>
</div>
<div class="refsect1">
<a name="see-also"></a><h2>SEE ALSO</h2>
<p>
  <span class="citerefentry"><span class="refentrytitle">gfcksum</span>(1)</span>,
  <span class="citerefentry"><span class="refentrytitle">gfarm2.conf</span>(5)</span>
</p>
</div>
</div></body>
</html>