File: s4_external_data.rst

Opening data externally
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. _raw_external_data:

This tutorial covers how to open data saved from RAW in other programs.
It doesn't focus on how to use particular plotting software, but rather
on the format of the data RAW saves and how to open it in common programs like
Excel.

The written version of the tutorial follows.


Opening .dat files
*************************

RAW saves profile data, including the q, I(q), and uncertainty data, in .dat files.
Note that .dat is a common extension for scattering profile data, and other programs,
such as Primus, also produce .dat files which may have slightly different formats.

File format
##############

These .dat files are standard text files with space separated values. They
consist of a header (and possibly a footer), which are marked out by "#"
starting the header and footer lines. The data is saved in three columns
with space separation, and the numbers are in scientific format, e.g. 1.23E-04.
The scattering profile data has a ``#      Q             I(Q)           Error``
header line, and three columns which are:

    #.  **Q** - The experimental q vector
    #.  **I(Q)** - The experimental intensities
    #.  **Error** - The experimental uncertainty

A large amount of data, including analysis data, is saved in either the footer
(by default) or the header. Regardless of its physical location in the file, the
start of this information is marked by a "### HEADER:" line. After that line,
if the leading "#" marks are removed from each line, the extra data is in JSON
format.

The actual header always contains the number of data points and the column headings.
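
As a minimal sketch of the layout described above, the following Python reads
the three data columns and the extra JSON data from the text of a .dat file.
The function name ``parse_dat_text`` and the ``SAMPLE`` text are made up for
illustration, and the sketch assumes that the JSON data is the first ``{...}``
object found after the "### HEADER:" line. It is not RAW's own loader.

.. code-block:: python

    import json

    def parse_dat_text(text):
        """Return (rows, metadata) from the text of a RAW-style .dat file."""
        rows = []          # (q, I(q), error) tuples
        json_lines = []    # lines after "### HEADER:" with leading "#" removed
        in_metadata = False
        for line in text.splitlines():
            stripped = line.strip()
            if stripped.startswith("### HEADER:"):
                in_metadata = True
                continue
            if stripped.startswith("#"):
                if in_metadata:
                    json_lines.append(stripped.lstrip("#"))
                continue
            parts = stripped.split()
            if len(parts) != 3:
                continue
            try:
                rows.append(tuple(float(p) for p in parts))
            except ValueError:
                continue  # not a numeric data line
        joined = "\n".join(json_lines)
        start = joined.find("{")
        if start == -1:
            return rows, {}
        # raw_decode parses one JSON object and ignores any text after it.
        metadata, _ = json.JSONDecoder().raw_decode(joined, start)
        return rows, metadata

    # Made-up example text, not taken from a real measurement.
    SAMPLE = """\
    #      Q             I(Q)           Error
    1.00E-02 5.00E+01 1.00E-01
    2.00E-02 4.00E+01 2.00E-01
    3.00E-02 3.00E+01 3.00E-01
    #
    ### HEADER:
    #
    #{
    #    "filename": "example.dat"
    #}
    """

    rows, metadata = parse_dat_text(SAMPLE)

Working on the file text rather than a file name keeps the sketch easy to try
out. For a real file, read the text with ``open(path).read()`` first.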

Opening .dat files in Excel
##############################

RAW .dat files can be opened in Excel (or LibreOffice or similar). From there
it's easy to save the three column data in whatever format you want for further
plotting or analysis.

#.  Open Excel.

#.  Create a blank workbook.

#.  Choose "File->Import->Text file" (or the data ribbon "Get External Data->
    From Text" button) and select a RAW .dat file.

#.  In the first screen of the import wizard select "Delimited" and click "Next".

    |import_dat_excel1_png|

#.  In the second screen of the import wizard check "Space".

    |import_dat_excel2_png|

#.  Click "Finish".

#.  In the next window that appears select the location and click "Import".

#.  The data will be imported. Note that the column headings will be shifted
    over by one column: the first column is Q, the second is I(Q), and the
    third is Error.

    |import_dat_excel3_png|


Opening .out files
*******************

RAW saves IFT data produced by GNOM, including the P(r) function and the fit
to the data, in .out files. This is the output format of the GNOM command line
program.

File format
###############

The .out file is a standard text file with space separated values. It contains
four sections delineated by section headers like:
``####      Configuration                                 ####``\.
The numbers for the data in columns are in scientific format, e.g. 1.23E-04.
The first section is "Configuration", which gives input parameters used to create
the P(r) function. The second section is "Results", which gives regularization,
perceptual criteria, real space Rg and I(0) and total estimate results. The
third section is "Experimental Data and Fit", which consists of 5 columns containing
the input experimental data and the scattering profile generated as a Fourier
transform of the P(r) function, which should fit the data (for a good P(r) function).
The column headers are:

    #.  **S** - The q vector extrapolated to q=0
    #.  **J Exp** - The experimental intensities
    #.  **Error** - The experimental uncertainty
    #.  **J Reg** - The scattering profile that is the Fourier transform of the P(r)
        function over the experimental q range (also called the regularized
        intensity)
    #.  **I Reg** - The scattering profile that is the Fourier transform of the P(r)
        function over the extrapolated q range (also called the regularized
        intensity)

The fourth section is "Real Space Data", which consists of the P(r) function.
The columns are:

    #.  **R** - The real space distance
    #.  **P(R)** - The value of the pair distance distribution function at
        a given R value
    #.  **ERROR** - The uncertainty in the P(r) value
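
The section layout above can be read with a short Python sketch like the one
below, which collects the numeric rows of one named section. The function name
``read_out_section`` and the ``SAMPLE_OUT`` text are hypothetical, and the
sketch assumes that every section header line starts with ``####`` and contains
the section name, as in the example header shown earlier.

.. code-block:: python

    def read_out_section(text, section_name, n_columns):
        """Collect rows of n_columns numbers from one section of a .out file."""
        rows = []
        in_section = False
        for line in text.splitlines():
            if line.strip().startswith("####"):
                # A "####   Name   ####" line starts a new section.
                in_section = section_name in line
                continue
            if not in_section:
                continue
            parts = line.split()
            if len(parts) != n_columns:
                continue
            try:
                rows.append(tuple(float(p) for p in parts))
            except ValueError:
                continue  # column headings or other text
        return rows

    # Made-up example text that only mimics the section structure.
    SAMPLE_OUT = """\
    ####      Results                                       ####
      Real space Rg:   0.2500E+02

    ####      Real Space Data                               ####

          R          P(R)      ERROR

      0.0000E+00   0.0000E+00   0.0000E+00
      0.1000E+01   0.2500E-03   0.1000E-04
      0.2000E+01   0.9000E-03   0.2000E-04
    """

    pr_rows = read_out_section(SAMPLE_OUT, "Real Space Data", 3)

Each entry of ``pr_rows`` is an (R, P(R), ERROR) tuple of floats.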


Opening .out files in Excel
###############################

RAW .out files can be opened in Excel (or LibreOffice or similar). From there
it's easy to save the data in whatever format you want for further
plotting or analysis. The import process is basically the same as for .dat files,
above:

#.  Open Excel.

#.  Create a blank workbook.

#.  Choose "File->Import->Text file" (or the data ribbon "Get External Data->
    From Text" button) and select a GNOM .out file.

#.  In the first screen of the import wizard select "Delimited" and click "Next".

#.  In the second screen of the import wizard check "Space".

#.  Click "Finish".

#.  In the next window that appears select the location and click "Import".

#.  In the imported data, some of the I Reg values will be in the wrong column,
    due to how separators are handled. Scroll down to the experimental data section.
    Select the data in the second column, from the top to just above where the
    next three columns start.

    |import_out_excel1_png|

#.  Cut that data and paste it at the top of the fifth column.

    |import_out_excel2_png|

#.  The header labels are not in the correct columns, but the five data columns
    in the experimental section now correspond to those given above: S, J Exp,
    Error, J Reg, I Reg respectively.

#.  Scroll down further to find the three column P(r) data with the correct
    column headers.

    |import_out_excel3_png|
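
The cut and paste step above can also be done in a script. The Python sketch
below assumes, based on the steps above, that rows in the extrapolated range
hold only two numbers (S and I Reg), while rows in the experimental range hold
all five. The function name ``read_fit_rows`` and the ``SAMPLE_FIT`` text are
made up for illustration.

.. code-block:: python

    import math

    def read_fit_rows(lines):
        """Return (S, J Exp, Error, J Reg, I Reg) tuples, padding short rows."""
        rows = []
        for line in lines:
            try:
                values = [float(p) for p in line.split()]
            except ValueError:
                continue  # column headings or other text
            if len(values) == 5:
                rows.append(tuple(values))
            elif len(values) == 2:
                # Extrapolated row: only S and I Reg are present, so the
                # three missing columns are filled with NaN.
                s, i_reg = values
                rows.append((s, math.nan, math.nan, math.nan, i_reg))
        return rows

    # Made-up example rows, not taken from a real GNOM run.
    SAMPLE_FIT = """\
          S          J EXP       ERROR       J REG       I REG

      0.0000E+00                                        0.5100E+02
      0.5000E-02                                        0.5050E+02
      0.1000E-01  0.5000E+02  0.1000E+00  0.4990E+02  0.4990E+02
    """

    fit_rows = read_fit_rows(SAMPLE_FIT.splitlines())

Padding with NaN keeps every row five columns wide, so the I Reg values always
sit in the last column.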


Opening .ift files
*******************

RAW saves IFT data produced by BIFT in .ift files.

File format
##############

These .ift files are standard text files with space separated values.
The numbers for the data in columns are in scientific format, e.g. 1.23E-04.
These files consist of several different sections marked by column headers on
lines starting with "#". The file starts with a "# BIFT" line to
identify the type of IFT. The first section is the P(R) function and starts
with the ``#      R             P(R)           Error`` header. The columns are:


    #.  **R** - The real space distance
    #.  **P(R)** - The value of the pair distance distribution function at
        a given R value
    #.  **Error** - The uncertainty in the P(r) value

The second section is the experimental data and the scattering profile generated
as a Fourier transform of the P(r) function, which should fit the data (for a
good P(r) function). It is started with the
``#      Q             I(Q)           Error            Fit`` header. The columns are:

    #.  **Q** - The experimental q vector
    #.  **I(Q)** - The experimental intensities
    #.  **Error** - The experimental uncertainty
    #.  **Fit** - The scattering profile that is the Fourier transform of the P(r)
        function over the experimental q range (also called the regularized
        intensity)

The third section is the fit/regularized intensity extrapolated to q=0 and
starts with the
``#  Q_extrap       Fit_extrap``
header. The columns are:

    #.  **Q_extrap** - The extrapolated q vector
    #.  **Fit_extrap** - The scattering profile that is the Fourier transform
        of the P(r) function over the extrapolated q range (also called the regularized
        intensity)

The fourth section is data about the parameters used and derived values,
such as the I(0) and Rg, which are saved at the end of the file. This
information is marked by a ``### HEADER:`` line. After that line,
if the leading "#" marks are removed from each line, the extra data is in JSON
format.
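
A minimal Python sketch of reading the column sections described above is given
below. It treats each "#" line that contains text as the column header of a new
section and stops at the ``### HEADER:`` line. The function name
``parse_ift_sections`` and the ``SAMPLE_IFT`` text are hypothetical, and the
sketch assumes that no other "#" comment lines appear between a column header
and its data.

.. code-block:: python

    def parse_ift_sections(text):
        """Return a list of (column_names, rows) for each data section."""
        sections = []
        current = None
        for line in text.splitlines():
            stripped = line.strip()
            if stripped.startswith("### HEADER:"):
                break  # the rest is the JSON data, not column data
            if stripped.startswith("#"):
                names = stripped.lstrip("#").split()
                if names:
                    current = (names, [])
                    sections.append(current)
                continue
            parts = stripped.split()
            if current is None or not parts:
                continue
            try:
                current[1].append(tuple(float(p) for p in parts))
            except ValueError:
                continue  # not a numeric data line
        # Drop header-only entries such as the "# BIFT" line.
        return [(names, rows) for names, rows in sections if rows]

    # Made-up example text, not taken from a real BIFT run.
    SAMPLE_IFT = """\
    # BIFT
    #      R             P(R)           Error
    0.00E+00 0.00E+00 0.00E+00
    1.00E+00 2.00E-03 1.00E-04
    #
    #      Q             I(Q)           Error            Fit
    1.00E-02 5.00E+01 1.00E-01 4.99E+01
    #
    #  Q_extrap       Fit_extrap
    0.00E+00 5.10E+01
    1.00E-02 4.99E+01
    #
    ### HEADER:
    #
    #{
    #    "dmax": 2.0
    #}
    """

    sections = parse_ift_sections(SAMPLE_IFT)

For the sample text, ``sections`` holds three entries: the P(R) data, the
experimental data with the fit, and the extrapolated fit.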

Opening .ift files in Excel
###############################

RAW .ift files can be opened in Excel (or LibreOffice or similar). From there
it's easy to save the data in whatever format you want for further
plotting or analysis. The import process is basically the same as for .dat files,
above:

#.  Open Excel.

#.  Create a blank workbook.

#.  Choose "File->Import->Text file" (or the data ribbon "Get External Data->
    From Text" button) and select an .ift file.

#.  In the first screen of the import wizard select "Delimited" and click "Next".

#.  In the second screen of the import wizard check "Space".

#.  Click "Finish".

#.  In the next window that appears select the location and click "Import".

#.  The data will be imported. Note that the column headings will be shifted
    over by one column. For example, for the P(r) data, the first column is
    the R data, the second the P(R) data, and the third the Error data.

#.  Scroll down through the spreadsheet to see all of the imported data.


Opening .csv files
*******************

RAW saves most exported data and analysis in .csv (comma separated value) format.

File format
##############

These .csv files are standard text files with comma separated values. Files
generated from RAW will have header information, such as column headings,
marked with a "#" at the start of the line.
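
As a small sketch of this layout, the Python below separates the "#" header
lines from the comma separated rows using the standard ``csv`` module. The
function name ``read_raw_csv`` and the ``SAMPLE_CSV`` text, including its column
names, are made up for illustration and do not come from a real RAW export.

.. code-block:: python

    import csv

    def read_raw_csv(text):
        """Return (header_lines, rows) from the text of a RAW-style .csv file."""
        header_lines = []
        data_lines = []
        for line in text.splitlines():
            if line.startswith("#"):
                # Header information, with the leading "#" removed.
                header_lines.append(line.lstrip("#").strip())
            elif line.strip():
                data_lines.append(line)
        # csv.reader accepts any iterable of lines and returns lists of strings.
        rows = list(csv.reader(data_lines))
        return header_lines, rows

    # Made-up example text.
    SAMPLE_CSV = """\
    #Filename,Rg,I0
    a.dat,25.1,0.052
    b.dat,26.3,0.049
    """

    header_lines, rows = read_raw_csv(SAMPLE_CSV)

The values in ``rows`` are left as strings, since exported files can mix text
and numbers. Convert individual columns with ``float`` as needed.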

Opening .csv files in Excel
#############################

.csv files can generally be opened in Excel (or LibreOffice or similar) by double
clicking on the file. From there it's easy to save the data in whatever format
you want for further plotting or analysis. If double clicking doesn't work,
the import process is basically the same as for .dat files, above:

#.  Open Excel.

#.  Create a blank workbook.

#.  Choose "File->Import->Text file" (or the data ribbon "Get External Data->
    From Text" button) and select a .csv file.

#.  In the first screen of the import wizard select "Delimited" and click "Next".

#.  In the second screen of the import wizard check "Comma".

#.  Click "Finish".

#.  In the next window that appears select the location and click "Import".


.. |import_dat_excel1_png| image:: images/import_dat_excel1.png
    :target: ../_images/import_dat_excel1.png
    :width: 500 px

.. |import_dat_excel2_png| image:: images/import_dat_excel2.png
    :target: ../_images/import_dat_excel2.png
    :width: 500 px

.. |import_dat_excel3_png| image:: images/import_dat_excel3.png
    :target: ../_images/import_dat_excel3.png
    :width: 500 px

.. |import_out_excel1_png| image:: images/import_out_excel1.png
    :target: ../_images/import_out_excel1.png

.. |import_out_excel2_png| image:: images/import_out_excel2.png
    :target: ../_images/import_out_excel2.png

.. |import_out_excel3_png| image:: images/import_out_excel3.png
    :target: ../_images/import_out_excel3.png