|
Basic SEC-SAXS processing
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. _s1p7:
In a typical SEC-SAXS run, images are continuously collected while the eluate (outflow)
of a size exclusion column flows through the SAXS sample cell. As proteins scatter more
strongly than buffer, a plot of total scattered intensity vs. time, the so-called SAXS
chromatograph (or scattergram), will show a set of peaks similar to what is seen by
UV absorption measurement of the SEC system. RAW includes the capability to do routine
processing of SEC-SAXS data. This includes creating the SAXS chromatograph from the data,
plotting |Rg|, MW, and I(0) across the peaks, and extracting specific frames for further analysis.
*Note:* In RAW, this is called Series analysis, as the same tools can be used for other
sequentially sampled data sets.
A video version of this tutorial is available:
.. raw:: html
<style>.embed-container { position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden; max-width: 100%; } .embed-container iframe, .embed-container object, .embed-container embed { position: absolute; top: 0; left: 0; width: 100%; height: 100%; }</style><div class='embed-container'><iframe src='https://www.youtube.com/embed/3aJl2kNBPuA' frameborder='0' allowfullscreen></iframe></div>
The written version of the tutorial follows.
#. Clear any data loaded into RAW.
* *Tip:* In the Files tab, click the “Clear All” button to clear all data in RAW.
#. Go to the Files control tab and navigate to the **sec_sample_1** data directory.
#. Click on the first data file, **profile_001_0000.dat**\ . Scroll down to the
bottom of the file list, and shift click on the last file,
**profile_001_0964.dat**\ . This should highlight all of the files in
between, as well as the two you clicked on. Click on the "Plot Series"
button to load the series into RAW.
|series_load1_png|
#. RAW should automatically show you the Series plot panel. If not, click
on the Series tab in the plot panel. Then click on the Series tab in the control panel.
|series_plot_png|
* *Try:* Each point on this curve is the integrated intensity of a scattering profile.
You can figure out which one by right clicking on the filename in the Series list and
selecting ‘Show data’. This will show you the frame number and integrated intensity
displayed on the plot, and the filename corresponding to the displayed frame number.
#. Drag the plot so that you can clearly see the first frame. You’ll notice it has a
significantly lower intensity than the rest of the frames. This happens occasionally
at the MacCHESS G1 beamline (where the data was taken). It can make it harder to tell
what the data is doing.
* *Tip:* Select the crossed arrows in the plot control bar, and then click and drag on
the plot to move the curve around on the screen.
#. Reload the data as in steps 2-3, but select the second data file,
**profile_001_0001.dat** as the initial file in the series.
* *Tip:* As before, select **profile_001_0001.dat**, scroll down
to the last file, and shift click to select the files to load
in. Then click the "Plot Series" button.
#. You will see the same curve plotted as before, but without the very first
scattering profile. Remove the other loaded data set. Now you should have
a curve where the baseline is very close to the bottom of the plot.
|series_plot2_png|
#. In some cases it is more useful to look at the mean intensity, the intensity at a
specific *q* value, or the intensity in a range of *q* values than the total
intensity. Right click on the plot and select mean intensity for the left
axis y data. Then try the intensity at *q=0.02* and in the *q* range 0.01-0.03.
* *Note:* You need to have the drag button in the plot control bar unselected to
get a right click menu.
* *Tip:* CHROMIXS, in the ATSAS software, displays the average intensity
over the *q* range 0.01-0.08 :math:`Å^{-1}`. To achieve a similar
display in RAW, set your q range for the intensity to that. Note that
RAW will display the sum of the intensity over the range while CHROMIXS
displays mean intensity for the range, so the results won't be exactly the same.
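As a rough illustration of the intensity measures in this step, the sketch below
computes them for a single synthetic profile with NumPy. It is not RAW's
implementation: the function name ``intensity_measures``, its arguments, and the
use of a plain sum for the total intensity are assumptions made for the example.

.. code-block:: python

   import numpy as np

   def intensity_measures(q, i, q_point=0.02, q_range=(0.01, 0.03)):
       """Return several per-profile intensity measures (hypothetical helper)."""
       in_range = (q >= q_range[0]) & (q <= q_range[1])
       nearest = np.argmin(np.abs(q - q_point))  # closest measured q to q_point
       return {
           "total": i.sum(),                  # assumed here to be a plain sum
           "mean": i.mean(),
           "at_q": i[nearest],
           "range_sum": i[in_range].sum(),    # sum over the q range
           "range_mean": i[in_range].mean(),  # mean over the q range
       }

   # Synthetic Guinier-like profile, for illustration only
   q = np.linspace(0.005, 0.3, 60)
   i = np.exp(-(q * 30.0) ** 2 / 3.0)

   m = intensity_measures(q, i, q_range=(0.01, 0.08))
   n_points = np.count_nonzero((q >= 0.01) & (q <= 0.08))
   print(m["range_sum"], m["range_mean"] * n_points)

When every frame has the same number of points in the range, the sum and the mean
differ only by that constant factor, so the two curves have the same shape on
different scales.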
#. Return to plotting the integrated intensity. Zoom in near the base of the peak. Notice
that there are two smaller peaks on the left, likely corresponding to higher order
oligomers that we don’t have the signal to properly resolve. Also notice that the
baseline after the peak is not the same as the baseline before the peak. This can happen
for several reasons, such as damaged protein sticking to the sample cell windows.
* *Tip:* Click on the magnifying glass at the bottom of the plot, then click
and drag on the plot to select a region to zoom in on.
|series_plot3_png|
#. Zoom back out to the full plot.
* *Tip:* Click the Home (house) button at the bottom of the plot.
#. In order to determine if we really have a single species across the peak, we will
calculate the |Rg| and MW as a function of frame number. Click on the "LC
Analysis" button at the bottom of the Series control panel or right click on the
filename in the Series control panel and select "LC Series analysis" to open the
LC Series analysis panel.
* *Note:* At the top of the control panel in this window, in the 'Series info'
section you'll see several settings. If you had RNA instead of protein,
you would use the Vc Mol. type menu to select that option. This affects
the calculation of the molecular weight. You could also change the Vp
density away from the default value, or change the averaging window
(discussed below).
|lc_analysis_main_png|
#. The LC Series analysis panel provides basic and advanced analysis tools for
liquid chromatography experiments. Here we will show how to select buffer
and sample regions, and send final processed data to the Profiles plot. The
advanced baseline correction features are discussed later.
In order to calculate |Rg| and other parameters as a function of elution time
(Frame #), we need to define a buffer region. RAW can do this automatically.
In the 'Buffer' section click the 'Auto' button.
|lc_analysis_buffer_auto_png|
* *Checkpoint:* You should see a buffer range show up in the buffer list,
with defined start and end values. The region will be shown in green on
the Unsubtracted plot.
|lc_analysis_buffer_range_png|
#. You can make fine manual adjustments to the buffer range if necessary. Zoom
in on the baseline around the buffer region. Use the up/down arrows for the
Start and End points to adjust the buffer region a little bit. You will see
the region on the plot update as you make the changes.
|lc_analysis_buffer_adjust_png|
* *Warning:* The automatic buffer determination can be wrong! Always be sure
to manually inspect the region it picked. In particular, large flat leading edge
shoulders next to the main peak can look like a baseline region to the
algorithm, and will often mistakenly be picked.
* *Tip:* If the SAXS data isn't clear (noisy, low signal, etc.), it can be
useful to inspect the UV trace associated with the SEC elution to see
where there are minor elution components that you should exclude from
your buffer selection.
#. Zoom back out on the plot. Reset the buffer range to 504 to 562 by typing
those values in the Start/End range boxes and hitting enter.
#. To set the buffer region, create a set of subtracted profiles, and calculate
structural parameters as a function of elution time, click the 'Set buffer'
button. This may take a while to calculate.
|lc_analysis_buffer_set_png|
* *Note:* All of the files in the given buffer range will be averaged and used as a buffer.
A sliding average window (size defined by the 'Averaging window size' in the
'Series Info' section) is then moved across the SEC curve. So for a
window of size five, the profiles corresponding to frames 0-4, 1-5, 2-6,
etc. will be averaged. From each of these averaged sets of curves, the average
buffer will be subtracted, and RAW will attempt to calculate the |Rg|,
MW, and I(0). These values are then plotted as a function of frame number.
* *Warning:* It is important that the buffer range actually be buffer! In this case,
we need to make sure to not include the small peaks before the main peak.
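The averaging and subtraction described in the note above can be sketched as
follows. This is a minimal NumPy illustration on synthetic data, assuming all
frames share the same *q* grid. ``sliding_window_subtract`` is a hypothetical
helper, not a RAW function, and the |Rg|, MW, and I(0) calculations are omitted.

.. code-block:: python

   import numpy as np

   def sliding_window_subtract(frames, buffer_range, window=5):
       """Average frames in a sliding window and subtract the mean buffer.

       frames: array of shape (n_frames, n_q).
       buffer_range: inclusive (start, end) frame numbers used as buffer.
       """
       start, end = buffer_range
       buffer_avg = frames[start:end + 1].mean(axis=0)
       n_windows = frames.shape[0] - window + 1
       # Window k covers frames k .. k + window - 1 (0-4, 1-5, 2-6, ...)
       averaged = np.array(
           [frames[k:k + window].mean(axis=0) for k in range(n_windows)]
       )
       return averaged - buffer_avg

   # Synthetic series: flat buffer of 1.0 plus a small peak in frames 10-14
   profile = np.array([4.0, 3.0, 2.0, 1.0])
   conc = np.zeros(20)
   conc[10:15] = [1.0, 2.0, 3.0, 2.0, 1.0]
   frames = 1.0 + conc[:, None] * profile

   subtracted = sliding_window_subtract(frames, buffer_range=(0, 4), window=5)
   print(subtracted.shape)
   print(subtracted[10])  # window over frames 10-14

A series of N frames yields N - window + 1 averaged profiles, so the calculated
parameters cover slightly fewer points than the raw intensity curve.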
#. Once the calculation is finished, the window should automatically display the
Subtracted plot. If it doesn't, click on the 'Subtracted' tab in the plot.
On this plot there is a new Intensity vs. Frame # curve, representing the
subtracted data. There is also a set of markers, showing one of the calculated
parameters. By default the |Rg| is displayed. The calculated parameters are
plotted on the right Y axis. You can show |Rg|, I(0), and MW calculated by
the volume of correlation (Vc) and adjusted Porod volume (Vp) methods. Click
on the 'Calculated value' menu to switch between the different displays.
* *Try:* Show the |Rg|, MW (Vc), and MW (Vp). Notice that the MW estimate
varies between the two different methods.
* *Note:* You’ll notice a region of roughly constant |Rg| across the peak. To
either side there are regions with higher or lower |Rg| values. Some of these
variations, particularly on the right side, are from scattering profiles
near the edge of the peak with lower concentrations of sample, leading
to more noise in determining the |Rg| values. There may also be some
effects from the small peaks on the leading (left) side of the peak,
and from the baseline mismatch between left and right sides of the peak.
|lc_analysis_subtracted_png|
#. A monodisperse peak should display a region of flat |Rg| and MW near the center.
Note that some spread on either edge can come from small shoulders of other
components, bad buffer selection, or just the low signal to noise in the tails
of the peak. Zoom in on the |Rg| and MW values across the peak to verify that
these show a significant flat region.
RAW can automatically determine a good sample region (good being defined
as monodisperse and excluding low signal to noise data). To do this, click
the 'Auto' button in the Sample region.
|lc_analysis_sample_auto_png|
* *Checkpoint:* You should see a sample range show up in the sample list,
with defined start and end values. The region will be shown in green on
the Subtracted plot.
|lc_analysis_sample_region_png|
#. In the plot, zoom in on the peak region and verify that the |Rg| and MW seem
flat in the selected sample range.
|lc_analysis_sample_region_plot_png|
* *Tip:* You can manually adjust the sample region range in the same way
as the buffer range, using the controls in the Start/End boxes.
#. Once you are satisfied with the region picked (should be 699-713), click the
'To Profiles Plot' button. This averages the selected region and sends the resulting
average to RAW's Profiles Plot.
* *Note:* RAW first averages the selected sample and buffer regions in the
unsubtracted data, then subtracts. This avoids the possibility of correlated
noise that would arise from averaging the subtracted files.
|lc_analysis_sample_to_main_plot_png|
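To see why the order of operations matters, the sketch below compares the two
routes on synthetic data with unit uncertainties, using 15 sample frames and 59
buffer frames (the sizes of the 699-713 and 504-562 ranges). The helper names are
hypothetical, and the error propagation assumes independent Gaussian
uncertainties on the unsubtracted frames. It is not taken from RAW's code.

.. code-block:: python

   import numpy as np

   def average_profiles(i, err):
       """Average (n_frames, n_q) intensities, treating errors as independent."""
       n = i.shape[0]
       return i.mean(axis=0), np.sqrt((err ** 2).sum(axis=0)) / n

   def subtract(sample, sample_err, buffer, buffer_err):
       """Subtract buffer from sample and add the errors in quadrature."""
       return sample - buffer, np.sqrt(sample_err ** 2 + buffer_err ** 2)

   n_q = 3
   sample_i = np.full((15, n_q), 5.0)
   sample_err = np.ones((15, n_q))
   buffer_i = np.full((59, n_q), 2.0)
   buffer_err = np.ones((59, n_q))

   # Route 1: average the unsubtracted regions, then subtract once
   s_avg, s_err = average_profiles(sample_i, sample_err)
   b_avg, b_err = average_profiles(buffer_i, buffer_err)
   sub_i, sub_err = subtract(s_avg, s_err, b_avg, b_err)

   # Route 2: subtract the buffer from every frame, then average as if the
   # subtracted frames were independent (they are not: all share b_avg)
   frame_err = np.sqrt(sample_err ** 2 + b_err ** 2)
   naive_i, naive_err = average_profiles(sample_i - b_avg, frame_err)

   print(sub_i[0], naive_i[0])
   print(sub_err[0], naive_err[0])

Both routes give the same intensity, but the second underestimates the
uncertainty because the buffer noise common to every subtracted frame does not
average down.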
#. If you adjust the sample or buffer region in a way that could be problematic,
RAW will warn you. Try this.
* Adjust the Buffer start to include more of the elution range, such as
starting at 450. You will want to click on the 'Unsubtracted' plot to
see the buffer range. Then click 'Set Buffer'. You will see a warning window
telling you what might be wrong with the selected region. Click 'Cancel'.
|lc_analysis_buffer_range_warning_png|
* Adjust the Sample start to include some of the non-flat Rg region, such as
starting at 680. Then click 'To Profiles Plot'. You will see a warning window
telling you what might be wrong with the selected region. Click 'Cancel'.
|lc_analysis_sample_range_warning_png|
* *Note:* For buffer regions, RAW checks frame-wise similarity across the
whole *q* range and at low and high *q*, correlations in intensity, and
whether there are multiple singular values in the selected region.
For sample regions, RAW checks frame-wise similarity across the
whole *q* range and at low and high *q*, correlations in calculated Rg
and MW values, whether there are multiple singular values in the selected
region, and if some of the selected frames decrease the signal to noise
of the average.
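One of the checks listed above looks for multiple singular values in the selected
region. A simple way to explore that idea is sketched below: it counts the
singular values of the error-weighted intensity matrix that stand above an
estimate of the noise level. The function name, the noise estimate (the
approximate largest singular value of a pure unit-variance noise matrix), and the
``safety`` factor are assumptions for this example. RAW's own test may differ.

.. code-block:: python

   import numpy as np

   def count_significant_components(intensity, sigma, safety=2.0):
       """Count singular values above the expected noise level.

       intensity, sigma: arrays of shape (n_q, n_frames).
       """
       weighted = intensity / sigma  # noise now has roughly unit variance
       s = np.linalg.svd(weighted, compute_uv=False)
       n_q, n_frames = weighted.shape
       # Rough size of the largest singular value of pure unit-variance noise
       noise_level = np.sqrt(n_q) + np.sqrt(n_frames)
       return int(np.sum(s > safety * noise_level))

   # Synthetic data: Guinier-like profiles with Gaussian elution peaks
   rng = np.random.default_rng(0)
   q = np.linspace(0.01, 0.3, 200)
   frame = np.arange(30)
   profile_a = np.exp(-(q * 30.0) ** 2 / 3.0)
   profile_b = np.exp(-(q * 60.0) ** 2 / 3.0)
   conc_a = 1000.0 * np.exp(-((frame - 15) ** 2) / (2 * 5.0 ** 2))
   conc_b = 1000.0 * np.exp(-((frame - 8) ** 2) / (2 * 4.0 ** 2))
   sigma = np.ones((q.size, frame.size))
   noise = rng.normal(size=sigma.shape)

   one_species = np.outer(profile_a, conc_a) + noise
   two_species = one_species + np.outer(profile_b, conc_b)

   n_one = count_significant_components(one_species, sigma)
   n_two = count_significant_components(two_species, sigma)
   print(n_one, n_two)

A region that contains only buffer, or a single species, should show at most one
component above the noise. More than one suggests that the scattering changes
across the selected frames.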
#. Click 'OK' to close the window and save your analysis results. In the Info panel
above the Series control panel you should see information about the series,
including the selected buffer and sample ranges. If you reopen
the LC analysis window you will see the buffer and sample regions you selected
are remembered.
#. Click on the Profiles plot tab and the Profiles tab. You should see one scattering
profile, the buffer subtracted data set you sent to the Profiles plot. Carry out
Guinier and MW analysis.
* *Note:* The I(0) reference and absolute calibration will not be accurate for
SEC-SAXS data, as the concentration is not accurately known.
* *Question:* How do the |Rg| and MW you get from the averaged curve compare
to what RAW found automatically for the peak?
* *Tip:* Make sure your plot axes are Log-Lin or Log-Log. Make sure that both
plots are shown by clicking the 1/2 button at the bottom of the plot window.
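For reference, the Guinier analysis in this step rests on the approximation
:math:`\ln I(q) \approx \ln I(0) - R_g^2 q^2 / 3`, valid at low *q*. The sketch
below fits that line to a synthetic, noise-free profile with NumPy. The function
name ``guinier_fit`` and the fixed cutoff are assumptions for illustration. RAW's
Guinier window also handles range selection and uncertainties, which are not
reproduced here.

.. code-block:: python

   import numpy as np

   def guinier_fit(q, i, q_max):
       """Fit ln(I) against q^2 for q <= q_max and return (Rg, I0)."""
       mask = q <= q_max
       slope, intercept = np.polyfit(q[mask] ** 2, np.log(i[mask]), 1)
       if slope >= 0:
           raise ValueError("Guinier slope must be negative")
       return np.sqrt(-3.0 * slope), np.exp(intercept)

   # Synthetic profile with known parameters, for illustration only
   true_rg, true_i0 = 30.0, 10.0
   q = np.linspace(0.005, 0.1, 100)
   i = true_i0 * np.exp(-(q * true_rg) ** 2 / 3.0)

   # Keep q_max * Rg <= 1.3, a commonly used limit for globular particles
   rg, i0 = guinier_fit(q, i, q_max=1.3 / true_rg)
   print(rg, i0)

With real data the cutoff is not known in advance, so the fit range is usually
refined until the product of the largest *q* and the fitted |Rg| stays within the
chosen limit.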
#. This particular dataset shows a small difference between initial and final buffer
scattering profiles. A better scattering profile might be obtained by using buffer
from both sides of the peak. To do so, start by reopening the LC Series Analysis panel.
#. Switch to showing the unsubtracted intensity by clicking on the 'Unsubtracted' plot tab.
#. Add a second buffer region by clicking the 'Add region' button.
#. For the second region, click the 'Pick' button.
|lc_analysis_buffer_pick_png|
#. Move your mouse across the plot. You will see a vertical green line moving
with the mouse cursor. This represents the start of the buffer region. Click
once to fix the start point where you click. Move the mouse further to the
right and click again to fix the end point of the buffer region.
|lc_analysis_two_buffers_png|
#. Once you are happy with the second buffer region, click 'Set buffer'. A range like
~840-896 is reasonable.
#. A warning window will pop up. In this case, we have purposefully chosen two
buffer regions because they are different, so ignore the warning and click
'Continue'.
#. Remove the old sample region by clicking in the empty space to the right of
the 'Pick' button to highlight it, and then clicking the 'Remove region' button.
|lc_analysis_remove_sample_png|
#. Click the 'Auto' button to automatically find a new sample region. Click the
'To Profiles Plot' button to send that new region to the Profiles plot.
* *Try:* You can see what the data subtracted by just the second buffer
region looks like by removing the first buffer region, setting the buffer
again, finding a new good sample region, and sending the new region to the
Profiles plot.
#. Cancel out of the LC Series analysis window. This will not save the changes
you made to the buffer and sample regions.
#. Carry out the |Rg| and MW analysis on the new curve. How does the scattering
profile compare to the one that you generated using only buffer from before
the peak?
* *Tip:* You should see subtle but noticeable differences in the Guinier
fit.
* *Note:* An alternative approach to using several buffer regions is to use
a single buffer region and apply a baseline correction. Both approaches
have advantages and disadvantages. If you want to do EFA deconvolution,
it is best to not use a baseline correction. However, in other cases
a baseline correction will be more accurate, as it doesn't assume a single
average buffer across the peak.
#. Return to the Series control and plot panels.
#. If you want to look at either individual profiles or the average of a
range of profiles you can send profiles to the Profiles plot. To select which
series curve to send profiles from, star the series curve of interest.
|series_star_png|
#. In the ‘Data to Profiles plot’ section enter the frame range of interest.
For this dataset, try the buffer range you selected: 504 to 562. Then click
the ‘Average’ button. That will send the average buffer to the Profiles plot.
* *Try:* Send the average of the sample range you selected (699 to 713) to the
Profiles plot, carry out the subtraction, and verify it’s the same as
the curve produced by the ‘To Profiles Plot’ button in the LC Series Analysis
panel.
* *Question:* When you send the sample average to the Profiles plot you will
get a warning that the profiles are different. Why?
|series_average_main_plot_png|
#. You can also send subtracted (or baseline corrected) data to the Profiles plot.
For the selected sample range, select the ‘Subtracted’ frames and send each
individual profile to the plot using the ‘Plot’ button.
* *Try:* Average these profiles and verify they match the subtracted
profiles for this data set generated previously.
|series_profile_main_plot_png|
#. Click on the colored line next
to the star in the Series control panel. In the line properties control panel
this brings up, change the Calc Marker color to something different. Add a
line to the Calc Markers by selecting line style ‘-’ (solid), and adjust
the line color to your liking.
* *Tip:* You can do the same thing to adjust the colors of the scattering
profiles and IFTs in the Profiles and IFT control tabs.
|series_color_png|
|series_line_props_png|
#. For certain beamlines (the BioCAT beamline at the APS and the MacCHESS
BioSAXS beamline at CHESS), RAW can automatically load in series data
from the series panel. This is typically used for online analysis while
data is being collected, but can be used to load in series you have
already collected as well.
#. We will load in the Bovine Serum Albumin (BSA) SEC-SAXS data contained
in the **sec_sample_2** data folder using this automatic method.
In the Series control panel, click the “Select” button. Navigate to
the **Tutorial_Data/series_data/sec_sample_2** folder and select any of
the **.dat** files in the folder.
* *Troubleshooting:* If you get an error message, it means you don't have
a configuration file loaded. Load the SAXS.cfg file referenced :ref:`earlier <s1p1>`.
* The configuration file must be set to either BioCAT or MacCHESS beamlines
for this method to work. Otherwise, RAW doesn't know how to create
all the filenames in a series from a single filename.
|series_panel_png|
#. The SEC-SAXS run will automatically load. Note that because SAXS data can be
reported with an arbitrary intensity scale, the total intensity of this
series is much larger than the previous series.
#. Right click on the **profile_001** series and select "Adjust scale, offset, q range".
This will open a window that allows you to adjust the overall scale and offset
for your series data, as well as the q range used for each type of profile
in the series data. Set the scale to 1800 and click "OK". You should see
that this scales the **profile_001** series to match the BSA series.
* *Tip:* This applies the scale, offset, and q range settings to every profile
in the series. So if you were now to send a profile to the Profiles plot,
it would have an overall scale factor of 1800 applied to it.
|series_scale_png|
#. Hide the first series (**profile_001**).
#. Select a good buffer region, and calculate the |Rg| and MW across the peak for the BSA.
* *Tip:* If you hover your mouse cursor over the info icon, you will see the buffer
range and window size used to calculate the parameters.
* *Question:* Is the BSA peak one species?
#. Find the useful region of the peak (constant |Rg|/MW), and send the buffer and sample
data to the Profiles plot. Carry out the standard |Rg| and MW analysis on the subtracted
scattering profile. For BSA, we expect |Rg| ~28 Å and MW ~66 kDa.
#. In the Series control tab, right click on the name of the BSA curve in the list. Select export
data and save it in an appropriate location. This will save a CSV file with the frame
number, integrated intensity, radius of gyration, molecular weight, filename for each
frame number, and a few other items. This allows you to plot that data for publications,
align it with the UV trace, or whatever else you want to do with it.
* *Try:* Open the **.csv** file you just saved in Excel or Libre/Open Office Calc.
#. Select both items in the Series control panel list, and save them in the **series_data**
folder. This saves the series data in a form that can be quickly loaded by RAW.
* *Try:* Clear the Series data and then open one of your saved files from the Files tab
using either the “Plot” or “Plot Series” button.
#. Select just the **profile_001** item, right click and select "Save report". Check
the profiles associated with the SEC-SAXS series, so that you're saving a
report that includes both the series and the averaged subtracted profile.
Save the report as a pdf.
|series_report_png|
.. |series_load1_png| image:: images/series_load1.png
:width: 500 px
:target: ../_images/series_load1.png
.. |series_panel_png| image:: images/series_panel.png
:width: 400 px
:target: ../_images/series_panel.png
.. |series_plot_png| image:: images/series_plot.png
:target: ../_images/series_plot.png
.. |series_plot2_png| image:: images/series_plot2.png
:target: ../_images/series_plot2.png
.. |series_plot3_png| image:: images/series_plot3.png
:target: ../_images/series_plot3.png
.. |lc_analysis_main_png| image:: images/lc_analysis_main.png
:target: ../_images/lc_analysis_main.png
.. |lc_analysis_buffer_auto_png| image:: images/lc_analysis_buffer_auto.png
:width: 300 px
:target: ../_images/lc_analysis_buffer_auto.png
.. |lc_analysis_buffer_range_png| image:: images/lc_analysis_buffer_range.png
:target: ../_images/lc_analysis_buffer_range.png
.. |lc_analysis_buffer_adjust_png| image:: images/lc_analysis_buffer_adjust.png
:width: 300 px
:target: ../_images/lc_analysis_buffer_adjust.png
.. |lc_analysis_buffer_set_png| image:: images/lc_analysis_buffer_set.png
:width: 300 px
:target: ../_images/lc_analysis_buffer_set.png
.. |lc_analysis_subtracted_png| image:: images/lc_analysis_subtracted.png
:target: ../_images/lc_analysis_subtracted.png
.. |lc_analysis_sample_auto_png| image:: images/lc_analysis_sample_auto.png
:width: 300 px
:target: ../_images/lc_analysis_sample_auto.png
.. |lc_analysis_sample_region_png| image:: images/lc_analysis_sample_region.png
:target: ../_images/lc_analysis_sample_region.png
.. |lc_analysis_sample_region_plot_png| image:: images/lc_analysis_sample_region_plot.png
:target: ../_images/lc_analysis_sample_region_plot.png
.. |lc_analysis_sample_to_main_plot_png| image:: images/lc_analysis_sample_to_main_plot.png
:width: 300 px
:target: ../_images/lc_analysis_sample_to_main_plot.png
.. |lc_analysis_buffer_range_warning_png| image:: images/lc_analysis_buffer_range_warning.png
:width: 500 px
:target: ../_images/lc_analysis_buffer_range_warning.png
.. |lc_analysis_sample_range_warning_png| image:: images/lc_analysis_sample_range_warning.png
:width: 500 px
:target: ../_images/lc_analysis_sample_range_warning.png
.. |lc_analysis_buffer_pick_png| image:: images/lc_analysis_buffer_pick.png
:width: 300 px
:target: ../_images/lc_analysis_buffer_pick.png
.. |lc_analysis_two_buffers_png| image:: images/lc_analysis_two_buffers.png
:target: ../_images/lc_analysis_two_buffers.png
.. |lc_analysis_remove_sample_png| image:: images/lc_analysis_remove_sample.png
:width: 300 px
:target: ../_images/lc_analysis_remove_sample.png
.. |series_star_png| image:: images/series_star.png
:width: 400 px
:target: ../_images/series_star.png
.. |series_average_main_plot_png| image:: images/series_average_main_plot.png
:width: 400 px
:target: ../_images/series_average_main_plot.png
.. |series_profile_main_plot_png| image:: images/series_profile_main_plot.png
:width: 400 px
:target: ../_images/series_profile_main_plot.png
.. |series_color_png| image:: images/series_color.png
:width: 400 px
:target: ../_images/series_color.png
.. |series_line_props_png| image:: images/series_line_props.png
:width: 400 px
:target: ../_images/series_line_props.png
.. |series_scale_png| image:: images/series_scale.png
:width: 300 px
:target: ../_images/series_scale.png
.. |series_report_png| image:: images/series_report.png
:width: 450 px
:target: ../_images/series_report.png
.. |Rg| replace:: R\ :sub:`g`
|