File: s2_regals.rst

package info (click to toggle)
bioxtasraw 2.4.1-1
  • links: PTS, VCS
  • area: main
  • in suites: sid
  • size: 258,948 kB
  • sloc: python: 78,311; makefile: 27; sh: 21
file content (771 lines) | stat: -rw-r--r-- 39,185 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
Advanced Series processing – Regularized Alternating Least Squares (REGALS)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. _raw_regals:

REGALS is an algorithm for deconvolution of mixtures in small angle scattering data.
It can be applied to deconvolving overlapping peaks in SEC-SAXS, changing baseline
and elution peaks in ion exchange chromatography SAXS, mixtures in time resolved
SAXS data and equilibrium titration series data, and likely other cases. In
practical application it can be thought of as an extension of the :ref:`evolving
factor analysis (EFA) <raw_efa>` technique to more complex conditions where
components are not necessarily entering and exiting the dataset in a strict
first-in-first-out approach like in SEC-SAXS. EFA is recommended for
standard SEC-SAXS data, but for more complex data, as listed above, you should
use REGALS. REGALS can also handle deconvolution of SEC-SAXS data with a
sloping baseline.

To learn more about REGALS we recommend these resources:

    *   *REGALS: a general method to deconvolve X-ray scattering data from evolving
        mixtures* S. P. Meisburger, D. Xu, and N. Ando. IUCrJ (2021). 8(2), 225-237.
        DOI: `10.1107/S2052252521000555 <https://doi.org/10.1107/S2052252521000555>`_

    *   The `source code for the REGALS algorithm on github. <https://github.com/ando-lab/regals>`_

    *   This `talk by Dr. Steve Meisburger <https://youtu.be/AO2kru097Wc>`_

If you use REGALS in RAW, in addition to citing the RAW paper, please cite the
REGALS paper: S. P. Meisburger, D. Xu, and N. Ando. IUCrJ (2021). 8(2), 225-237.
DOI: `10.1107/S2052252521000555 <https://doi.org/10.1107/S2052252521000555>`_

The written version of the tutorial follows.

Deconvolving ion exchange chromatograph coupled SAXS (IEC-SAXS) data
************************************************************************

Ion exchange chromatography is similar to size exclusion chromatography, except
that it uses a charged column media and a salt gradient to separate samples by
charge rather than size. IEC-SAXS (sometimes called anion exchange or AEX-SAXS)
can be useful in cases where there are multiple species in solution that are
not separable by size. However, the use of a salt gradient causes a changing
buffer background for the sample, making accurate buffer subtraction challenging.
REGALS can be used to deconvolve the scattering of the eluted macromolecules
from the changing scattering of the buffer.


#.  Clear all of the data in RAW. Load the **nrde_iec.hdf5** file in the
    **series_data** folder.

    *   *Note:* The data were provided by the Ando group at Cornell University,
        and were originally published in the paper: Parker, M. J. et al. (2018).
        *An endogenous dAMP ligand in Bacillus subtilis class Ib RNR promotes
        assembly of a noncanonical dimer for regulation by dATP.* Proc Natl
        Acad Sci USA 115, E4594–E4603.
        DOI: `10.1073/pnas.1800356115 <https://doi.org/10.1073/pnas.1800356115>`_.
        It is also available from the `REGALS github repository <https://github.com/ando-lab/regals>`_.

    |regals_series_plot_png|

#.  We will use REGALS to extract out the scattering components in both peaks
    of the data. Right click on the **nrde_iec.hdf5** item in the Series list.
    Select the "REGALS" option.

#.  The REGALS window will be displayed. On the left are controls, on the right are plots of
    the value of the singular values and the first autocorrelation of the left and right
    singular vectors.

    *   *Note:* Large singular values indicate significant components. What
        matters is the relative magnitude, that is, whether the value is large
        relative to the mostly flat/unchanging value of high index singular values.

    *   *Note:* A large autocorrelation indicates that the singular vector is
        varying smoothly, while a low autocorrelation indicates the vector is
        very noisy. Vectors corresponding to significant components will tend
        to have autocorrelations near 1 (roughly, >0.6-0.7) and vectors
        corresponding to insignificant components will tend to have
        autocorrelations near 0.

    |regals_panel_png|

#.  Nominally REGALS should work with either subtracted or unsubtracted data.
    Here, a buffer measured before the start of the dataset has been
    subtracted off of all the profiles in the dataset. This helps reduce the
    magnitude of the buffer components in the data, which can make it easier
    to refine the macromolecular components. We will use the full range
    of the dataset, but REGALS can also be done on restricted ranges.

    *   *Note:* Using a restricted range for REGALS might be useful to exclude
        one or more components of a complicated elution from the deconvolution.
        The more components you have the harder it is to do REGALS. There is a
        trade off in the amount of data used (more is better) and the number of
        components in the deconvolution (fewer is better) that can require
        some experimentation to find the right balance for a given dataset.

#.  RAW attempts to automatically determine how many significant singular values
    (SVs) there are in the selected range. This corresponds to the number of
    significant scattering components in solution that REGALS will attempt to
    deconvolve. At the bottom of the control panel, you should see that RAW
    thinks there are four significant SVs (scattering components) in our data.
    For this data set, that is accurate. We evaluate the number of significant
    components by how many singular values are above the baseline, and how many
    components have both left and right singular vectors with autocorrelations
    near one. For this data there are four singular values above baseline, and
    four singular vectors with autocorrelations near 1 (see step 3).

    |regals_components_png|

    *   *Note:* Typically you expect the number of significant singular values and
        the number of singular vectors with autocorrelations near 1 to be equal.
        If they aren't, it likely indicates a weak or otherwise poorly resolved
        component in the dataset. Try the deconvolution first with the lower then
        the higher number of components.

    *   *Note:* RAW can find the wrong number of components automatically. You will
        always want to double check this automatic determination against the SVD results in
        the plots. If you change the data range used (or data type), the number
        of components will not automatically update so you should check and update
        it if necessary.

#.  For REGALS there are two other settings you should check. First, set
    the experiment type. In this case, the default of 'IEC/SEC-SAXS' is
    correct. This sets default values later on in the deconvolution. Second
    specify whether you want to use the evolving factor plots to find
    initial guesses for the range of components in solution. This is useful in
    highly sampled datasets like IEC-SAXS, but may not be very accurate for
    more sparsely sampled data like a titration series, and so can be skipped.
    For this dataset leave it checked.

    |regals_exp_type_png|

#.  Click the "Next" button in the lower right-hand corner of the window to
    advance to the second stage of the REGALS analysis

    *   *Note:* If, as in this case, you are going to use the EFA plots
        to find the component ranges, it will take some time to compute
        the necessary values for this next step, so be patient.

    |regals_bkg_efa_png|

#.  A new window will open on top of the current one when you click "Next". The
    bottom window is the main REGALS window, with the forward and backward
    evolving factor plots. On top of that is the REGALS Background Components
    window. We will first use the Background Components window to estimate
    how many of our components are background components.

    *   *Note:* For the purposes of REGALS, a background component is a component
        that spans the full range of the dataset.

    *   *Note:* If you select an experiment mode other than 'IEC/SEC-SAXS'
        the Background Components window will not open automatically, but
        can still be opened by clicking the "Find background components" button
        if desired.

#.  In the Background Components window, click "Add Region" to add a region to
    the plot. The SVD of the data in that region is shown in the plots on the right.
    Set the range of that region to 0 to 100. The SVD plots show that there is
    only one strong component in the dataset in this range.

    |regals_bkg_1_png|

#.  Add a region corresponding to the last 100 frames of the dataset. This will
    appear as a second colored range on the series plot, with corresponding
    second colored sets of lines in the SVD plots on the right. Notice that
    there is just one component in the last 100 frames of the dataset.

    *   *Note:* The left autocorrelation is shown with the solid line, the right
        autocorrelation with the dashed line.

    |regals_bkg_2_png|

#.  Add several more similar regions through the initial upward sloping buffer
    region. Notice that there appears to be just one strong value throughout
    most of the buffer region, both before and after the eluted peaks. So as a
    starting point we will set the number of significant background components
    to 1, by setting the "# Significant SVs" input to 1. Once that's done,
    click the "Done" button.

    *   *Tip:* You can remove regions if they are not longer useful. To do so,
        select the region by clicking to the right of the pick button and then
        clicking the "Remove region" button.

    |regals_bkg_3_png|

#.  The Background Components window will close and the number of background
    components shown in the main REGALS window will update. The main REGALS
    window will now be fully visible.

    *   *Note:* You can reopen the Background Components window using the "Find
        background components" button if desired.

    |regals_efa_1_png|

#.  In the User Input panel, tweak the "Forward" value start frames so that the
    frame number, as indicated by the open circle on the plot, aligns with
    where the singular value first starts to rise above the baseline. This
    should be around 0, 350, 750, and 1195.

    *   *Note:* For the Forward EFA plot, SVD is run on just the first two
        frames, then the first three, and so on, until all frames in the range
        are included. As more frames are added, the singular values change, as
        shown on the plot. When a singular value starts rising above the
        baseline, it indicates that there is a new scattering component in the
        scattering profile measured at that point. So, for the first ~350
        frames, there is only one scattering components (i.e. just buffer
        scattering). At frame ~350, we see the second singular value (the
        singular value with index 1, labeled SV 1 on the plot) start to
        increase, showing that we have gained a scattering component.

    *   *Note:* One component starts above baseline, indicating there is already
        a significant scattering component at the start of our dataset. In this
        case that is the sloping buffer gradient.

#.  In the User Input panel, tweak the "Backward" value start frames so that the
    frame number, as indicated by the open circle on the plot, aligns with where
    the singular value drops back to near the baseline. This should be around
    700, 1325, 1600, and 1736.

    *   *Note:* For the Backward EFA plot, SVD is run on just the last two
        frames, then the last three, and so on, until all frames in the range
        are included. As more frames are added, the singular values change, as
        shown on the plot. When a singular value drops back to baseline, it
        indicates that a scattering component is leaving the dataset at that
        point.

    *   *Note:* One component ends above baseline, indicating there is still
        a significant scattering component at the end of our dataset. In this
        case that is the sloping buffer gradient.

    |regals_efa_2_png|

#.  Click the "Next" button in the bottom right corner to move to the last
    stage of the REGALS analysis.

    |regals_regals_1_png|

#.  This window shows controls on the left and results on the right. In the
    controls area at the top are general controls. You can adjust the number
    of components, calibrate the X axis, and set the convergence criteria for
    the REGALS algorithm. A plot of the ranges is also shown. On the bottom
    are the controls for each individual component. We won't go through all
    the possible permutations for each setting, so if you want to know more
    check out the links :ref:`at the top <raw_regals>` of this tutorial.

    *   *Note:* The ranges are automatically assigned based on the start and
        end points for components found in the EFA plots. Background components
        ranges are assigned on the principle of first-in last-out, whereas all the
        other component ranges are assigned by first-in first-out.

    *   *Note:* Components are only shown three across. Scroll down in the
        component area to see more component settings.

#.  Now we need to start refining our component ranges, and tweaking other settings
    in the components. Component 0 looks good for now, so we will start with component 1.
    On the concentration plot on the right, notice that despite the component 1
    range being defined from only 350-700, the component shows features at a
    wide range of frame numbers (index). Looking at the scattergram with the
    ranges plotted, there's no obvious elution component in the range where
    component 1 is defined. This means that component 1 is likely a background
    component as well. Since the SVD for the background components and the EFA
    plots were not clear on where the component should end, we will fit it
    from the start point we found, 350, until the end of the dataset. To do
    this, change the endpoint of the concentration range to 1736. Also uncheck
    the 'Zero at Xmax' box for this component, as it may contribute to the
    buffer scattering at the end of the dataset.

    *   *Tip:* If you return to the Background Components window, if you add
        a single range from 0 to 700 you'll see two components, indicating
        that there are likely multiple background components. However, if you
        look at the end region, assuming you avoid the tail of the peak (~1600-1736)
        there's only one component. So it's not clear where the first component might
        end and the second one start, and how much coexistence there is between
        these two background components. In this case, fitting both to end at the
        end of the dataset works well.

    |regals_regals_2_png|

#.  Because it can take a while to run, REGALS does not automatically update
    the results. To see how changing the range changed the deconvolution,
    click the "Run REGALS" button.

    *   *Note:* If lambda is automatically updated for a component, this value
        in the GUI will not be updated until you run REGALS.

    *   *Note:* When you have changes to your deconvolution settings and REGALS
        hasn't been run with those settings the "Run REGALS" button will have
        a yellow background.

    |regals_regals_3_png|

#.  There is a definite improvement in the REGALS results after rerunning
    the data. Next we will refine the protein components. We will first refine the
    ranges. On the concentration plot, notice that the component 3 (red)
    concentration is pulled down to zero pretty sharply on the left side. At
    the same point there's a peak in the component 2 (green) concentration right
    where the red one starts. There's also a small spike in the chi^2 plot around
    frame 1200, which is where component 3 starts. This indicates that we've
    restricted component 3 excessively and some of the scattering is going into
    component 2. We will change the range for component 3 to start at an
    earlier point and see how that affects the deconvolution. To do this, set
    the start of component 3 to 1150 and run REGALS.

    |regals_regals_4_png|

#.  After running REGALS, notice that the chi^2 spike is completely gone,
    and that the range 3 concentration is coming back down to zero in a more
    natural way. Also notice that the spike in component 2 concentration
    is reduced. If you look closely at the component 2 concentration you can
    see a bit of a double peak around ~1150. There could be a little bit more
    component 3 there, so we will start the component 3 range earlier. Try 1125
    and 1100 for the component 3 start.

#.  As there's minimal change between 1100 and 1125, set the component 3 start
    back to 1125.

#.  At this point the deconvolution is starting to look reasonable. There's nothing
    obviously wrong with the component ranges, so next we will adjust the lambda
    values for the components. These control the degree of smoothing in the
    deconvolution. For strong components with lots of measured profiles, like
    the peaks in this dataset, we don't need a large lambda. In fact, we might
    not need any lambda. For component 2 and 3 concentrations turn off "Auto
    lambda", and set the lambda value to 0. Then run REGALS again.

    *   *Tip:* Make sure you're setting lambda for the concentration, not
        the profile, for each component.

    |regals_regals_5_png|

#.  After running REGALS with the peak component concentration lambdas
    set to 0, the component 2 concentration (green) has a small negative dip
    at the end of the concentration range. This indicates that we should adjust
    the range for that component. In particular, we'll reduce the end point to
    try to eliminate that dip. Try 1300 and 1275 as endpoints for component
    2 concentration.

    *   *Note:* If we had set the lambdas particularly poorly, we would start to see
        the chi^2 plot deviate from ~1. Since we don't, we can conclude
        that our lambda values are probably okay.

#.  Notice that 1275 essentially eliminates the dip in the concentration.
    Set that as the endpoint for the component 2 concentration.

    *   *Try:* You can try small tweaks near 1275 if you like, but you
        shouldn't see significant changes in the concentration shape. So we'll
        leave it set at 1275 for now.

#.  The final thing we will adjust is the lambdas for the background component
    concentrations. Those concentration profiles are a bit wavy. We would
    expect the concentrations of the buffer to increase linearly, since a
    linear salt gradient was applied during elution. To increase the smoothness
    of the concentration, we will increase lambda. Turn off "Auto lambda" for
    components 0 and 1. Then use the up arrow next to the lambda box to
    increase each lambda by an order of magnitude. Run REGALS to see how this
    affects the deconvolution.

    *   *Note:* Generally you want to adjust lambda by an order of magnitude
        or more. Smaller adjustments will have minimal effect on the deconvolution.

    *   *Tip:* You can also type a new lambda value directly into the box.

    |regals_regals_6_png|

#.  There are two major changes with the new lambdas. First, the high q of
    the scattering profiles for ranges 2 and 3 will get more similar. Second, the
    concentration for ranges 0 and 1 will get smoother. The change in high q
    comes from a decrease in the high q values component 2, while component 3
    stays mostly the same. This implies that more of the buffer scattering is
    getting picked out component 2. Keep increasing both buffer component
    lambdas by an order of magnitude until this stops changing. You will
    also see a sudden and dramatic change in the component 1 profile at
    some point.

#.  Once you see the high q backgrounds match and the large change in the
    scattering profile for component 1, it means you're oversmoothing
    the buffer components. Reduce the lambda for both buffer components to
    the last good value, which would be ~4e8.

    |regals_regals_7_png|

#.  This is a more or less optimum solution for REGALS deconvolution for this
    dataset. Since we're satisfied, we can now save the results. First,
    click on the "Save REGALS data (not profiles)" button in the bottom
    right corner and save. This saves a spreadsheet (.csv) file with all
    of the information from REGALS that isn't the scattering profiles,
    including the component settings and concentration and chi^2 vs. frame
    data.

#.  Finally, click the "Done" button to close the REGALS window and send the
    deconvolved scattering profiles to the Profiles Plot.

#.  In the main RAW window, go to the Profiles control tab and the Profiles
    plot. There you should see the deconvolved profiles. The labels _0, _1,
    _2, and _3 correspond to the 0, 1, 2, and 3 components from REGALS.

    |regals_profiles_png|



Deconvolving equilibrium titration series SAXS data
************************************************************************

Titration series SAXS data looks at the change in scattering profile of a
macromolecule as a substance (salt, substrate, another macromolecule, etc.)
is titrated in or our of the solution. It is done in equilibrium, so buffers
are prepared ahead of time and samples are equilibrated in the buffer, then
measured. These measurements are typically done in batch mode SAXS (without
in-line separation from SEC), and often involve transitions between different
conformations or oligomeric states, and so the measured scattering at a
given titration point is not from a homogeneous and monodisperse sample.
We can use REGALS to deconvolve the different scattering components in the
titration series to get pure profiles for each component.

#.  Clear all of the data in RAW. Load the **pheh_titration.hdf5** file in the
    **series_data** folder.

    *   *Note:* This data are a titration series of phenylalanine (L-phe) into a
        sample of phenylalanine hydroxylase (PheH). 16 different concentration
        points were collected, ranging from 0 to 80 mM L-phe. The data are
        already background subtracted, using buffers containing a matching
        concentration of L-phe. A conformational change has previously been
        observed on PheH binding L-phe. A small amount of aggregation was also
        observed at all concentrations, preventing the use of saturated
        endpoints in the titration series to completely determine each conformational
        state in batch mode experiments. We will deconvolve both conformations
        and the aggregate scattering from the titration series. `Prior analysis
        of this data without the use of REGALS is published
        <http://dx.doi.org/10.1021/jacs.6b01563>`_.

    *   *Note:* The data were provided by the Ando group at Cornell University,
        and were originally published in the paper: Meisburger, S. P. et al. (2016).
        *Domain Movements upon Activation of Phenylalanine Hydroxylase
        Characterized by Crystallography and Chromatography-Coupled
        Small-Angle X-ray Scattering.* J Am Chem Soc 138, 6506–6516.
        DOI: `10.1021/jacs.6b01563 <https://doi.org/10.1021/jacs.6b01563>`_.
        It is also available from the `REGALS github repository
        <https://github.com/ando-lab/regals>`_.

    |regals_pheh_series_plot_png|

#.  We will use REGALS to extract out the scattering of the individual
    conformers and the aggregate in the titration series. Right click on the
    **pheh_titration.hdf5** item in the Series list. Select the "REGALS" option.

#.  Normally we use the plots of singular values and autocorrelations to determine
    the number of significant singular values (i.e. scattering components) in
    the dataset. Based on the plots of this titration series, there are
    ~4-5 significant singular values (SVs), for reference RAW's automated
    method found 4 such values. However, prior knowledge indicates there
    are only two conformations we care about, and we suspect the other components
    are various aggregates. So we will fit the data to three components:
    each conformation and a single aggregate scattering component. Set the
    "# Significant SVs" to 3.

#.  Set the experiment type to "Titration". This affects the default settings
    later in the deconvolution.

#.  Because we have so few data points, and we're using fewer components than
    the actual number of significant SVs, trying to find the component ranges
    using the EFA plots will not be useful. Uncheck the "Use EFA to find
    component ranges" box.

    |regals_pheh_exp_settings_png|

#.  Click the "Next" button in the lower right-hand corner of the window to
    advance to the final stage of the REGALS analysis.

    *   *Note:* In this case, since we aren't determining the component ranges
        with EFA, REGALS is not automatically run initially. We will have to
        set the component settings, then run REGALS.

    |regals_pheh_regals_1_png|

#.  Notice that the profile part of each component has a "realspace" regularizer.
    This means that instead of constraining the data in q space via the
    scattering profile we will be constraining the data in real space using
    the P(r) function. So in addition to setting component ranges for the
    concentration and the lambda values, we will also set the |Dmax| value for
    the P(r) function.

    |regals_pheh_regals_2_png|

#.  The titration concentrations are not equally spaced from 0 to 80 mM, so we
    will calibrate the X axis appropriately. Click the "Calibrate X axis" button.

    |regals_pheh_regals_3_png|

#.  In the window that opens click the "Load X values from file" button.

    |regals_pheh_regals_4_png|

#.  Select the **pheh_titration_conc.txt** file in the **series_data** folder and
    load it in.

    *   *Note:* A calibration file should consist of a single column of the
        concentration values, in the order that the profiles were loaded into
        the series. So the first line of the file has the concentration for
        the first profile in the series, and so on.

#.  Upon loading you should see the values in the X column update.

    *   *Note:* The concentrations are in uM, so you will see concentrations
        from 0 to 80000.

#.  The concentration values are not linearly spaced, as is typical of a
    titration series. It is also most common to work with titration series
    on a logarithmic concentration scale. To do this, we need to define
    the first concentration as not zero (as log(0) is undefined). For this
    data we will set the concentration to 10 uM. Double click in the first box
    in the X column and enter 10.0.

    *   *Note:* You can enter all the concentrations manually if you want, you
        don't have to create the concentration file we loaded in the previous
        steps. However, for a long series this can get a bit tedious.

    |regals_pheh_regals_5_png|

#.  As we want to work with our data in log space, select "Log10(X)" in the
    "Use for X axis" list. Then click the OK button to exit the window and
    save the X calibration.

    |regals_pheh_regals_6_png|

#.  In the main REGALS window, notice that the X axis is now calibrated as
    Log(concentration), and that the ranges for the concentration components
    have updated accordingly.

#.  Next we will set the ranges for our components. We will use components 0
    and 1 as the different conformations, resting and active respectively,
    and component 2 as the aggregate. Prior analysis of the system showed that
    of the two conformations, only the resting conformation, component 0 is
    present at 0 mM L-phe. So set the "Zero at Xmin" to True for component 1.

#.  Prior analysis showed that features of the scattering profile associated
    with the active state saturated above 3 mM. So we will assume that there
    is no contribution to the scattering from the resting state at/above 3 mM
    (3.48 on the log10(X) axis). Set the end range for component 0 to 3.48,
    and apply a Zero at Xmax boundary condition to it.

    |regals_pheh_regals_7_png|

#.  We will apply not constraints to the concentration range of the aggregate,
    so we are done with the concentration ranges. Next we need to set the
    |Dmax| values for the components. We'll start with the aggregate. Since we
    have no prior knowledge about the aggregate, we will assume it is a non-specific
    size. Because of the q range of the data, the largest dimension of an object
    that can be measured is ~300 Å, based on the Shannon limit of
    :math:`D_{max}<\pi/q_{min}`. So we will set the |Dmax| value for the
    aggregate, component 2, to 300.

    |regals_pheh_regals_8_png|

#.  We will now run REGALS to get an initial look at the deconvolution.
    Click the "Run REGALS" button.

    |regals_pheh_regals_9_png|

#.  Looking at the results, it's already a reasonable deconvolution. The
    concentrations make sense with previous knowledge, e.g. that there's
    a transition from the resting (component 0) to active (component 1)
    with increasing L-phe concentration, and that there's a low level
    of aggregate throughout that increases significantly at higher
    concentrations of L-phe. The profiles and P(r) functions, while not
    completely correct, are at least reasonable shapes, and the chi^2
    is mostly relatively low.

    *   *Note:* On the concentration plot, the markers are the concentrations
        calculated at the titration points. The smooth lines are the
        concentration calculated at the regularlizer grid points, so it is
        effectively interpolated between the measured titration points.
        When you have more than 40 profiles in the series, only the concentration
        at the actual measured points is shown, and there it is shown as
        a continuous line, not individual points (as in the above IEC-SAXS
        example).

#.  Next we will refine the |Dmax| values for the resting and active states.
    On the P(r) plot, notice that for both components the P(r) function
    is forced to zero sharply, indicating that an underestimated |Dmax| value.
    Increase both |Dmax| values to 110 and run REGALS again.

    |regals_pheh_regals_10_png|

#.  Notice that at a |Dmax| of 110 the P(r) functions are still somewhat
    forced to zero, but the chi^2 is lower, so this has improved the
    deconvolution. Continue increasing |Dmax| in steps of 10-20 until you
    reach 160. You should notice several things. First, there's a range
    from ~130-150 where the chi^2 is relatively stable, indicating |Dmax|
    values in those ranges all provide relatively good fits to the data.
    Second, as you increase |Dmax| the concentration of the aggregate
    decreases, particularly in the lower titration concentrations. This implies
    that having a larger |Dmax| is letting that component take up some of the
    aggregate scattering. Third, after ~120 the P(r) functions stop looking
    as forced to zero. Fourth at 160 the chi^2 starts noticeably increasing,
    indicating that's too large for the |Dmax| value. Based on this, we want
    to pick a |Dmax| value near 130, to exclude as much of the aggregate as we
    can while still getting good P(r) functions and chi^2 values.

#.  Set the |Dmax| of both components to 130 and run REGALS.

    *   *Note:* The |Dmax| values found by previous analysis was ~130, so this
        validates our choice of |Dmax|.

    *   *Note:* The |Dmax| values for different components won't necessarily
        agree. In this case they happen to.

    *   *Tip:* If it takes a while to run REGALS every time you change a component,
        you can speed up the convergence by starting with the previous results.
        To do so, you would check the "Start with previous results" box. Then
        change the convergence criteria to "Iterations" and set the number of
        iterations to 10. This will allow you to quickly iterate on changes like
        |Dmax|, as long as the magnitude of the change is relatively small.
        Just be sure to set the convergence criteria back to the default (not
        using previous results, and Chi^2 with 1000 iterations) to do your
        final REGALS run.

#.  The REGALS deconvolution is now as optimized as we can make it. Notice
    that there's still a relatively high chi^2 value for the last frame. This
    indicates that the aggregate may be changing shape as the amount increases,
    and so a single component cannot fit all the data well. You can redo the
    deconvolution without the last data point if you want, the results are very similar
    albeit without the chi^2 spike for the last point. Since we're satisfied
    with these results we can now save them. First, click on "Save REGALS data
    (not profile)" and save.

    *   *Note:* Among other things, this .csv file contains the P(r) functions,
        and the smoothed concentration curves (lines on the concentration plot).
        In this case the smoothed concentration curves are particularly useful
        because the protein changes shape but not size. As the P(r) functions
        are normalized to I(0), the concentration curves for both components
        are on the same overall scale. This means they differ by simply a
        uniform scale factor from the true concentrations (e.g. in mg/ml),
        and so could be useful to help characterize the two state transition.
        This will not always be true, such as if you're characterizing an
        oligomerization reaction.

    |regals_pheh_regals_11_png|

#.  Finally, click the "Done" button to close the REGALS window and send the
    deconvolved scattering profiles to the profiles plot.



General notes
****************

#.  It is okay to mix and match different types of regularizers (e.g. have
    both smooth and real space profile regularizers) for the same series.

#.  You can change the regularizers away from the defaults for a given
    experiment type.

#.  The REGALS examples shown here were chosen to demonstrate the features
    of the GUI. REGALS is not restricted in application to just IEC-SAXS
    and titration data. It has been successfully applied to time resolved
    SAXS data (similar to the titration series example), and we expect it will
    be applied to a range of other types of experiments as well.

#.  As you saw in the PheH titration series example, it is quite useful, and
    sometimes necessary, to have additional information to input to the
    deconvolution, such as the range of the components, or a known maximum
    dimension. One of the advantages of REGALS is that it can incorporate
    these additional pieces of information to improve the deconvolution.

#.  By default, RAW bins the profiles before doing an SVD and calculating the
    evolving factor plots, in order to speed up the process. The final REGALS
    rotation is done on the full unbinned dataset. You can turn binning on and
    off and adjust  the binning parameters in the Series options panel in the
    Advanced Options window.

.. |regals_series_plot_png| image:: images/regals_series_plot.png
    :target: ../_images/regals_series_plot.png

.. |regals_panel_png| image:: images/regals_panel.png
    :target: ../_images/regals_panel.png

.. |regals_components_png| image:: images/regals_components.png
    :target: ../_images/regals_components.png

.. |regals_exp_type_png| image:: images/regals_exp_type.png
    :target: ../_images/regals_exp_type.png
    :width: 300 px

.. |regals_bkg_efa_png| image:: images/regals_bkg_efa.png
    :target: ../_images/regals_bkg_efa.png

.. |regals_bkg_1_png| image:: images/regals_bkg_1.png
    :target: ../_images/regals_bkg_1.png

.. |regals_bkg_2_png| image:: images/regals_bkg_2.png
    :target: ../_images/regals_bkg_2.png

.. |regals_bkg_3_png| image:: images/regals_bkg_3.png
    :target: ../_images/regals_bkg_3.png

.. |regals_efa_1_png| image:: images/regals_efa_1.png
    :target: ../_images/regals_efa_1.png

.. |regals_efa_2_png| image:: images/regals_efa_2.png
    :target: ../_images/regals_efa_2.png
    :width: 175 px

.. |regals_regals_1_png| image:: images/regals_regals_1.png
    :target: ../_images/regals_regals_1.png

.. |regals_regals_2_png| image:: images/regals_regals_2.png
    :target: ../_images/regals_regals_2.png
    :width: 250 px

.. |regals_regals_3_png| image:: images/regals_regals_3.png
    :target: ../_images/regals_regals_3.png
    :width: 250 px

.. |regals_regals_4_png| image:: images/regals_regals_4.png
    :target: ../_images/regals_regals_4.png

.. |regals_regals_5_png| image:: images/regals_regals_5.png
    :target: ../_images/regals_regals_5.png
    :width: 250 px

.. |regals_regals_6_png| image:: images/regals_regals_6.png
    :target: ../_images/regals_regals_6.png
    :width: 250 px

.. |regals_regals_7_png| image:: images/regals_regals_7.png
    :target: ../_images/regals_regals_7.png

.. |regals_profiles_png| image:: images/regals_profiles.png
    :target: ../_images/regals_profiles.png

.. |regals_pheh_series_plot_png| image:: images/regals_pheh_series_plot.png
    :target: ../_images/regals_pheh_series_plot.png

.. |regals_pheh_exp_settings_png| image:: images/regals_pheh_exp_settings.png
    :target: ../_images/regals_pheh_exp_settings.png
    :width: 300 px

.. |regals_pheh_regals_1_png| image:: images/regals_pheh_regals_1.png
    :target: ../_images/regals_pheh_regals_1.png

.. |regals_pheh_regals_2_png| image:: images/regals_pheh_regals_2.png
    :target: ../_images/regals_pheh_regals_2.png
    :width: 250 px

.. |regals_pheh_regals_3_png| image:: images/regals_pheh_regals_3.png
    :target: ../_images/regals_pheh_regals_3.png
    :width: 250 px

.. |regals_pheh_regals_4_png| image:: images/regals_pheh_regals_4.png
    :target: ../_images/regals_pheh_regals_4.png

.. |regals_pheh_regals_5_png| image:: images/regals_pheh_regals_5.png
    :target: ../_images/regals_pheh_regals_5.png

.. |regals_pheh_regals_6_png| image:: images/regals_pheh_regals_6.png
    :target: ../_images/regals_pheh_regals_6.png

.. |regals_pheh_regals_7_png| image:: images/regals_pheh_regals_7.png
    :target: ../_images/regals_pheh_regals_7.png

.. |regals_pheh_regals_8_png| image:: images/regals_pheh_regals_8.png
    :target: ../_images/regals_pheh_regals_8.png
    :width: 250 px

.. |regals_pheh_regals_9_png| image:: images/regals_pheh_regals_9.png
    :target: ../_images/regals_pheh_regals_9.png

.. |regals_pheh_regals_10_png| image:: images/regals_pheh_regals_10.png
    :target: ../_images/regals_pheh_regals_10.png
    :width: 300 px

.. |regals_pheh_regals_11_png| image:: images/regals_pheh_regals_11.png
    :target: ../_images/regals_pheh_regals_11.png

.. |Dmax| replace:: D\ :sub:`max`