1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215
|
# PURIFY
[](https://github.com/astro-informatics/purify/actions/workflows/ci.yml?query=branch%3Adevelopment+)
[](https://codecov.io/gh/astro-informatics/purify)
[](https://doi.org/10.5281/zenodo.2555252)
## Description
**PURIFY** is an open-source collection of routines written in `C++` available under the [license](#license) below. It implements different tools and high-level to perform radio interferometric imaging, _i.e._ to recover images from the Fourier measurements taken by radio interferometric telescopes.
**PURIFY** leverages recent developments in the field of compressive sensing and convex optimization. Low-level functionality to solve the resulting convex optimisation is factored into the open-source companion code, [**SOPT**](https://github.com/astro-informatics/sopt), also written by the authors of **PURIFY**. For further background please see the [reference](#references-and-citation) section.
This documentation outlines the necessary and optional [dependencies](#dependencies-installation) upon which **PURIFY** should be built, before describing [installation](#installing-and-building-PURIFY), [testing](#testing) and [usage](#usage) details. [Contributors](#contributors), [references](#references-and-citation) and [license](#license) information then follows.
## Dependencies installation
**PURIFY** is written in `C++11`. Required software and libraries, and their minimum supported versions, are listed below. The build system will attempt to automatically download and build the automatically included libraries. (an internet connection is required for this).
`C++` dependencies:
- [CMake](http://www.cmake.org/) v3.5.1 A free software that allows cross-platform compilation
- [GCC](https://gcc.gnu.org) v7.3.0 GNU compiler for `C++`
- [OpenMP](http://openmp.org/wp/) v4.8.4 - Optional - Speeds up some of the operations.
- [MPI](https://www.open-mpi.org) v3.1.1 - Optional - Parallelisation paradigm to speed up operations.
- [astro-informatics/sopt](https://github.com/astro-informatics/sopt) v4.1.0: Sparse Optimization
Compressed Sensing library.
- [Boost](https://www.boost.org/) v1.78.0: A set of free peer-reviewed
portable C++ libraries.
- [fftw3](www.fftw.org) v3.3.9: Fastest Fourier Transform in the West.
- [Eigen3](http://eigen.tuxfamily.org/index.php?title=Main_Page) v3.3.7: Modern `C++` linear algebra.
- [tiff](http://www.libtiff.org/) v4.0.9: Tag Image File Format library.
- [cfitsio](http://heasarc.gsfc.nasa.gov/fitsio/fitsio.html): v4.0.0: Library of `C` and `Fortran` subroutines for reading and writing data files in FITS (Flexible Image Transport System) data format.
- [yaml-cpp](https://github.com/jbeder/yaml-cpp) v0.6.3: YAML parser and emitter in `C++`.
- [casacore](http://casacore.github.io/casacore/) - Optional - Needed to interface with measurement
- [ONNXruntime](https://onnxruntime.ai/) v1.17.1 - Optional - a cross-platform runtime engine based on the Open Neural Network eXchange format.
sets.
- [Catch2](https://github.com/catchorg/Catch2) v2.13.9: Optional - A `C++`
unit-testing framework only needed for testing.
- [google/benchmark](https://github.com/google/benchmark) v1.6.0: Optional - A `C++`
micro-benchmarking framework only needed for benchmarks.
For examples on how to install dependencies on Ubuntu and MacOS, see the
[cmake.yml](https://github.com/astro-informatics/purify/blob/development/.github/workflows/cmake.yml).
## Installing and building PURIFY
If the dependencies are already available on your system, you can also install **PURIFY** manually like so
``` bash
cd /path/to/code
mkdir build
cd build
cmake .. -DCMAKE_INSTALL_PREFIX=${PWD}/../local
make -j
make -j install
```
On MacOS, you can also install most of the dependencies with Homebrew e.g.
``` bash
brew install boost fftw eigen yaml-cpp catch2 [onnxruntime]
```
### Machine-learning models
The **SOPT** library includes an interface to ONNXrt for using trained models
as priors in the Forward-Backward optimization algorithm. To build **PURIFY** with
ONNXrt capability, you need to enable `ONNXrt` support also in **SOPT** using
the `onnxrt` option when running the `cmake` command.
## Testing
To check everything went all right, run the test suite:
```
cd /path/to/purify/build
ctest .
```
## Usage
The main `purify` executable lives either in the build directory or in the in the `bin` subdirectory
of the installation directory. `purify` has one required argument, it a string for the file path of the config file containing the settings.
`purify path/to/config.yaml`.
A [template with a description of the settings](https://github.com/astro-informatics/purify/blob/development/data/config/config.yaml)
is included in the `data/config` directory.
When `purify` runs a directory will be created, and the output images will be
saved and time-stamped. Additionally, a config file with the settings used will
be saved and time-stamped, helping for reproducibility and book-keeping.
Uncertainty Quantification
--------------------------
Bayesian hypothesis testing may be performed with the `purify_UQ` application, which can be located in the `build` directory.
This application takes a config yaml file with the following parameters:
- `confidence_interval` or `alpha`. (alpha = 1 - confidence interval.))
- `measurements_path`: path to measurements data (.vis file)
- `reference_image_path`: path to reference image i.e. output from purify run. (.fits files)
- `surrogate_image_path`: path to surrogate image i.e. doctored image with blurring or structural change that you want to test. (.fits file)
- `sigma`: standard deviation for Gaussian likelihood.
- `gamma`: multiplicative factor for prior.
- `purify_config`: path to purify config used to generate the reference image. **This should be used if available in order to ensure consistency of things like measurement and wavelet operators.**
You can then run the uncertainty quantification with the command:
```
purify_UQ <path to UQ_config yaml>
```
The application will report the value of the objective function for each image, the threshold value calculated from the reference image, and whether the surrogate image is ruled out or not.
Presently this is designed to work for the unconstrained problem where:
- The negative log-likelihood is a scaled L2-norm i.e. $ \frac{1}{2 \sigma^2} \sum (y_i - \Phi x_i)$ for some data $y$, image $x$, and measurement operator $\Phi$. (Equivalent to indepdendent multivariate Gaussian likelihood.)
- The negative log prior is a scaled L1-norm in _wavelet space_ i.e. $\gamma \sum (\Psi^\dag x)_i$ for some image $x$ and wavelet operator $\Psi$.
- The objective function is the sum of these two terms.
Docker
-------
## Debugging the CI workflow with tmate
The CI workflow has a manual dispatch trigger which allows you to log into the job while it's running. You can trigger it in
[actions](https://github.com/astro-informatics/purify/actions/workflows/cmake.yml).
Run the workflow and set `debug_enabled=true` to enable the `tmate` step in the CI workflow. Once the workflow is running, open the job in actions.
You should see it printing out a line with a `ssh` command. Run it in terminal to log into the GitHub Actions runner.
## Docker
A Dockerfile is available on DockerHub. We are currently not maintaining it, and cannot
guarantee it is up to date. Use the below documentation at your own risk.
If you want to use Docker instead, you can build an image using the Dockerfile
available in the repository or pulling it from
[DockerHub](https://hub.docker.com/r/uclrits/purify).
```
docker build -t purify .
```
or
```
docker pull uclrits/purify
```
Then to use it, you should mount the directory with your data and config files
to `/mydata` in the container. To run the container and mount the directory is
with:
```
docker run -it --name purify -v /full/path/to/data:/mydata uclrits/purify
```
That will start a shell inside the container in the `/mydata` directory where
you can see all the files from your `/full/path/to/data`. There you can run
`purify` as shown above.`
## Contributors
Check the [contributors](@ref purify_contributors) page ([github](cpp/docs/PURIFY_CONTRIBUTORS.md)).
## References and citation
If you use **PURIFY** for work that results in publication, please reference the [webpage](#webpage) and our related academic papers:
1. L. Pratley, et al.
"Distributed convex optimization for Radio Interferometry with PURIFY".
Link will be here soon!
2. L. Pratley, M. Johnston-Hollitt, J. D. McEwen,
"A fast and exact w-stacking and w-projection hybrid algorithm for wide-field interferometric imaging".
Submitted to ApJ [arXiv:1807.09239](https://arxiv.org/abs/1807.09239)
3. L. Pratley, J. D. McEwen, M. d'Avezac, R. E. Carrillo, A. Onose, Y. Wiaux. "Robust sparse image
reconstruction of radio interferometric observations with PURIFY".
Accepted (2016) [arxiv:1610.02400](https://arxiv.org/abs/1610.02400v2)
4. A. Onose, R. E. Carrillo, A. Repetti, J. D. McEwen, J.-P. Thiran, J.-C. Pesquet, and Y. Wiaux.
"Scalable splitting algorithms for big-data interferometric imaging in the SKA era" _Mon. Not.
Roy. Astron. Soc._ **462(4):4314-4335** (2016) [arXiv:1601.04026](http://arxiv.org/abs/arXiv:1601.04026)
5. R. E. Carrillo, J. D. McEwen and Y. Wiaux. "PURIFY: a new approach to radio-interferometric
imaging". _Mon. Not. Roy. Astron. Soc._ **439(4):3591-3604** (2014) [arXiv:1307.4370](http://arxiv.org/abs/1307.4370)
## License
> PURIFY Copyright (C) 2013-2019
>
> This program is free software; you can redistribute it and/or modify it
> under the terms of the GNU General Public License as published by the
> Free Software Foundation; either version 2 of the License, or (at your
> option) any later version.
>
> This program is distributed in the hope that it will be useful, but
> WITHOUT ANY WARRANTY; without even the implied warranty of
> MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General
> Public License for more details (LICENSE.txt).
>
> You should have received a copy of the GNU General Public License along
> with this program; if not, write to the Free Software Foundation, Inc.,
> 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.
## Webpage
- [Documentation](http://astro-informatics.github.io/purify)
- [Repository](https://github.com/astro-informatics/purify)
## Support
For any questions or comments, feel free to contact [Jason McEwen](http://www.jasonmcewen.org), or add
an issue to the [issue tracker](https://github.com/astro-informatics/purify/issues).
## Notes
The code is given for educational purpose. The code is in beta and still under development.
|