1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64
|
.. note that if this is changed from the default approach of using an *include*
(in index.rst) to a separate performance page, the header needs to be changed
from === to ***, the filename extension needs to be changed from .inc.rst to
.rst, and a link needs to be added in the subpackage toctree
.. _astropy-units-performance:
Performance Tips
================
If you are attaching units to arrays to make |Quantity| objects, multiplying
arrays by units will result in the array being copied in memory, which will slow
things down. Furthermore, if you are multiplying an array by a composite unit,
the array will be copied for each individual multiplication. Thus, in the
following case, the array is copied four successive times::
In [1]: import numpy as np
In [2]: from astropy import units as u
In [3]: rng = np.random.default_rng()
In [4]: array = rng.random(10000000)
In [5]: %timeit array * u.m / u.s / u.kg / u.sr
92.5 ms ± 2.52 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
There are several ways to speed this up. First, when you are using composite
units, ensure that the entire unit gets evaluated first, then attached to the
array. You can do this by using parentheses as for any other operation::
In [6]: %timeit array * (u.m / u.s / u.kg / u.sr)
21.5 ms ± 886 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
In this case, this has sped things up by a factor of 4. If you use a composite
unit several times in your code then you can define a variable for it::
In [7]: UNIT_MSKGSR = u.m / u.s / u.kg / u.sr
In [8]: %timeit array * UNIT_MSKGSR
22.2 ms ± 551 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
In this case and the case with brackets, the array is still copied once when
creating the |Quantity|. If you want to avoid any copies altogether, you can
make use of the ``<<`` operator to attach the unit to the array::
In [9]: %timeit array << u.m / u.s / u.kg / u.sr
47.1 µs ± 5.77 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
Note that these are now **microseconds**, so this is 2000x faster than the
original case with no brackets. Note that brackets are not needed when using
``<<`` since ``*`` and ``/`` have a higher precedence, so the unit will be
evaluated first. When using ``<<``, be aware that because the data is not being
copied, changing the original array will also change the |Quantity| object.
Note that for composite units, you will definitely see an
impact if you can pre-compute the composite unit::
In [10]: %timeit array << UNIT_MSKGSR
6.51 µs ± 112 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
Which is over 10000x faster than the original example. See
:ref:`astropy-units-quantity-no-copy` for more details about the ``<<``
operator.
|