1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136
|
.. currentmodule:: h5py
.. _attributes:
Attributes
==========
Attributes are a critical part of what makes HDF5 a "self-describing"
format. They are small named pieces of data attached directly to
:class:`Group` and :class:`Dataset` objects. This is the official way to
store metadata in HDF5.
Each Group or Dataset has a small proxy object attached to it, at
``<obj>.attrs``. Attributes have the following properties:
- They may be created from any scalar or NumPy array
- Each attribute should be small (generally < 64k)
- There is no partial I/O (i.e. slicing); the entire attribute must be read.
The ``.attrs`` proxy objects are of class :class:`AttributeManager`, below.
This class supports a dictionary-style interface.
By default, attributes are iterated in alphanumeric order. However,
if group or dataset is created with ``track_order=True``, the
attribute insertion order is remembered (tracked) in HDF5 file, and
iteration uses that order. The latter is consistent with Python 3.7+
dictionaries.
The default ``track_order`` for all new groups and datasets can be
specified globally with ``h5.get_config().track_order``.
Large attributes
----------------
HDF5 allows attributes to be larger than 64 KiB, but these need to be stored in
a different way. As of March 2024, the way HDF5 documentation suggests you
configure this does not work. Instead, enable order tracking when creating the
object you want to attach attributes to::
grp = f.create_group('g', track_order=True)
grp.attrs['large'] = np.arange(1_000_000, dtype=np.uint32)
Reference
---------
.. class:: AttributeManager(parent)
AttributeManager objects are created directly by h5py. You should
access instances by ``group.attrs`` or ``dataset.attrs``, not by manually
creating them.
.. method:: __iter__()
Get an iterator over attribute names.
.. method:: __contains__(name)
Determine if attribute `name` is attached to this object.
.. method:: __getitem__(name)
Retrieve an attribute.
.. method:: __setitem__(name, val)
Create an attribute, overwriting any existing attribute. The type
and shape of the attribute are determined automatically by h5py.
.. method:: __delitem__(name)
Delete an attribute. KeyError if it doesn't exist.
.. method:: keys()
Get the names of all attributes attached to this object.
:return: set-like object.
.. method:: values()
Get the values of all attributes attached to this object.
:return: collection or bag-like object.
.. method:: items()
Get ``(name, value)`` tuples for all attributes attached to this object.
:return: collection or set-like object.
.. method:: get(name, default=None)
Retrieve `name`, or `default` if no such attribute exists.
.. method:: get_id(name)
Get the low-level :class:`AttrID <low:h5py.h5a.AttrID>` for the named
attribute.
.. method:: create(name, data, shape=None, dtype=None)
Create a new attribute, with control over the shape and type. Any
existing attribute will be overwritten.
:param name: Name of the new attribute
:type name: String
:param data: Value of the attribute; will be put through
``numpy.array(data)``.
:param shape: Shape of the attribute. Overrides ``data.shape`` if
both are given, in which case the total number of
points must be unchanged.
:type shape: Tuple
:param dtype: Data type for the attribute. Overrides ``data.dtype``
if both are given.
:type dtype: NumPy dtype
.. method:: modify(name, value)
Change the value of an attribute while preserving its type and shape.
Unlike :meth:`AttributeManager.__setitem__`, if the attribute already
exists, only its value will be changed. This can be useful for
interacting with externally generated files, where the type and shape
must not be altered.
If the attribute doesn't exist, it will be created with a default
shape and type.
:param name: Name of attribute to modify.
:type name: String
:param value: New value. Will be put through ``numpy.array(value)``.
|