1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53
|
.. _merging:
Merging Datasets
================
If several of your files describe the same items, you can merge them into a
single Glue :class:`~glue.core.data.Data` object. Examples of files that make
sense to merge together include:
- Two or more images that are pixel-aligned to each other
- Several catalogs whose rows describe the same objects
Why merge?
----------
For multi-dimensional visualizations (like a scatter plot, or an RGB image),
merging datasets allows you to combine attributes from two different files
into a single visualization. It also guarantees that any subset defined
using attributes from one file can be applied to the entries in another file.
Merging vs Linking
------------------
Merging is a different operation than :ref:`linking <linking>`. The easiest
way to appreciate the difference is to think of spreadsheet-like data.
In Glue, linking two datasets defines a conceptual relationship between
the **columns** of a spreadsheet (e.g., two spreadsheets have a column
called "age", but row N describes a different object in each spreadsheet).
Merging, on the other hand, indicates that two spreadsheets are pre-aligned
along each **row** (e.g. row N describes the same item in every spreadsheet, but
the columns of each spreadsheet might be different).
Merging collapses several datasets into a single dataset, while
linking keeps each dataset separate.
How to merge datasets
---------------------
Whenever you load a file whose shape matches a pre-existing dataset,
Glue will ask you if you want to merge them into a single object.
If you choose not to merge at this time, you can merge later
by highlighting the relevant datasets in the left panel, right-clicking,
and selecting ``Merge datasets``.
To merge datasets programmatically, use the :meth:`DataCollection.merge
<glue.core.data_collection.DataCollection.merge>` method.
.. note::
Datasets should only be merged if each element describes the same item
in each file. Consequently, all merged datasets must have the same
number of elements.
|