File: beyond.rst

package info (click to toggle)
kerchunk 0.2.9-4
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 135,208 kB
  • sloc: python: 6,483; makefile: 32
file content (23 lines) | stat: -rw-r--r-- 1,119 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
Beyond Python
=============

The reference files currently stored in JSON format and can be readily interpreted
in any language, as documented in detail in the :doc:`spec`.
The contents of each key
is either encoded binary data or a URL/offset/size set. It can be loaded as long
as the language being used can access the particular URL type.

To interpret the blocks as parts of a zarr dataset, the language should
have an `implementation of zarr`_, as well as whichever binary codecs the target
requires (maybe nothing for plain binary, or common compressors like gzip, but might
be more specific). You would need to write some code to expose the reference set
as a storage object that ``zarr`` can use.

.. _implementation of zarr: https://github.com/zarr-developers/zarr_implementations

One example of a reference dataset used via a JS implementation, applied to multi-scale
TIFF microscopy, can be found
at https://observablehq.com/@manzt/ome-tiff-as-filesystemreference.

GDAL (since 3.11) has also support for JSON and Parquet reference stores as
described in https://gdal.org/en/stable/drivers/raster/zarr.html