Oven logo

Oven

Published

spatial-image

pip install spatial-image

Package Downloads

Weekly DownloadsMonthly Downloads

Project URLs

Requires Python

>=3.8

spatial-image

Test DOI

A multi-dimensional spatial image data structure for scientific Python.

To facilitate:

  • Multi-scale processing and analysis
  • Registration
  • Resampling
  • Subregion parallel processing
  • Coupling with meshes, point sets, and annotations

with scientific images, which are typically multi-dimensional with anisotropic sampling, this package provides a spatial-image data structure. In addition to an N-dimensional array of pixel values, spatial metadata defines the location of the pixel sampling grid in space time. We also label the array dimensions. This metadata is easily utilized and elegantly carried through image processing pipelines.

This package defines spatial image metadata, provides a function, is_spatial_image, to verify the expected behavior of a spatial image instance, and provides a reference function, to_spatial_image to convert an array-like, e.g. a NumPy ndarray or a Dask array, to a spatial image.

The spatial-image data structure is implemented with Xarray, a library for N-D labeled arrays and datasets in Python. The Xarray library is well-tested, relatively mature, and integrates well with scientific Python ecosystem tooling. The Xarray library leverages NumPy and pandas for labeled array indexing, integrates well with machine-learning libraries utilizing the scikit-learn interface, integrates with Dask for distributed computing, and zarr for serialization.

In essence, a spatial image is an xarray.DataArray with a defined set of dims labels, {'c', 'x', 'y', 'z', 't'}, constraints on the coords, to enforce uniform spacing in a given direction, and defined set of additional metadata attrs.

Installation

pip install spatial-image

Definitions

Data Dimensions

A spatial image's xarray dims belong to the set: {'c', 'x', 'y', 'z', 't'}. These dimensions are:

c
Component / channel dimension.
x
First spatial dimension.
y
Second spatial dimension.
z
Third spatial dimension.
t
Time dimension.

Axis attributes

Each dim has an axis with additional attributes to describe the dimension.

long_name
A descriptive name for the axis, e.g. anterior-posterior or x-axis. Defaults to the dim name.
units
Units for the axis, e.g. millimeters. Defaults to the empty string.

Coordinates

A spatial image's Xarray coords specify the spatial location of pixels in the image for the 'x', 'y', and 'z' data dimensions. For the 'c' and 't' data dimensions, component identities and timestamps can optionally be provided.

Spatial coordinates define the position in the coordinate reference frame of the image. In general, the image's coordinate reference frame may be different from the world coordinate reference frame.

Pixels are sampled on a uniform, possibly anisotropic, spatial grid. Spatial coordinates have a 64-bit float type. The difference between adjacent coordinates, i.e. the pixel spacing, for a dimension must be uniform. The first coordinate value defines the origin or offset of an image.

The component or channel dimension coordinates defaults to a sequence of integer identifiers but can be strings describing the channels, e.g. ['r', 'g', 'b'].

The time coordinates can have integer, float, or datetime64 type.

Motivational Notes

  • Image-axis-aligned Cartesian coordinate reference frames enable Pythonic subscripting in processing pipelines on xarray.DataArray's. When indexing with slices, the same slices are applied to the multi-dimensional pixel array as the 1-D coordinate arrays, and the result is valid.

  • Regular coordinate spacing enables processing optimizations, both algorithmically and computationally.

Development

Contributions are welcome and appreciated.

To run the test suite:

git clone https://github.com/spatial-image/spatial-image
cd spatial-image
pip install -e ".[test]"
pytest