The Earthmover Platform

Streamline multidimensional data workflows, reduce DevOps complexity, and avoid bottlenecks as data sets scale

Modern architecture for tensor data

The Earthmover team does more than provide a data platform for multidimensional data. As leaders in the open source ecosystem, we’re defining a new cloud-native standard for working with tensor data to accelerate scientific research, earth system AI modeling, and data-driven application development.


arraylake

Unify data management with Arraylake

Transform scattered datasets into a single source of truth, while retaining the native multidimensional structure of your data

01.

Turn-key cloud platform for data management

Harness the power of cloud computing and collaboration for data cubes without building and maintaining custom cloud infrastructure. Bring your own object storage or let us manage it for you.

02.

High performance data loading

Combine performance and flexibility with chunked, compressed array storage built for fast training of machine learning models. Zero-copy ingestion of existing Zarr, NetCDF, HDF, GRIB, and TIFF data.
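To see why chunked storage helps data loading, here is a minimal sketch of the underlying idea, not Arraylake's implementation: when an array is split into a regular grid of chunks, a reader only needs to fetch the chunks that overlap the requested region. The cube shape, chunk shape, and function names below are hypothetical, chosen only for illustration.

```python
from itertools import product
from math import ceil


def chunks_for_selection(shape, chunk_shape, selection):
    """Return the chunk-grid indices that overlap a selection.

    `selection` holds one half-open (start, stop) pair per dimension.
    """
    per_dim = []
    for size, chunk, (start, stop) in zip(shape, chunk_shape, selection):
        if not (0 <= start < stop <= size):
            raise ValueError("selection is out of bounds")
        first = start // chunk          # chunk holding the first element
        last = (stop - 1) // chunk      # chunk holding the last element
        per_dim.append(range(first, last + 1))
    return list(product(*per_dim))


def total_chunks(shape, chunk_shape):
    """Count every chunk in the grid, including partial edge chunks."""
    count = 1
    for size, chunk in zip(shape, chunk_shape):
        count *= ceil(size / chunk)
    return count


# Hypothetical (time, lat, lon) data cube and chunking scheme.
shape = (365, 720, 1440)
chunk_shape = (30, 180, 360)

# Ten time steps over a small lat/lon window.
needed = chunks_for_selection(shape, chunk_shape, [(0, 10), (100, 200), (300, 400)])
print(f"{len(needed)} of {total_chunks(shape, chunk_shape)} chunks are read")
```

In this example only a small fraction of the stored chunks is touched, which is the property that lets training jobs stream subsets of a large cube instead of whole files.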

03.

User-friendly data catalog

Streamline data organization through a central, unified catalog of all your array-based data assets.

04.

Robust data governance

Easily manage access with rich permission structures and audit your data with immutable data references.

flux

Accelerate data delivery with Flux

Enable seamless data exploration and accelerate data product development and delivery with a turn-key, high-performance gateway to your data

01.

Explore data at the speed of thought

Empower analysts to query and visualize multidimensional data in the cloud using their preferred tools, without waiting to download locally.

02.

Prototype and iterate faster

Quickly explore and iterate on a dataset to develop your product, service, or AI/ML model and then build applications that query your data via API using WMS, EDR, or OPeNDAP.

03.

Remove the complexity of data delivery

Remove the operational complexity of data product delivery. Flux enables a wide array of scalable delivery options offering massive savings in engineering, infrastructure, and maintenance.

Supported standards-compliant endpoints

Web Map Service (WMS)

Explore, query, and integrate map layers, data, and metadata in MapboxGL, QGIS, and Leaflet through the Web Map Service (WMS) endpoint.
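As a rough illustration of what a WMS client sends, the sketch below assembles a standard WMS 1.3.0 GetMap request URL. The query parameters are the ones defined by the WMS specification; the base URL and layer name are hypothetical placeholders, not real Flux endpoints.

```python
from urllib.parse import urlencode, urlparse, parse_qs


def build_getmap_url(base_url, layer, bbox, width, height,
                     crs="CRS:84", image_format="image/png", time=None):
    """Build a WMS 1.3.0 GetMap URL.

    With CRS:84 the bounding box is (min_lon, min_lat, max_lon, max_lat).
    Note that EPSG:4326 in WMS 1.3.0 uses latitude-first axis order instead.
    """
    params = {
        "SERVICE": "WMS",
        "VERSION": "1.3.0",
        "REQUEST": "GetMap",
        "LAYERS": layer,
        "STYLES": "",  # empty value requests the layer's default style
        "CRS": crs,
        "BBOX": ",".join(str(v) for v in bbox),
        "WIDTH": width,
        "HEIGHT": height,
        "FORMAT": image_format,
    }
    if time is not None:
        params["TIME"] = time  # optional time dimension
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint and layer name, for illustration only.
url = build_getmap_url(
    "https://flux.example.com/wms",
    layer="air_temperature",
    bbox=(-180, -90, 180, 90),
    width=1024,
    height=512,
    time="2024-01-01T00:00:00Z",
)
print(url)
```

Tools such as QGIS and Leaflet generate requests of this shape themselves, so in practice you usually supply only the service URL and layer name.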

Environmental data retrieval (EDR)

Retrieve consistent, well-formatted JSON and CSV data from Arraylake using the EDR API developed by the Open Geospatial Consortium (OGC).
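For a sense of what an EDR request looks like, the sketch below builds a position query as described in the OGC API - Environmental Data Retrieval standard, using its `coords`, `parameter-name`, `datetime`, and `f` query parameters. The base URL, collection ID, parameter name, and the `csv` format value are assumptions for illustration; the format names a server accepts are server-specific.

```python
from urllib.parse import urlencode, quote, urlparse, parse_qs


def build_edr_position_url(base_url, collection_id, lon, lat,
                           parameter_names, datetime_range=None, fmt=None):
    """Build an OGC API - EDR position query URL for a single point."""
    params = {
        "coords": f"POINT({lon} {lat})",  # WKT point, longitude first
        "parameter-name": ",".join(parameter_names),
    }
    if datetime_range is not None:
        params["datetime"] = datetime_range  # instant or start/end interval
    if fmt is not None:
        params["f"] = fmt  # output format; accepted values vary by server
    path = f"{base_url.rstrip('/')}/collections/{quote(collection_id, safe='')}/position"
    return f"{path}?{urlencode(params)}"


# Hypothetical endpoint, collection, and parameter name.
edr_url = build_edr_position_url(
    "https://flux.example.com/edr",
    collection_id="era5",
    lon=-105.27,
    lat=40.01,
    parameter_names=["air_temperature"],
    datetime_range="2024-01-01T00:00:00Z/2024-01-07T00:00:00Z",
    fmt="csv",
)
print(edr_url)
```

The response to such a query is a time series at one location, which is often all an application needs from a much larger data cube.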

Network data access protocol (OPeNDAP)

Query subsets and aggregates of array data in a range of data formats (NetCDF, HDF, GRIB) without downloading entire files.
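Subsetting over OPeNDAP works by appending a constraint expression to the dataset URL. The sketch below builds a DAP2-style hyperslab request, where each dimension is given as `[start:stride:stop]` with an inclusive stop index. The dataset URL and variable name are hypothetical, and the helper functions are illustrative rather than part of any OPeNDAP client library.

```python
def dap2_hyperslab(variable, *index_ranges):
    """Build a DAP2 projection such as sst[0:1:0][100:1:199].

    Each range is a (start, stride, stop) tuple; stop is inclusive.
    """
    parts = []
    for start, stride, stop in index_ranges:
        if start < 0 or stride < 1 or stop < start:
            raise ValueError(f"invalid index range: {(start, stride, stop)}")
        parts.append(f"[{start}:{stride}:{stop}]")
    return variable + "".join(parts)


def build_opendap_url(dataset_url, projections, response="dods"):
    """Append a response suffix (dds, das, dods, ascii) and a constraint."""
    return f"{dataset_url}.{response}?{','.join(projections)}"


# Hypothetical dataset: one time step of a 100 x 100 lat/lon window.
subset = dap2_hyperslab("sst", (0, 1, 0), (100, 1, 199), (300, 1, 399))
opendap_url = build_opendap_url(
    "https://flux.example.com/opendap/sst", [subset], response="ascii"
)
print(opendap_url)
```

Libraries such as Xarray can open an OPeNDAP URL directly and issue requests like this behind the scenes, so only the selected window crosses the network.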

open source leadership

Open-source core

Arraylake is built on the open-source Icechunk/Zarr storage engine, so enterprises avoid vendor lock-in, maintain data sovereignty and control, and keep their data portable through adherence to open standards.

Earthmover founders Dr. Ryan Abernathey and Dr. Joe Hamman have spent their careers doing cutting-edge research in climate science, remote sensing, and cloud-native data analytics. They maintain numerous critical open-source scientific Python packages, including Xarray, Zarr, and the Pangeo Project. Xarray and Zarr together have over 4,000 stars on GitHub and are used by teams at NVIDIA, NOAA, Google, Microsoft, and more.

Build smarter with expert guidance

Accelerate your roadmap with expert guidance from climate scientists and data engineers building the leading open source solutions and defining the modern workflow for multidimensional data. 


Want to learn more? Book a demo or join our mailing list to stay up to date with new releases.