The Earthmover Platform
Streamline multidimensional data workflows, reduce DevOps complexity, and avoid bottlenecks as data sets scale
Modern architecture for tensor data
The Earthmover team does more than provide a data platform for multidimensional data. As leaders in the open source ecosystem, we’re defining a new cloud-native standard for working with tensor data to accelerate scientific research, earth system AI modeling, and data-driven application development.
Unify data management with Arraylake
Transform scattered datasets into a single source of truth, while retaining the native multidimensional structure of your data
Turn-key cloud platform for data management
Harness the power of cloud computing and collaboration for data cubes without building and maintaining custom cloud infrastructure. Bring your own object storage or let us manage it for you.
High performance data loading
Combine performance and flexibility with chunked, compressed array-based storage for performant training of machine learning models. Zero-copy ingestion of existing Zarr, NetCDF, HDF, GRIB, and TIFF data.
User-friendly data catalog
Streamline data organization through a central, unified catalog of all your array-based data assets.
Robust data governance
Easily manage access with rich permission structures and audit your data with immutable data references.
Accelerate data delivery with Flux
Enable seamless data exploration and accelerate data product development and delivery with a turn-key, high-performance gateway to your data
Explore data at the speed of thought
Empower analysts to query and visualize multidimensional data in the cloud using their preferred tools, without waiting to download locally.
Prototype and iterate faster
Quickly explore and iterate on a dataset to develop your product, service, or AI/ML model and then build applications that query your data via API using WMS, EDR, or OPeNDAP.
Remove the complexity of Data Delivery
Remove the operational complexity of data product delivery. Flux enables a wide array of scalable delivery options offering massive savings in engineering, infrastructure, and maintenance.
Supported standards-compliant endpoints
Web mapping service (WMS)
Explore, query, and integrate maps layers, data, and metadata in MapboxGL, QGIS, and Leaflet via web mapping service (WMS) integration.
Read moreEnvironmental data retrieval (EDR)
Retrieve consistent, well-formatted JSON and CSV data from Arraylake using the EDR API developed by the Open Geospatial Consortium (OGC).
Read moreNetwork data access protocol (OPeNDAP)
Query subsets and aggregates of array data in a range of data formats (NetCDF, HDF, GRIB) without downloading entire files.
Read moreOpen-source core
Built on the Icechunk/Zarr open-source data storage engine, with Arraylake, enterprises avoid vendor lock-in, maintain data sovereignty and control, and ensure seamless data portability through adherence to open standards.
Earthmover founders Dr. Ryan Abernathey and Dr. Joe Hamman, have spent their careers doing cutting-edge research in climate science, remote sensing, and cloud-native data analytics. They maintain numerous critical open-source scientific Python packages, including Xarray, Zarr, and the Pangeo Project. Xarray and Zarr together have over 4,000 stars on Github and are used by teams at NVIDIA, NOAA, Google, Microsoft, and more.

Build smarter with expert guidance
Accelerate your roadmap with expert guidance from climate scientists and data engineers building the leading open source solutions and defining the modern workflow for multidimensional data.