Latest Posts
Thanks to Xvec and developments across a number of packages, the Xarray ecosystem now supports data cubes with vector geometries as coordinate locations.
This is a blog version of a webinar that took place on April 16, 2024. Here’s a video of that webinar:
Geospatial datasets representing information about real-world features such as points, lines, and polygons are increasingly large, complex, and multidimensional. They are naturally represented as vector data cubes: n-dimensional arrays where at least one dimension is a set of vector geometries. The Xarray ecosystem now supports vector data cubes thanks to Xvec, a package designed for working with vector geometries within the Xarray data model 🎉. For those familiar with GeoPandas, Xvec is to Xarray as GeoPandas is to Pandas.
This blog post is geared toward analysts working with geospatial datasets. We introduce vector data cubes, discuss how they differ from raster data cubes, and de…
Read More
Background
The University of Wisconsin is home to a research team called Advanced Baseline Imager Live Imaging of Vegetated Ecosystems (ALIVE). The team, working remotely and led by Prof. Paul Stoy, PhD, is building a gradient-boosting regression model using geostationary satellites to estimate terrestrial carbon and water fluctuations in near real-time. The team trains its models using GOES-R and other public satellite and meteorological datasets.
In trying to process this data, they ran into a central problem of working with raster data: the formats it is distributed in, mainly NetCDF and GeoTIFF, are not conducive to time-series analysis. This experience inspired them to create output datasets that are analysis-ready for a variety of applications.
During AMS 2024, …
Read More
This post describes the fundamentals of Earth-Observation datacubes, outlines the basic Python building blocks for creating Zarr-backed datacubes, and presents a scalable serverless approach to building large-scale datacubes that is cost-effective, reliable, and performant.
This is a blog version of a webinar that took place on April 16, 2024. Here’s a video of that webinar:
Earth Observation satellites generate massive volumes of data about our planet, and these data are vital for confronting global challenges.
Satellite imagery is commonly distributed as individual “scenes” — a single file consisting of a single image of a tiny part of the Earth.
Popular public satellite programs such as NASA / USGS Landsat and Copernicus Sentinel produce millions of such images a year, comprising petabytes of data.
Increasingly, we see organizations looking to aggregate raw satellite imagery into more analysis-ready datacubes.
In contrast to millions of individual images sampled unevenly in space and time, Earth-system datacubes contain multiple variables, align…
Read More
Note: This post was originally published on the Zarr developer blog.
We released Zarr-Python 2.18.0 this week. Although this release was quite light in terms of user-facing changes, it represents the beginning of a new phase for the project. In this post, we’ll walk through our plan for Zarr-Python 3.0 and what users of the library can expect in the coming months.
Zarr-Python 2.18
Before we get into the 3.0 release, we’ll first cover a few details about the 2.18 release series. The first thing to know is that we will continue to support 2.18 with bug fixes up until the release of 3.0. Additionally, we expect to use the 2.18 series to communicate the changes to the Zarr-Python API that are coming in 3.0. For example, this week’s release included a number of new deprecation warnings for part…
Read More