Skip to content

Benchmarking Zarr and Parquet Data Retrieval using the National Water Model (NWM) in a Cloud-native environment

Posted on:September 22, 2022 at 11:00 PM

DALL-E 2 Imagines a Slice from a Large Dataset

Understanding efficiency gains in a scalable, cloud-native data-processing environment unlocks an array of new possibilities when dealing with large, complex datasets such as the National Water Model. In this article, we take a deep dive into Zarr and Parquet and determine which is more performant on various time scales.

This blog post recaps our research on which popular data format works better for National Water Model access over the cloud. Read the full article on the Azavea blog here.

Also read our follow up to this work that deals with optimizing access to the NWM’s Prediction Dataset.