<div dir="ltr"><div class="gmail_default" style="font-size:small">Please excuse a typo in the last email, which should say <b>Joe Hamman</b> not Joe Hammand.<br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Oct 16, 2023 at 2:17 PM Aimee Barciauskas <<a href="mailto:aimee@developmentseed.org">aimee@developmentseed.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_default" style="font-size:small"><div>Please join us Monday Oct 30 in welcoming Joe Hammand and Ryan Abernathey to present Arraylake.<br></div><div>
<p><strong>Topic:</strong> Arraylake: A Cloud-Native Data Lake Platform for Earth System Science</p>
<p><strong>When:</strong> Monday October 30th, 10:30-11:30 am PT / 1:30-2:30 pm ET / 7:30-8:30pm CEST</p>
<p><strong>Who:</strong> Ryan Abernathey and Joe Hamman, Founders of <a href="http://Earthmover.io" rel="noopener nofollow ugc" target="_blank">Earthmover.io</a></p>
<p><strong>Where:</strong> Find joining information on <a href="https://www.esipfed.org/get-involved/community-calendar" rel="noopener nofollow ugc" target="_blank">ESIP Community Calendar</a></p>
<p>NOTE: THIS DAY AND TIME IS DIFFERENT FROM THE ORIGINAL ESIP COMMUNITY
CALENDAR. The ESIP community calendar should be updated with this new
day and time soon.</p>
<p><strong>Abstract:</strong><br>
The vast amount of earth system data available today is an incredible
resource for understanding our planet and confronting the challenge of
climate change. Traditionally, a few large organizations have provided
most of the data, and users have downloaded data to local computers.
This way of working is becoming increasingly infeasible as data volumes
grow and as AI-based methods demand direct access to full-scale data
archives. With essentially infinite compute and storage capacity, cloud
computing has the potential to revolutionize our interaction with
weather and climate data, allowing everyone to bring their own compute
workloads to bear against a single shared copy of the data. Over the
past years, via our work in the Pangeo project, we have prototyped a
cloud-native approach to weather and climate data in the cloud,
combining scalable computing technologies such as Xarray and Dask with
analysis-ready, cloud-optimized data in formats like Zarr. While these
tools show great potential, they remain difficult to deploy and use in
an operational context for many scientists and institutions.</p>
<p>Motivated by this challenge, we founded Earthmover, a company aimed
at democratizing access to state-of-the-art cloud-native data analytics,
and built Arraylake, a data platform which enables teams of any size to
manage and analyze weather and climate data in the cloud. Arraylake
users can access high-quality public datasets alongside their own
private data, all via the high-performance Zarr data standard. This talk
describes Arraylake’s architecture, novel version control system for
data, and approach to supporting all common climate data formats
(NetCDF, HDF5, Grib, Tiff, Zarr) via a single, user-friendly interface.
Via a short demo, we illustrate how Arraylake helps overcome common data
management challenges that have henceforth limited widespread adoption
of cloud computing in earth system science.</p>
<ul><li>5 minutes - Welcome and Announcements</li><li>30 minutes - Presentation</li><li>25 minutes - Q + A</li></ul><div>Thanks,</div><div>Aimee on behalf of the Cloud Computing Cluster organizing team<br></div></div></div></div>
</blockquote></div>