Back

Minisymposium

MS1A - Foundation Models in Earth System Science

Fully booked
Monday, June 3, 2024
11:30
-
13:30
CEST
HG F 1

Replay

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

Session Chair

Description

Foundation models represent a new class of artificial intelligence models designed to encapsulate information from large amounts of data. The most prominent example are large language models, which have achieved significant breakthroughs in various applications, including natural language processing (NLP), text generation, and machine translation. Some prominent examples of foundation models include GPT models, BERT, PalmX and their derivatives. This workshop will explore the untapped potential of foundation models in Earth system science, bringing together three different perspectives. First, we will introduce AtmoRep, a foundation model for the atmosphere released by a multidisciplinary collaboration between Magdeburg University, CERN and JSC. After that, we will move the discussion to the industry perspective, with Dr. Johannes Jakubik from IBM. In particular, Dr. Jakubik will present the Prithvi model for Earth Observations data, developed in collaboration with NASA. After that, Troy Arcomano from Argonne National Laboratory will present two different models: ClimaX, released in collaboration with Microsoft, and Stromer. The last part of the workshop will be an interactive discussion with the audience on the role of foundation models in fundamental science. It will be the occasion for the community to brainstorm around the future and the potential of these new technologies.

Presentations

11:30
-
12:00
CEST
AtmoRep: A Probabilistic Multi-Purpose Model for Atmospheric Dynamics

AtmoRep is a first example of probabilistic multi-purpose model that extends the concept of representation learning to Earth system science, and in particular to atmospheric dynamics. Starting from a single pre-trained architecture used as a backbone, AtmoRep is able to achieve skilful results in multiple zero-shot applications such as nowcasting, temporal interpolation and scenario generation, compared to the state-of-the-art approaches. Thanks to a novel definition of the loss, the model is also probabilistic by design, as it outputs a set of ensemble members for each task, with well calibrated distributions as proven for weather forecasting. The first part of the talk will focus on the innovative aspects of the model architecture, with particular attention to the local space-time approach and the flexible plug-and-play design. The second part of the talk will show the current results for the most relevant applications, while the last part of the talk will focus on the future foreseen developments.

Christian Lessig (ECMWF, Otto-von-Guericke-Universitat Magdeburg) and Ilaria Luise (CERN)
With Thorsten Kurth (NVIDIA Inc.)
12:00
-
12:30
CEST
Foundation Models in Earth Science and Remote Sensing

Significant progress in the development of highly adaptable and reusable Artificial Intelligence (AI) models is expected to have a significant impact on Earth science and remote sensing. Foundation Models (FMs), AI models designed to replace task-specific models, are increasingly being recognized for their versatility across numerous downstream applications. These models, trained using self-supervised techniques on any type of sequence data, circumvent the need for large annotated datasets, a major bottleneck in traditional AI model development. FMs can be applied to downstream tasks using few-shot learning and fine-tuning, significantly reducing the need for large labeled training datasets and computational resources. In contrast to task-specific models, large-scale FMs facilitate the processing of multi-modal data from different satellites and additional data modalities in order to obtain improved model skills in the Earth observation domain.

Johannes Jakubik (IBM Research)
With Thorsten Kurth (NVIDIA Inc.)
12:30
-
13:00
CEST
Evaluation of a Foundation Model Approach for Weather and Climate

Foundation models have demonstrated great success in the field of natural language processing (NLP) and for other vision-based tasks (e.g., DALL-E). With the rise of data-driven, global weather forecast models, researchers have begun to create foundation models for the Earth System. There several foundation models in development (e.g., ClimaX) to allow for rapid fine-tuning to specific tasks such as weather forecasting or climate. However, several open questions remain on how well this foundation model approach will work with such a complex and diverse set of tasks typically needed for weather and climate. Here, we evaluate the ability for ClimaX to perform downstream tasks not seen during the pre-training phase. Specifically, we look at two tasks using ClimaX fine-tuned on ERA5 to 1) perform data assimilation using real, in-situ observations of the atmosphere and 2) replace the output layer of the foundation model with one that is parameterized by a Gaussian to perform uncertainty quantification (UQ). We also use the lessons learned from these experiments to develop a state-of-the-art weather forecasting model called Stormer. Stormer is a simple transformer-based model that achieves state-of-the-art performance on weather forecasting with minimal changes to the standard transformer backbone.

Troy Arcomano (Argonne National Laboratory), Alex Wikner (University of Chicago), Tung Nguyen (UCLA), Romit Maulik (University of Pennsylvania), Sandeep Madireddy (Argonne National Laboratory), Aditya Grover (UCLA), and Rao Kotamarthi (Argonne National Laboratory)
With Thorsten Kurth (NVIDIA Inc.)
13:00
-
13:30
CEST
Discussion Panel: Future Perspectives

The last part of the workshop will be an interactive discussion on the following open questions: “Where foundation models could be useful in Earth system science?”, “What would be the advantages and limitations of foundation models in Earth system science?” and “Can we start building an agenda?”. Given the diverse audience of the event, we foreseen this part as a brainstorming occasion also across different scientific communities. For this reason, the discussion will focus on more general questions like “To which extent can we trust and utilize a foundation model for scientific predictions?” and “Will these models be able to replace the current numerical approaches?”. For this interactive Q&A part the speakers will be invited as panelists, together with experts like Karthik Kashinath from NVIDIA and Sebastian Schemm from ETH Zurich to ensure a broader overview of the ongoing efforts in the field.

Christian Lessig (ECMWF, Otto-von-Guericke-Universitat Magdeburg) and Ilaria Luise (CERN)
With Thorsten Kurth (NVIDIA Inc.)