Back

Minisymposium

MS2C - Innovations Unleashed: The Future of Scientific Research with Cloud Labs and Self-Driving Labs

Fully booked
Monday, June 3, 2024
14:30
-
16:30
CEST
HG E 1.1

Replay

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

Session Chair

Description

The integration of artificial intelligence (AI), machine learning (ML), and automated instrumentation is transforming research through Cloud Labs and Self-Driving Labs (SDLs). Positioned strategically, these labs enhance accessibility and integrate computing and data analysis, enabling researchers to leverage cutting-edge technology globally. A national network of Cloud Labs and SDLs is envisioned to promote synergy in AI, ML, computing, and data analysis. The minisymposium prioritizes efficiency, reproducibility, inclusivity, collaboration, and cost-effectiveness, fostering a robust ecosystem. Addressing challenges, it targets reproducibility in Cloud Labs and SDLs, emphasizing parallels between computing and experimental workflows. Pursuing replicable results in virtual environments and automated SDL procedures builds trust in scientific validity. Standardized practices, rigorous documentation, and transparent methodologies, whether in digital Cloud Labs or physical SDLs, are crucial for advancing scientific inquiry and bolstering trust in research outcomes. The minisymposium aims to unite experts, researchers, and industry professionals to showcase innovations, discuss challenges, explore funding opportunities, and share success stories in integrating computing and data analysis within research paradigms.

Presentations

14:30
-
15:00
CEST
Accelerated Materials and Molecular Discovery with Self-Driving Fluidic Labs

Accelerating the discovery of new molecules and materials, as well as green and sustainable ways to synthesize and manufacture them, will profoundly impact the worldwide challenges in energy, sustainability, and healthcare. The current human-dependent paradigm of experimental research in chemical and materials sciences fails to identify technological solutions quickly. Recent advances in reaction miniaturization, automated experimentation, and data science provide an exciting opportunity to reshape the discovery and development of new molecules and materials related to energy transition and sustainability. In this talk, I will present a 'self-driving fluidic lab (SDFL)' for autonomous discovery and development of emerging advanced functional materials and molecules through integrating flow chemistry, online characterization, and machine learning (ML). I will discuss how modularization of different chemical synthesis and processing stages in tandem with constantly evolving ML modeling and decision-making under uncertainty can enable resource-efficient navigation through high dimensional experimental design spaces (>1020 possible experimental conditions). Example applications of the SDFL for the autonomous synthesis of quantum dots and specialty chemicals will be presented to illustrate the potential of autonomous robotic experimentation in reducing synthetic route discovery timeframe from >10 years to a few months.

Milad Abolhasani (North Carolina State University)
With Thorsten Kurth (NVIDIA Inc.)
15:00
-
15:30
CEST
The Interplay of Computational and Experimental Workflows

For the past two decades, computational workflows and workflow management systems have enabled the automation of complex computing applications. They have empowered scientists to describe their computational problems in terms of tasks that need to be executed, the data they need to process, and in which order the tasks need to be performed. Because of the workflow formalism and the automation provided by workflow management systems, scientists in a number of domains, including bioinformatics, astronomy, earthquake science, and gravitational-wave physics have been able to achieve breakthroughs otherwise not possible.

Recently, experimental workflows in biology, chemistry, and material science are being formalized and their execution is being enabled via automation provided by cloud and self-driving labs. However, there is currently no connection between computational and experimental workflows.

This talk explores the synergies between computational and experimental workflow systems and how they can potentially complement each other to achieve scientific objectives.

Ewa Deelman (University of Southern California)
With Thorsten Kurth (NVIDIA Inc.)
15:30
-
16:00
CEST
Optimizing Dataflow Pipelines from Self-Driving Labs to the Cloud

The rapid advancements in cloud computing and the integration of experimental facilities, including self-driving labs, have resulted in an era where scientists can generate unprecedented amounts of data and conduct more extensive analyses across various scientific domains, including chemistry, materials sciences, molecular biology, and drug design. This capability enables a broader exploration of natural phenomena but also introduces significant challenges in effectively composing and scaling dataflow pipelines. This talk addresses these challenges by presenting innovative solutions for optimizing dataflow pipelines across cloud resources, thereby enhancing the study and application of scientific dataflows.

This talk will cover three main research components of our work when optimizing dataflow pipelines from self-driving labs to the cloud. First, we establish a taxonomy of common dataflow motifs ranging from simple producer-consumer pairs to complex multi-scale pipelines, applying these motifs to real-world use cases. Second, we discuss methods to mitigate data loss and pipeline inefficiencies, especially those arising from disparities in moving pipelines traditionally executed on high performance computing systems to the cloud. Last, we highlight our efforts to train and build a community of experts, emphasizing the development of tailored data analytics material across scientific domains.

Michela Taufer (University of Tennessee)
With Thorsten Kurth (NVIDIA Inc.)
16:00
-
16:30
CEST
The Mixed Impacts of Clouds for Reproducible Research

Cloud environments have democratized access to large compute centers with easy to use interfaces and straightforward cost models. While this has lowered the barriers to many scientific disciplines to make reproducible computational experiments and data analysis, many of these enabling technologies can hurt reproducibility efforts inadvertently. This talk will explore some of the common interfaces, such as Jupyter notebooks and containers, to explore how these tools both support and hurt reproducibility efforts and offer ideas on some ways to mitigate these challenges.

Jay Lofstead (Sandia National Laboratories)
With Thorsten Kurth (NVIDIA Inc.)