Paper
Libyt: A Tool for Parallel In Situ Analysis with yt, Python, and Jupyter
Presenter
Shin-Rong Tsai is a research scientist at the University of Illinois Urbana-Champaign School of Information Sciences. She has worked on developing astrophysics simulations, processing and visualizing data, and improving application performance when scaling to high-performance computing platforms. Her work now focuses on creating an in situ analysis tool that enables Python-based data analysis of running simulations. She also develops tools for analyzing and visualizing volumetric data.
Description
In the era of extreme-scale computing, large-scale data storage and analysis have become more critical and challenging. For post-processing, the simulation first needs to dump snapshots on a hard disk before processing any data. This becomes a bottleneck for high spatial and temporal resolution simulation. In situ analysis provides a viable solution for analyzing extreme scale simulations by processing data in memory, which skips the step of storing data on disk. We present libyt, an open-source C library that allows researchers to analyze and visualize data using yt or other Python packages in parallel computing during simulation runtime. We describe the code method for connecting simulation runtime data to Python, handling data transition and redistribution between Python and simulation processes with minimal memory overhead, and supporting interactive Python prompt and Jupyter Notebook for users to probe the ongoing simulation data at the current time step. We demonstrate how it solves the problem of visualizing large-scale astrophysical simulations, improving disk usage efficiency, and monitoring simulations closely. We conclude it with discussions and compare libyt to post-processing.