Back to Search Start Over

Massive Scale Data Analytics at LCLS-II

Authors :
Thayer Jana
Chen Zhantao
Claus Richard
Damiani Daniel
Ford Christopher
Dubrovin Mikhail
Elmir Victor
Kroeger Wilko
Li Xiang
Marchesini Stefano
Mariani Valerio
Melcchiori Riccardo
Nelson Silke
Peck Ariana
Perazzo Amedeo
Poitevin Frederic
O’Grady Christopher Paul
Otero Julieth
Quijano Omar
Shankar Murali
Uervirojnangkoorn Monarin
Veraldi Riccardo
Weaver Matthew
Weninger Clemens
Yamajala Seshu
Wang Cong
Yoon Chun Hong
Source :
EPJ Web of Conferences, Vol 295, p 13002 (2024)
Publication Year :
2024
Publisher :
EDP Sciences, 2024.

Abstract

The increasing volumes of data produced at light sources such as the Linac Coherent Light Source (LCLS) enable the direct observation of materials and molecular assemblies at the length and timescales of molecular and atomic motion. This exponential increase in the scale and speed of data production is prohibitive to traditional analysis workflows that rely on scientists tuning parameters during live experiments to adapt data collection and analysis. User facilities will increasingly rely on the automated delivery of actionable information in real time for rapid experiment adaptation which presents a considerable challenge for data acquisition, data processing, data management, and workflow orchestration. In addition, the desire from researchers to accelerate science requires rapid analysis, dynamic integration of experiment and theory, the ability to visualize results in near real-time, and the introduction of ML and AI techniques. We present the LCLS-II Data System architecture which is designed to address these challenges via an adaptable data reduction pipeline (DRP) to reduce data volume on-thefly, online monitoring analysis software for real-time data visualization and experiment feedback, and the ability to scale to computing needs by utilizing local and remote compute resources, such as the ASCR Leadership Class Facilities, to enable quasi-real-time data analysis in minutes. We discuss the overall challenges facing LCLS, our ongoing work to develop a system responsive to these challenges, and our vision for future developments.

Subjects

Subjects :
Physics
QC1-999

Details

Language :
English
ISSN :
2100014X
Volume :
295
Database :
Directory of Open Access Journals
Journal :
EPJ Web of Conferences
Publication Type :
Academic Journal
Accession number :
edsdoj.8c7579faa446e98839e02c9c6cf13c
Document Type :
article
Full Text :
https://doi.org/10.1051/epjconf/202429513002