Integrating HPC, AI, and Workflows for Scientific Data Analysis (Dagstuhl Seminar 23352)

Authors Rosa M. Badia, Laure Berti-Equille, Rafael Ferreira da Silva, Ulf Leser and all authors of the abstracts in this report



PDF
Thumbnail PDF

File

DagRep.13.8.129.pdf
  • Filesize: 3.02 MB
  • 36 pages

Document Identifiers

Author Details

Rosa M. Badia
  • Barcelona Supercomputing Center, ES
Laure Berti-Equille
  • Research and Development Institute, IRD Montpellier, FR
Rafael Ferreira da Silva
  • Oak Ridge National Laboratory, US
Ulf Leser
  • Humboldt University of Berlin, DE
and all authors of the abstracts in this report

Cite AsGet BibTex

Rosa M. Badia, Laure Berti-Equille, Rafael Ferreira da Silva, and Ulf Leser. Integrating HPC, AI, and Workflows for Scientific Data Analysis (Dagstuhl Seminar 23352). In Dagstuhl Reports, Volume 13, Issue 8, pp. 129-164, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)
https://doi.org/10.4230/DagRep.13.8.129

Abstract

The Dagstuhl Seminar 23352, titled "Integrating HPC, AI, and Workflows for Scientific Data Analysis," held from August 27 to September 1, 2023, was a significant event focusing on the synergy between High-Performance Computing (HPC), Artificial Intelligence (AI), and scientific workflow technologies. The seminar recognized that modern Big Data analysis in science rests on three pillars: workflow technologies for reproducibility and steering, AI and Machine Learning (ML) for versatile analysis, and HPC for handling large data sets. These elements, while crucial, have traditionally been researched separately, leading to gaps in their integration. The seminar aimed to bridge these gaps, acknowledging the challenges and opportunities at the intersection of these technologies. The event highlighted the complex interplay between HPC, workflows, and ML, noting how ML has increasingly been integrated into scientific workflows, thereby enhancing resource demands and bringing new requirements to HPC architectures, like support for GPUs and iterative computations. The seminar also addressed the challenges in adapting HPC for large-scale ML tasks, including in areas like deep learning, and the need for workflow systems to evolve to leverage ML in data analysis fully. Moreover, the seminar explored how ML could optimize scientific workflow systems and HPC operations, such as through improved scheduling and fault tolerance. A key focus was on identifying prestigious use cases of ML in HPC and understanding their unique, unmet requirements. The stochastic nature of ML and its impact on the reproducibility of data analysis on HPC systems was also a topic of discussion.

Subject Classification

ACM Subject Classification
  • Computing methodologies → Distributed computing methodologies
  • Computing methodologies → Machine learning
  • Computing methodologies → Parallel computing methodologies
Keywords
  • Large scale data presentation and analysis
  • Exascale class machine optimization
  • Performance data analysis and root cause detection
  • High dimensional data representation

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail