TY - JOUR
T1 - A characterization of workflow management systems for extreme-scale applications
AU - Ferreira da Silva, Rafael
AU - Filgueira, Rosa
AU - Pietri, Ilia
AU - Jiang, Ming
AU - Sakellariou, Rizos
AU - Deelman, Ewa
N1 - Funding Information:
This work was performed under the auspices of the US Department of Energy (DOE) by Lawrence Livermore National Laboratory (LLNL) under Contract DE-AC52-07NA27344 (LLNL-JRNL-706700). This work was partially funded by the Laboratory Directed Research and Development Program at LLNL under project 16-ERD-036; by the Scottish Informatics and Computer Science Alliance (SICSA) with the Postdoctoral and Early Career Researcher Exchanges (PECE) fellowship; and by DOE under Contract #DESC0012636, ?Panorama?Predictive Modeling and Diagnostic Monitoring of Extreme Science Workflows?.
Publisher Copyright:
© 2017 Elsevier B.V.
PY - 2017/10
Y1 - 2017/10
N2 - Automation of the execution of computational tasks is at the heart of improving scientific productivity. Over the last years, scientific workflows have been established as an important abstraction that captures data processing and computation of large and complex scientific applications. By allowing scientists to model and express entire data processing steps and their dependencies, workflow management systems relieve scientists from the details of an application and manage its execution on a computational infrastructure. As the resource requirements of today's computational and data science applications that process vast amounts of data keep increasing, there is a compelling case for a new generation of advances in high-performance computing, commonly termed as extreme-scale computing, which will bring forth multiple challenges for the design of workflow applications and management systems. This paper presents a novel characterization of workflow management systems using features commonly associated with extreme-scale computing applications. We classify 15 popular workflow management systems in terms of workflow execution models, heterogeneous computing environments, and data access methods. The paper also surveys workflow applications and identifies gaps for future research on the road to extreme-scale workflows and management systems.
AB - Automation of the execution of computational tasks is at the heart of improving scientific productivity. Over the last years, scientific workflows have been established as an important abstraction that captures data processing and computation of large and complex scientific applications. By allowing scientists to model and express entire data processing steps and their dependencies, workflow management systems relieve scientists from the details of an application and manage its execution on a computational infrastructure. As the resource requirements of today's computational and data science applications that process vast amounts of data keep increasing, there is a compelling case for a new generation of advances in high-performance computing, commonly termed as extreme-scale computing, which will bring forth multiple challenges for the design of workflow applications and management systems. This paper presents a novel characterization of workflow management systems using features commonly associated with extreme-scale computing applications. We classify 15 popular workflow management systems in terms of workflow execution models, heterogeneous computing environments, and data access methods. The paper also surveys workflow applications and identifies gaps for future research on the road to extreme-scale workflows and management systems.
KW - Extreme-scale computing
KW - in situ processing
KW - Scientific workflows
KW - Workflow management systems
UR - http://www.scopus.com/inward/record.url?scp=85014024259&partnerID=8YFLogxK
U2 - 10.1016/j.future.2017.02.026
DO - 10.1016/j.future.2017.02.026
M3 - Article
AN - SCOPUS:85014024259
SN - 0167-739X
VL - 75
SP - 228
EP - 238
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -