Abstract
Scientific workflows bridge scientific challenges with computational resources. While dispel4py, a stream-based workflow system, offers mappings to parallel enactment engines like MPI or Multiprocessing, its optimization primarily focuses on dynamic process-to-task allocation for improved performance. An efficiency gap persists, particularly with the growing emphasis on conserving computing resources. Moreover, the existing dynamic optimization lacks support for stateful applications and grouping operations.
To address these issues, our work introduces a novel hybrid approach for handling stateful operations and groupings within workflows, leveraging a new Redis mapping. We also propose an auto-scaling mechanism integrated into dispel4py’s dynamic optimization. Our experiments showcase the effectiveness of auto-scaling optimization, achieving efficiency while upholding performance. In the best case, auto-scaling reduces dispel4py’s runtime to 87% compared to the baseline, using only 76% of process resources. Importantly, our optimized stateful dispel4py demonstrates a remarkable speedup, utilizing just 32% of the runtime compared to the contender.
To address these issues, our work introduces a novel hybrid approach for handling stateful operations and groupings within workflows, leveraging a new Redis mapping. We also propose an auto-scaling mechanism integrated into dispel4py’s dynamic optimization. Our experiments showcase the effectiveness of autoscaling optimization, achieving efficiency while upholding performance. In the best case, auto-scaling reduces dispel4py’s runtime to 87% compared to the baseline, using only 76% of process resources. Importantly, our optimized stateful dispel4py demonstrates a remarkable speedup, utilizing just 32% of the runtime compared to the contender.
To address these issues, our work introduces a novel hybrid approach for handling stateful operations and groupings within workflows, leveraging a new Redis mapping. We also propose an auto-scaling mechanism integrated into dispel4py’s dynamic optimization. Our experiments showcase the effectiveness of auto-scaling optimization, achieving efficiency while upholding performance. In the best case, auto-scaling reduces dispel4py’s runtime to 87% compared to the baseline, using only 76% of process resources. Importantly, our optimized stateful dispel4py demonstrates a remarkable speedup, utilizing just 32% of the runtime compared to the contender.
To address these issues, our work introduces a novel hybrid approach for handling stateful operations and groupings within workflows, leveraging a new Redis mapping. We also propose an auto-scaling mechanism integrated into dispel4py’s dynamic optimization. Our experiments showcase the effectiveness of autoscaling optimization, achieving efficiency while upholding performance. In the best case, auto-scaling reduces dispel4py’s runtime to 87% compared to the baseline, using only 76% of process resources. Importantly, our optimized stateful dispel4py demonstrates a remarkable speedup, utilizing just 32% of the runtime compared to the contender.
Original language | English |
---|---|
Title of host publication | Proceedings of the SC '23 workshops of the international conference on high performance computing, network, storage, and analysis (SC-W '23) |
Subtitle of host publication | Nov 12-17, 2023 | Denver, CO |
Publisher | ACM |
Pages | 2021–2032 |
ISBN (Print) | 9798400707858 |
DOIs | |
Publication status | Published - 1 Nov 2023 |
Event | 18th Workshop on Workflows in Support of Large-Scale Science (WORKS 2023) - Denver, United States Duration: 12 Nov 2023 → 12 Nov 2023 Conference number: 18 https://works-workshop.org/ |
Conference
Conference | 18th Workshop on Workflows in Support of Large-Scale Science (WORKS 2023) |
---|---|
Abbreviated title | WORKS 2023 |
Country/Territory | United States |
City | Denver |
Period | 12/11/23 → 12/11/23 |
Internet address |
Keywords
- Scientific workflows
- Stream-based workflow
- Workflow optimization
- Auto-scaling
- Stateful application
- dispel4py