Education, Science, Technology, Innovation and Life
Open Access
Sign In

Scheduling optimization strategy for data intensive workflows in cloud computing

Download as PDF

DOI: 10.23977/cpcs.2024.080112 | Downloads: 11 | Views: 723

Author(s)

Suxing Hua 1, Qiqi Gao 1, Zhenling Wang 1

Affiliation(s)

1 Wuxi Institute of Technology, Wuxi, Jiangsu, 214000, China

Corresponding Author

Qiqi Gao

ABSTRACT

The rapid growth of data-intensive applications has led to an increased demand for efficient scheduling strategies in cloud computing environments. This paper focuses on the optimization of scheduling for data-intensive workflows, addressing the challenges posed by resource heterogeneity, complex workflow dependencies, and the need for scalability and elasticity. We propose a comprehensive approach that encompasses advanced task scheduling algorithms, dynamic resource allocation techniques, and effective parallelization and pipelining methods to enhance the performance of these workflows. The paper begins by characterizing data-intensive workflows and the cloud computing environment, highlighting the performance metrics crucial for evaluating workflow execution. It then delves into the scheduling challenges, discussing the implications of resource heterogeneity, the complexity of workflow dependencies, and the scalability and elasticity requirements of cloud-based workflows. We present optimization strategies that leverage heuristic and metaheuristic algorithms to schedule tasks efficiently, considering both task characteristics and resource capabilities. The resource allocation techniques discussed aim to optimize the utilization of cloud resources, adapting to the dynamic nature of the environment and the varying demands of tasks.

KEYWORDS

Cloud Computing; Data-Intensive Workflows; Scheduling Optimization; Resource Allocation

CITE THIS PAPER

Suxing Hua, Qiqi Gao, Zhenling Wang, Scheduling optimization strategy for data intensive workflows in cloud computing. Computing, Performance and Communication Systems (2024) Vol. 8: 99-105. DOI: http://dx.doi.org/10.23977/cpcs.2024.080112.

REFERENCES

[1] Roberto C, Oliver K, Diego C, et al. Data journeys: Explaining AI workflows through abstraction [J]. Semantic Web, 2024, 15 (4): 1057-1083.
[2] Gadaleta D. Automated Workflows for Data Curation and Machine Learning to Develop Quantitative Structure-Activity Relationships. [J]. Methods in molecular biology (Clifton, N.J.), 2025, 2834 115-130.
[3] Gierend K, Krüger F, Genehr S, et al. Provenance Information for Biomedical Data and Workflows: Scoping Review. [J]. Journal of medical Internet research, 2024, 26 e51297.
[4] Stow M S, Gibbons C B, Iii R C L, et al. Exploring Ion Mobility Mass Spectrometry Data File Conversions to Leverage Existing Tools and Enable New Workflows. [J]. Journal of the American Society for Mass Spectrometry, 2024,
[5] Haghani V, Goyal A, Zhang A, et al. Improving rigor and reproducibility in chromatin immunoprecipitation assay data analysis workflows with Rocketchip. [J]. bioRxiv : the preprint server for biology, 2024,
[6] Ding Y, Huang Y, Gao P, et al. Brain image data processing using collaborative data workflows on Texera [J]. Frontiers in Neural Circuits, 2024, (18) 1398884.
[7] Shrivastava M. Optimal Data Placement for Scientific Workflows in Cloud [J]. The Journal of Computer Information Systems, 2024, 64 (4): 501-517.
[8] Tholen D. Gasanalyze R: advancing reproducible research using a new R package for photosynthesis data workflows. [J]. AoB PLANTS, 2024, 16 (4): plae035.
[9] Florian R, Benjamin S, Kai E. Implementing Data Workflows and Data Model Extensions with RDF-star [J]. The Electronic Library, 2024, 42 (3): 393-412.
[10] Simon A, Ned L. Getting Started with DuckDB: A practical guide for accelerating your data science, data analytics, and data engineering workflows [M]. Packt Publishing Limited: 2024-06-24. DOI:10.0000/9781803232539.
[11] Lorella V. Editorial: Data and Workflows for Multilingual Digital Humanities [J]. Journal of Open Humanities Data, 2024, (10) 37.

Downloads: 2987
Visits: 168771

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.