Schedule multi-instance microservices to minimize response time under budget constraint in cloud HPC systems
File version
Version of Record (VoR)
Author(s)
Shen, Hong
Tian, Hui
Yang, Yuanhao
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
Abstract
In the emerging microservice-based architecture of cloud HPC systems, a challenging problem of critical importance for system service capability is how we can schedule microservices to minimize the end-to-end response time for user requests while keeping cost within the specified budget. We address this problem for multi-instance microservices requested by a single application to which no existing result is known to our knowledge. We propose an effective two-stage solution of first allocating budget (resources) to microservices within the budget constraint and then deploying microservice instances on servers to minimize system operational overhead. For budget allocation, we formulate it as the Discrete Time Cost Tradeoff (DTCT) problem which is NP-hard, present a linear program (LP) based algorithm, and provide a rigorous proof of its worst-case performance guarantee of 4 from the optimal solution. For microservice deployment, we show that it is harder than the NP-hard problem of 1-D binpacking through establishing its mathematical model, and propose a heuristic algorithm of Least First Mapping that greedily places microservice instances on fewest possible servers to minimize system operation cost. The experiment results of extensive simulations on DAG-based applications of different sizes demonstrate the superior performance of our algorithm in comparison with the existing approaches.
Journal Title
Journal of Parallel and Distributed Computing
Conference Title
Book Title
Edition
Volume
202
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2025 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation
Wang, D; Shen, H; Tian, H; Yang, Y, Schedule multi-instance microservices to minimize response time under budget constraint in cloud HPC systems, Journal of Parallel and Distributed Computing, 2025, 202, pp. 105086