LATENCY-AWARE-BASED SERVERLESS REQUEST SCHEDULING APPARATUS AND SYSTEM
Inventors
Gingfung Matthew YEUNG, Jianfeng WANG
Abstract
The apparatus includes a first scheduling module and a second scheduling module. The first scheduling module determines the user request at the queue head of a current request queue as a target user request, and sends a pod creation request to the second scheduling module when it determines that no target pod is available that meets an execution condition and can execute the target user request. The second scheduling module, when it determines based on the pod creation request that a new pod meets a node creation condition, selects a target node from a plurality of nodes, creates the new pod in the target node, and sends information about the new pod to the first scheduling module. The first scheduling module then manages the new pod based on the received information, determines the new pod as the target pod, and sends the target user request to the target pod for execution.
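The interaction between the two scheduling modules can be illustrated with a minimal Python sketch. It is not the claimed implementation. All class and method names (`FirstScheduler`, `SecondScheduler`, `dispatch_next`, `create_pod`) are hypothetical, and the abstract does not define the conditions used here: the execution condition is assumed to be "the pod is idle", the node creation condition is assumed to be a cluster-wide pod cap plus spare node capacity, and the target node is assumed to be the least-loaded one.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Pod:
    name: str
    node: str
    busy: bool = False


@dataclass
class Node:
    name: str
    capacity: int                      # assumed: max pods this node can host
    pods: list = field(default_factory=list)


class SecondScheduler:
    """Hypothetical second scheduling module: places new pods on nodes."""

    def __init__(self, nodes, max_pods):
        self.nodes = nodes
        self.max_pods = max_pods       # assumed node creation condition (cap)
        self._created = 0

    def create_pod(self):
        # Assumed node creation condition: under the cap and a node has room.
        if self._created >= self.max_pods:
            return None
        candidates = [n for n in self.nodes if len(n.pods) < n.capacity]
        if not candidates:
            return None
        # Assumed selection policy: pick the least-loaded node.
        target = min(candidates, key=lambda n: len(n.pods))
        self._created += 1
        pod = Pod(name=f"pod-{self._created}", node=target.name)
        target.pods.append(pod)
        return pod                     # "information about the new pod"


class FirstScheduler:
    """Hypothetical first scheduling module: owns the queue and known pods."""

    def __init__(self, second):
        self.queue = deque()
        self.pods = []
        self.second = second

    def submit(self, request):
        self.queue.append(request)

    def dispatch_next(self):
        if not self.queue:
            return None
        request = self.queue[0]        # target user request at the queue head
        # Assumed execution condition: the pod is idle.
        pod = next((p for p in self.pods if not p.busy), None)
        if pod is None:
            pod = self.second.create_pod()   # pod creation request
            if pod is None:
                return None            # request stays at the queue head
            self.pods.append(pod)      # manage the newly created pod
        self.queue.popleft()
        pod.busy = True
        return request, pod.name
```

In this sketch a request that cannot be placed remains at the queue head, so a later call to `dispatch_next` can retry it once a pod becomes idle or capacity frees up.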
CPC Classifications
Filing Date
2025-11-21
Application No.
19396668