Inference service deployment method, device, and storage medium
Assignee
Beijing Baidu Netcom Science Technology Co., Ltd.
Inventors
Zhengxiong Yuan, Zhenfang Chu, Jinqi Li, Mingren Hu, Guobin Wang, Yang Luo, Yue Huang, Zhengyu Qian, En Shi
Abstract
Provided are an inference service deployment method, a device, and a storage medium, relating to the field of artificial intelligence technology, and in particular to the field of machine learning and inference service technology. The inference service deployment method includes: obtaining performance information of a runtime environment of a deployment end; selecting a target version of an inference service from a plurality of candidate versions of the inference service of a model according to the performance information of the runtime environment of the deployment end; and deploying the target version of the inference service to the deployment end.
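The three steps named in the abstract (obtain performance information, select a target version, deploy it) can be illustrated with a minimal Python sketch. It is not the claimed implementation. The abstract does not say what the performance information contains or how the selection is made, so every name here is hypothetical: the fields of `RuntimePerformance` and `CandidateVersion`, the helper functions, and the dictionary standing in for a deployment end. The selection rule is likewise an assumption chosen for illustration: keep the candidates whose resource needs fit the runtime environment and pick the one with the highest expected throughput.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class RuntimePerformance:
    """Hypothetical performance information reported by a deployment end."""
    cpu_cores: int
    memory_mb: int
    has_gpu: bool


@dataclass(frozen=True)
class CandidateVersion:
    """Hypothetical description of one candidate version of an inference service."""
    name: str
    min_cpu_cores: int
    min_memory_mb: int
    needs_gpu: bool
    expected_qps: float  # assumed throughput estimate, used only for ranking


def fits(candidate: CandidateVersion, perf: RuntimePerformance) -> bool:
    """Return True if the runtime environment meets the candidate's needs."""
    return (
        perf.cpu_cores >= candidate.min_cpu_cores
        and perf.memory_mb >= candidate.min_memory_mb
        and (perf.has_gpu or not candidate.needs_gpu)
    )


def select_target_version(
    candidates: Sequence[CandidateVersion], perf: RuntimePerformance
) -> Optional[CandidateVersion]:
    """Assumed rule: among candidates that fit, pick the highest expected QPS."""
    eligible = [c for c in candidates if fits(c, perf)]
    if not eligible:
        return None
    return max(eligible, key=lambda c: c.expected_qps)


def deploy_inference_service(
    candidates: Sequence[CandidateVersion], deployment_end: dict
) -> Optional[str]:
    """Run the three steps against a dict that stands in for a deployment end."""
    # Step 1: obtain performance information of the runtime environment.
    perf: RuntimePerformance = deployment_end["performance"]
    # Step 2: select the target version from the candidate versions.
    target = select_target_version(candidates, perf)
    if target is None:
        return None
    # Step 3: "deploy" by recording the chosen version (placeholder for a real rollout).
    deployment_end["deployed_version"] = target.name
    return target.name


if __name__ == "__main__":
    end = {"performance": RuntimePerformance(cpu_cores=4, memory_mb=8192, has_gpu=False)}
    versions = [
        CandidateVersion("gpu-fp16", 2, 4096, True, 900.0),
        CandidateVersion("cpu-medium", 4, 4096, False, 200.0),
        CandidateVersion("cpu-small", 2, 2048, False, 120.0),
    ]
    print(deploy_inference_service(versions, end))
```

In this sketch a deployment end without a GPU never receives the GPU version, however fast that version is. A real system could rank candidates on other criteria, such as latency or cost.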
CPC Classifications
Filing Date
2022-11-03
Application No.
17980204
Claims
17