← USPTO Patent Grants

Inference service deployment method, device, and storage medium

Grant US12591462B2 Kind: B2 Mar 31, 2026

Assignee

Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors

Zhengxiong Yuan, Zhenfang Chu, Jinqi Li, Mingren Hu, Guobin Wang, Yang Luo, Yue Huang, Zhengyu Qian, En Shi

Abstract

Provided are an inference service deployment method, a device and a storage medium, relating to the field of artificial intelligence technology, and in particular to the field of machine learning and inference service technology. The inference service deployment method includes: obtaining performance information of a runtime environment of a deployment end; selecting a target version of an inference service from a plurality of candidate versions of the inference service of a model according to the performance information of the runtime environment of the deployment end; and deploying the target version of the inference service to the deployment end.

CPC Classifications

G06N 3/04 G06N 20/00 G06F 9/505 G06F 11/3409

Filing Date

2022-11-03

Application No.

17980204

Claims

17