Dynamic quantization for energy efficient deep learning
Grant
US12591771B2
Kind: B2
Mar 31, 2026
Assignee
QUALCOMM Incorporated
Inventors
Randy Ardywibowo, Venkata Ravi Kiran Dayana, Hau Hwang
Abstract
A method performed by a deep neural network (DNN) includes receiving, at a layer of the DNN during an inference stage, a layer input comprising content associated with a DNN input received at the DNN. The method also includes quantizing one or more parameters of a plurality of parameters associated with the layer based on the content of the layer input. The method further includes performing a task corresponding to the DNN input, the task performed with the one or more one quantized parameters.
CPC Classifications
G06N 3/0495
G06N 3/04
G06N 3/08
G06N 3/048
G06F 18/217
Filing Date
2021-09-28
Application No.
17488261
Claims
28