Dynamic quantization for energy efficient deep learning

Grant US12591771B2 Kind: B2 Mar 31, 2026

Assignee

QUALCOMM Incorporated

Inventors

Randy Ardywibowo, Venkata Ravi Kiran Dayana, Hau Hwang

Abstract

A method performed by a deep neural network (DNN) includes receiving, at a layer of the DNN during an inference stage, a layer input comprising content associated with a DNN input received at the DNN. The method also includes quantizing one or more parameters of a plurality of parameters associated with the layer based on the content of the layer input. The method further includes performing a task corresponding to the DNN input, the task performed with the one or more one quantized parameters.

CPC Classifications

G06N 3/0495 G06N 3/04 G06N 3/08 G06N 3/048 G06F 18/217

Filing Date

2021-09-28

Application No.

17488261

Claims

View original document →