System and method for torque-based structured pruning for deep neural networks
Assignee
Samsung Electronics Co., Ltd.
Inventors
Tien C. Bau, Arshita Gupta, Hrishikesh Deepak Garud
Abstract
A method includes accessing a machine learning model, the machine learning model trained using a torque-based constraint. The method also includes receiving an input from an input source and providing the input to the machine learning model. The method also includes receiving an output from the machine learning model. The method also includes instructing at least one action based on the output from the machine learning model. Training the machine learning model includes applying a torque-based constraint on one or more filters of the machine learning model, adjusting, based on applying the torque-based constraint, a first set of one or more filters of the machine learning model to have a higher concentration of weights than a second set of one or more filters of the machine learning model, and pruning at least one channel of the machine learning model based on an average weight for the at least one channel.
CPC Classifications
Filing Date
2022-11-03
Application No.
18052297
Claims
26