Method and apparatus for generating architecture specific convolution gradient kernels
Assignee
HUAWEI TECHNOLOGIES CO., LTD.
Inventors
Hoai Linh Tran, Zichun Ye, Giancarlo Colmenares, Kai-Ting Amy Wang, Xiaoteng Fu
Abstract
A method for accelerating a convolution operation includes receiving from an I/O interface, a first data set and a second data set. Transforming the first data set into a first converted data set, the first converted data set having the first format. Transforming the second data set into a second converted data set, the second converted data set having the second format. Loading into a convolution functional unit, the first converted data set and the second converted data set, where the convolution functional unit is configured to receive a first data in a first format, to receive a second data in a second format, and to output a third data in a third format. Receiving, by the task scheduler from the convolution functional unit, a result in the third format.
CPC Classifications
Filing Date
2022-03-08
Application No.
17689295
Claims
20