Adjusting precision and topology parameters for neural network training based on a performance metric
Assignee
Microsoft Technology Licensing, LLC
Inventors
Bita Darvish Rouhani, Eric S. Chung, Daniel Lo, Douglas C. Burger
Abstract
Apparatus and methods for training neural networks based on a performance metric, including adjusting numerical precision and topology as training progresses are disclosed. In some examples, block floating-point formats having relatively lower accuracy are used during early stages of training. Accuracy of the floating-point format can be increased as training progresses based on a determined performance metric. In some examples, values for the neural network are transformed to normal precision floating-point formats. The performance metric can be determined based on entropy of values for the neural network, accuracy of the neural network, or by other suitable techniques. Accelerator hardware can be used to implement certain implementations, including hardware having direct support for block floating-point formats.
CPC Classifications
Filing Date
2018-12-31
Application No.
16237308
Claims
20