ON-DEVICE NEURAL PROCESSING UNIT WITH HETEROGENEOUS CORES FOR SPECULATIVE DECODING
Application
US20260080227A1
Kind: A1
Mar 19, 2026
Inventors
Lok Won KIM
Abstract
According to the present disclosure, a device is provided. The device includes a first memory of a first capacity configured to store a first generative neural network model comprising a first parameters, and a first neural processing unit configured to generate a response corresponding to an input query utilizing the first generative neural network model stored in the first memory, and wherein the first neural processing unit may be configured to store a first execution code of the first generative neural network model compiled to process speculative decoding.
CPC Classifications
G06N 3/0475
Filing Date
2025-11-18
Application No.
19392189