ON-DEVICE NEURAL PROCESSING UNIT WITH HETEROGENEOUS CORES FOR SPECULATIVE DECODING

Application US20260080227A1 Kind: A1 Mar 19, 2026

Inventors

Lok Won KIM

Abstract

According to the present disclosure, a device is provided. The device includes a first memory of a first capacity configured to store a first generative neural network model comprising first parameters, and a first neural processing unit configured to generate a response corresponding to an input query using the first generative neural network model stored in the first memory. The first neural processing unit may be configured to store a first execution code of the first generative neural network model compiled to process speculative decoding.
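
The abstract names speculative decoding without describing the scheme. For orientation only, the following is a minimal sketch of the generic greedy draft-and-verify form of the technique, not the method claimed in the application. It assumes a small draft model that proposes several tokens and a larger target model that checks them. All names here (`speculative_decode`, `target_next`, `draft_next`, `draft_len`) and the toy models are hypothetical and do not come from the application.

```python
from typing import Callable, List

Token = int
# Hypothetical interface: a model maps a token context to its greedy next token.
NextToken = Callable[[List[Token]], Token]


def speculative_decode(
    target_next: NextToken,
    draft_next: NextToken,
    prompt: List[Token],
    max_new_tokens: int,
    draft_len: int = 4,
) -> List[Token]:
    """Greedy draft-and-verify decoding (illustrative sketch only)."""
    tokens = list(prompt)
    limit = len(prompt) + max_new_tokens
    while len(tokens) < limit:
        # 1. The draft model proposes a short run of tokens.
        proposal: List[Token] = []
        for _ in range(min(draft_len, limit - len(tokens))):
            proposal.append(draft_next(tokens + proposal))
        # 2. The target model checks each proposed token. A real system
        #    would score all positions in one batched pass; this sketch
        #    calls the target once per position for clarity.
        for i, tok in enumerate(proposal):
            expected = target_next(tokens + proposal[:i])
            if tok != expected:
                # Keep the accepted prefix plus the target's correction.
                tokens.extend(proposal[:i])
                tokens.append(expected)
                break
        else:
            # Every proposed token matched the target.
            tokens.extend(proposal)
    return tokens


# Toy stand-ins for the two models (assumptions for illustration).
def toy_target(ctx: List[Token]) -> Token:
    return (ctx[-1] + 1) % 10


def toy_draft(ctx: List[Token]) -> Token:
    # Agrees with the target except after token 4, where it guesses wrong.
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10


def greedy_decode(next_token: NextToken, prompt: List[Token], n: int) -> List[Token]:
    out = list(prompt)
    for _ in range(n):
        out.append(next_token(out))
    return out
```

In this greedy variant the output matches what the target model would produce on its own, so any benefit comes from verifying several positions at once rather than from changing the result. Sampling-based variants use a probabilistic acceptance rule that is not shown here.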

CPC Classifications

G06N 3/0475

Filing Date

2025-11-18

Application No.

19392189