Sparse activation-aware weight loading for ML inference
Summary
The USPTO published patent application US20260093965A1 by inventors Mahdi Heydari et al. describing techniques for sparse activation-aware weight loading and inference for machine learning models. The application covers methods for identifying non-zero values in activation tensors and loading only corresponding weight channels to optimize memory usage and computational efficiency during ML inference.
What changed
USPTO published patent application US20260093965A1 (Application No. 18901836, filed September 30, 2024) describing sparse activation-aware weight loading and inference techniques for machine learning models. The invention generates sparsity maps for activation tensors, identifies non-zero positions, and loads only corresponding weight channels into memory to optimize inference efficiency. CPC classification: G06N 3/0495. Inventors: Mahdi Heydari, Sankalp Dayal, Abhishek Sahadev Sutar, Deepak Shivarudrappa, Tariq Afzal, Rahul Bakshi.
This is a patent application publication providing public notice of the claimed invention. No compliance obligations or deadlines apply. Technology companies developing ML inference systems, semiconductor manufacturers, and AI chip designers may review the application for prior art purposes or to assess potential licensing considerations. Patent prosecution is handled through standard USPTO procedures.
Archived snapshot
Apr 2, 2026GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.
SPARSE ACTIVATION-AWARE WEIGHT LOADING AND INFERENCE FOR MACHINE LEARNING MODELS
Application US20260093965A1 Kind: A1 Apr 02, 2026
Inventors
Mahdi Heydari, Sankalp Dayal, Abhishek Sahadev Sutar, Deepak Shivarudrappa, Tariq Afzal, Rahul Bakshi
Abstract
Devices and techniques are generally described for sparse activation-aware weight loading and inference for machine learning models. In some examples, a first activation tensor may be generated for first input data. A first sparsity map may be generated for the first activation tensor. The first sparsity map may indicate respective positions of zero values and non-zero values in the first activation tensor. A first set of channels of a weight tensor that correspond to respective non-zero values from the first sparsity map may be identified. The first set of channels of the weight tensor may be loaded into memory. A machine learning model may generate output data based on the first set of channels and the first activation tensor.
CPC Classifications
G06N 3/0495
Filing Date
2024-09-30
Application No.
18901836
Related changes
Get daily alerts for USPTO Patent Applications - AI & Computing (G06N)
Daily digest delivered to your inbox.
Free. Unsubscribe anytime.
Source
About this page
Every important government, regulator, and court update from around the world. One place. Real-time. Free. Our mission
Source document text, dates, docket IDs, and authority are extracted directly from USPTO.
The summary, classification, recommended actions, deadlines, and penalty information are AI-generated from the original text and may contain errors. Always verify against the source document.
Classification
Who this affects
Taxonomy
Browse Categories
Get alerts for this source
We'll email you when USPTO Patent Applications - AI & Computing (G06N) publishes new changes.
Subscribed!
Optional. Filters your digest to exactly the updates that matter to you.