Changeflow GovPing Telecom & Technology Sparse activation-aware weight loading for ML i...
Routine Notice Added Final

Sparse activation-aware weight loading for ML inference

Favicon for changeflow.com USPTO Patent Applications - AI & Computing (G06N)
Published
Detected
Email

Summary

The USPTO published patent application US20260093965A1 by inventors Mahdi Heydari et al. describing techniques for sparse activation-aware weight loading and inference for machine learning models. The application covers methods for identifying non-zero values in activation tensors and loading only corresponding weight channels to optimize memory usage and computational efficiency during ML inference.

Published by USPTO on changeflow.com . Detected, standardized, and enriched by GovPing. Review our methodology and editorial standards .

What changed

USPTO published patent application US20260093965A1 (Application No. 18901836, filed September 30, 2024) describing sparse activation-aware weight loading and inference techniques for machine learning models. The invention generates sparsity maps for activation tensors, identifies non-zero positions, and loads only corresponding weight channels into memory to optimize inference efficiency. CPC classification: G06N 3/0495. Inventors: Mahdi Heydari, Sankalp Dayal, Abhishek Sahadev Sutar, Deepak Shivarudrappa, Tariq Afzal, Rahul Bakshi.

This is a patent application publication providing public notice of the claimed invention. No compliance obligations or deadlines apply. Technology companies developing ML inference systems, semiconductor manufacturers, and AI chip designers may review the application for prior art purposes or to assess potential licensing considerations. Patent prosecution is handled through standard USPTO procedures.

Archived snapshot

Apr 2, 2026

GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.

← USPTO Patent Applications

SPARSE ACTIVATION-AWARE WEIGHT LOADING AND INFERENCE FOR MACHINE LEARNING MODELS

Application US20260093965A1 Kind: A1 Apr 02, 2026

Inventors

Mahdi Heydari, Sankalp Dayal, Abhishek Sahadev Sutar, Deepak Shivarudrappa, Tariq Afzal, Rahul Bakshi

Abstract

Devices and techniques are generally described for sparse activation-aware weight loading and inference for machine learning models. In some examples, a first activation tensor may be generated for first input data. A first sparsity map may be generated for the first activation tensor. The first sparsity map may indicate respective positions of zero values and non-zero values in the first activation tensor. A first set of channels of a weight tensor that correspond to respective non-zero values from the first sparsity map may be identified. The first set of channels of the weight tensor may be loaded into memory. A machine learning model may generate output data based on the first set of channels and the first activation tensor.

CPC Classifications

G06N 3/0495

Filing Date

2024-09-30

Application No.

18901836

View original document →

Get daily alerts for USPTO Patent Applications - AI & Computing (G06N)

Daily digest delivered to your inbox.

Free. Unsubscribe anytime.

About this page

What is GovPing?

Every important government, regulator, and court update from around the world. One place. Real-time. Free. Our mission

What's from the agency?

Source document text, dates, docket IDs, and authority are extracted directly from USPTO.

What's AI-generated?

The summary, classification, recommended actions, deadlines, and penalty information are AI-generated from the original text and may contain errors. Always verify against the source document.

Last updated

Classification

Agency
USPTO
Published
April 2nd, 2026
Instrument
Notice
Legal weight
Non-binding
Stage
Final
Change scope
Minor
Document ID
US20260093965A1

Who this affects

Applies to
Technology companies Manufacturers
Industry sector
3341 Computer & Electronics Manufacturing 5112 Software & Technology
Geographic scope
United States US

Taxonomy

Primary area
Artificial Intelligence
Operational domain
Legal
Topics
Machine Learning Semiconductor Design Memory Optimization

Get alerts for this source

We'll email you when USPTO Patent Applications - AI & Computing (G06N) publishes new changes.

Free. Unsubscribe anytime.

You're subscribed!