Resilient Optimizer States for Fully Sharded Data Parallel Distributed ML Training

USPTO

Changeflow GovPing Telecom & Technology Resilient Optimizer States for Fully Sharded Da...

Routine Notice Added Final

Resilient Optimizer States for Fully Sharded Data Parallel Distributed ML Training

USPTO Patent Applications - AI & Computing (G06N)

Published April 9th, 2026

Detected April 9th, 2026

Email

Summary

USPTO published patent application US20260099411A1 for systems and methods enabling failure resiliency in distributed machine learning model training. The invention allows compute nodes to store replicated optimizer shards and recover from node failures by reconstructing optimizer state from surviving replicas. The application names five inventors and claims priority to filing date December 11, 2025.

View original document View source feed page

What changed

USPTO published application US20260099411A1 disclosing systems for maintaining resilient optimizer states in fully sharded data parallel distributed ML training environments. The invention addresses failure recovery by storing replicated optimizer shard portions across multiple compute nodes, enabling any surviving node to reconstruct the optimizer state of a failed node. This allows ML model training to continue without full checkpoint restoration delays.

Technology companies developing distributed training infrastructure, AI research organizations, and cloud service providers offering ML compute resources should monitor this filing for competitive intelligence. The patent's claims covering shard replication and dynamic state reconstruction in distributed training could affect how companies design fault-tolerant ML training pipelines. If granted, the patent may influence approaches to optimizer state management in large-scale model training systems.

What to do next

Monitor for patent grant and claims examination outcomes
Review for freedom-to-operate implications if developing similar distributed ML training systems

Archived snapshot

Apr 9, 2026

GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.

← USPTO Patent Applications

RESILIENT OPTIMIZER STATES FOR FULLY SHARDED DATA PARALLEL

Application US20260099411A1 Kind: A1 Apr 09, 2026

Inventors

Lianjie Cao, Saeed Rashidi, Garrett Goon, Paolo Faraboschi, Puneet Sharma

Abstract

Systems and methods are provided for failure resiliency in distributed training of machine learning (ML) models. Examples include a plurality of compute nodes storing optimizer shards of a plurality of optimizer shards and a first compute node storing a first optimizer shard of optimizer states. The first compute node can store optimizer shard portions, each of which can be received from a respective compute node of the plurality of compute nodes and can be a replica of a portion of a respective optimizer shard of the plurality of optimizer shards, stored at the respective compute node. Responsive to a failure of a compute node of the plurality of compute nodes, the first compute node can update the first optimizer shard with an optimizer shard portion corresponding to the failed compute node and the ML model can be trained based on the updated first optimizer shard.

CPC Classifications

G06F 11/2028 G06N 3/098

Filing Date

2025-12-11

Application No.

19416964

View original document →

Named provisions

Abstract CPC Classifications Inventors Filing Date Application No.

Related changes

Secure Data Authorization Using Cryptographic Hash Tokens

Routine Apr 09, 2026 • USPTO Patent Applications - Networking (H04L) • Telecom & Technology

Machine Learning Data Messaging for Reservation Management

Routine Apr 09, 2026 • USPTO Patent Applications - Business Methods (G06Q) • Banking & Finance

Conversational AI System for Real-Time Cooking Guidance

Routine Apr 09, 2026 • USPTO Patent Applications - AI & Computing (G06N) • Telecom & Technology

Get daily alerts for USPTO Patent Applications - AI & Computing (G06N)

Daily digest delivered to your inbox.

Free. Unsubscribe anytime.

Source

USPTO Patent Applications - AI & Computing (G06N) changeflow.com/changebridge/uspto-patent-applications/G06N

Telecom & Technology

About this page

What is GovPing?

Every important government, regulator, and court update from around the world. One place. Real-time. Free. Our mission

What's from the agency?

Source document text, dates, docket IDs, and authority are extracted directly from USPTO.

What's AI-generated?

The plain-English summary, classification, and "what to do next" steps are AI-generated from the original text. Cite the source document, not the AI analysis.

Last updated

April 9, 2026

Press inquiries →

Classification

Agency

USPTO

Published

April 9th, 2026

Instrument

Notice

Legal weight

Binding

Stage

Final

Change scope

Minor

Document ID

US20260099411A1

Docket

19416964

Who this affects

Applies to

Technology companies Manufacturers Investors

Industry sector

5112 Software & Technology

Activity scope

Patent filing Distributed ML training Optimizer state management

Geographic scope

United States US

Taxonomy

Primary area

Intellectual Property

Operational domain

Legal

Topics

Artificial Intelligence Data Privacy Cybersecurity

Resilient Optimizer States for Fully Sharded Data Parallel Distributed ML Training

Summary

What changed

What to do next

Archived snapshot

RESILIENT OPTIMIZER STATES FOR FULLY SHARDED DATA PARALLEL

Inventors

Abstract

CPC Classifications

Filing Date

Application No.

Named provisions

Related changes

Source

About this page

Classification

Who this affects

Taxonomy

Browse Categories

Get alerts for this source