Automated Pipeline for Training Language Models

USPTO

Routine Notice Added Final

USPTO Patent Applications - AI & Computing (G06N)

Published April 9th, 2026

Detected April 18th, 2026

Summary

USPTO has published patent application US20260099674A1 assigned to U.S. Bank National Association, describing a method and system for generating training datasets for language models using an automated pipeline. The system receives input samples, performs rephrasing operations via a generative LLM to produce semantically equivalent versions with different phrasing, labels entity references, and aggregates them into an expanded labeled dataset for natural language processing model training.

Published by USPTO on changeflow.com. Detected, standardized, and enriched by GovPing.

What changed

USPTO has published patent application US20260099674A1 assigned to U.S. Bank National Association, describing an automated pipeline for generating training datasets for language models. The system receives input samples, performs rephrasing operations using a generative LLM to produce semantically equivalent versions with different phrasing, labels entity references in the generated versions, and aggregates them into an expanded labeled dataset for NLP model training.

This document is informational in nature and does not create compliance obligations for other entities. It represents a patent filing by a commercial bank related to AI/LLM training data generation technology.
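
As a rough illustration of the steps described above (receive input samples, rephrase them, label entity references, aggregate the results), the following Python sketch wires them together. It is not the method claimed in the application: `toy_rephrase` is a hypothetical template-based stand-in for the generative LLM call, `label_entities` is a hypothetical regex labeler, and every name, pattern, and sample sentence here is an assumption made for illustration.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Tuple

# (start, end, label) character span of one entity reference.
Span = Tuple[int, int, str]


@dataclass
class LabeledSample:
    text: str
    entities: List[Span]


def toy_rephrase(text: str) -> List[str]:
    """Hypothetical stand-in for a generative LLM rephrasing call.

    Wraps the request in fixed templates so the meaning is kept
    while the phrasing changes.
    """
    core = text[0].lower() + text[1:].rstrip(".")
    templates = ["Please {}", "Could you {}?", "I would like to {}"]
    return [t.format(core) for t in templates]


# Assumed toy entity patterns; a real labeler would be far richer.
ENTITY_PATTERNS = {
    "AMOUNT": re.compile(r"\$\d+(?:\.\d{2})?"),
    "PERSON": re.compile(r"\b(?:Alice|Bob)\b"),
}


def label_entities(text: str) -> List[Span]:
    """Label every entity reference found by the toy patterns."""
    spans = [
        (m.start(), m.end(), label)
        for label, pattern in ENTITY_PATTERNS.items()
        for m in pattern.finditer(text)
    ]
    return sorted(spans)


def build_expanded_dataset(
    samples: List[str],
    rephrase: Callable[[str], List[str]] = toy_rephrase,
    label: Callable[[str], List[Span]] = label_entities,
) -> List[LabeledSample]:
    """Rephrase each input sample, label each generated version,
    and aggregate the labeled versions into one dataset."""
    dataset = []
    for sample in samples:
        for version in rephrase(sample):
            dataset.append(LabeledSample(version, label(version)))
    return dataset


if __name__ == "__main__":
    for item in build_expanded_dataset(["Transfer $50 to Alice."]):
        print(item.text, item.entities)
```

In a real system the `rephrase` and `label` callables would presumably be replaced by a model call and a proper entity labeler; only the overall receive, rephrase, label, aggregate flow is taken from the summary.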

Archived snapshot

Apr 18, 2026

GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.

AUTOMATED PIPELINE FOR TRAINING LANGUAGE MODELS

Application US20260099674A1 Kind: A1 Apr 09, 2026

Assignee

U.S. Bank National Association

Inventors

Giacomo Domeniconi, Ali Fathi, Samuel A. Assefa, Kausik Gangopadhyay, Christopher Taggert, Samuel Atkins

Abstract

The disclosed embodiments describe a method, system, and computer-readable medium for generating a training dataset for training a model in the field of natural language processing. The method involves receiving a set of input samples and performing a rephrasing operation to produce new versions of the set of input samples, where the new versions are semantically equivalent to the set of input samples but have different phrasing. A dataset of generated versions of the input samples is generated using a generative Language Learning Model (LLM), all entity references present in the generated versions of the input samples are labeled, and the generated versions of the input samples and their corresponding labeled versions are aggregated to form an expanded labeled dataset.

CPC Classifications

G06F 40/295 G06F 40/284 G06F 40/40 G06N 3/0475 G06N 3/094

Filing Date

2024-10-08

Application No.

18909206

Related changes

Topological Sparse Training Process for Machine Learning Models

Routine Apr 17, 2026 • USPTO Patent Applications - AI & Computing (G06N) • Telecom & Technology

4D Generative Models From in the Wild Videos

Priority review Apr 18, 2026 • USPTO Patent Applications - AI & Computing (G06N) • Telecom & Technology

Pyramid Key-Value Cache Compression for Transformer Models

Routine Apr 17, 2026 • USPTO Patent Applications - AI & Computing (G06N) • Telecom & Technology

Source

USPTO Patent Applications - AI & Computing (G06N) changeflow.com/changebridge/uspto-patent-applications/G06N

Telecom & Technology

About this page

What is GovPing?

Every important government, regulator, and court update from around the world. One place. Real-time. Free.

What's from the agency?

Source document text, dates, docket IDs, and authority are extracted directly from USPTO.

What's AI-generated?

The summary, classification, recommended actions, deadlines, and penalty information are AI-generated from the original text and may contain errors. Always verify against the source document.

Last updated

April 18, 2026

Classification

Agency

USPTO

Published

April 9th, 2026

Instrument

Notice

Legal weight

Non-binding

Stage

Final

Change scope

Minor

Document ID

US20260099674A1

Who this affects

Applies to

Banks

Industry sector

5221 Commercial Banking

Activity scope

Patent application filing; AI/ML model training; Data pipeline development

Geographic scope

United States (US)

Taxonomy

Primary area

Intellectual Property

Operational domain

Legal

Topics

Artificial Intelligence; Software & Technology
