SAFETY ALIGNMENT FOR LANGUAGE MODELS USING MODEL-GENERATED SAFETY CATEGORIES
Summary
USPTO published patent application US20260099707A1 covering safety alignment techniques for language models. The application describes generating machine-defined safety labels for model interactions with an ensemble of generative AI models, revising training-data labels by a majority vote between the machine-defined labels and each interaction's predefined label, and training language models on the revised data to implement guardrails that restrict unsafe content generation. The claims cover ensemble-based safety labeling and alignment-training methodologies for AI systems.
What changed
USPTO published a patent application covering techniques for aligning language models with safety guardrails. The disclosed methods use an ensemble of generative AI models to generate machine-defined safety labels for interactions, revise training-data labels through a majority vote between the machine-defined and predefined safety labels, and train language models to restrict unsafe outputs. The techniques address how AI systems can be trained to recognize and filter potentially unsafe responses.
Technology companies developing generative AI language models should monitor this application for implications for AI safety and alignment practices. While a patent application does not itself create licensing obligations, the patent, if granted, could affect how companies implement model safety training techniques.
What to do next
- Monitor for patent grant and potential licensing implications
- Review intellectual property strategy for AI safety techniques
Archived snapshot
Apr 15, 2026. GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.
SAFETY ALIGNMENT FOR LANGUAGE MODELS BASED ON LANGUAGE MODEL-GENERATED SAFETY CATEGORIES
Application US20260099707A1 Kind: A1 Apr 09, 2026
Inventors
Shaona Ghosh, Prasoon Varshney, Makesh Narsimhan Sreedhar, Aishwarya Padmakumar, Traian Eugen Rebedea, Christopher Marc Parisien
Abstract
In various examples, techniques for training a language model to implement guardrails on generated outputs include receiving a data set including a plurality of interactions with a language model, each interaction of the plurality of interactions being associated with a predefined safety label; generating, using an ensemble of generative artificial intelligence models, one or more machine-defined safety labels for each interaction in the plurality of interactions; generating a training data set based on revising a label associated with each interaction of the plurality of interactions, the revising being based on a majority vote of the one or more machine-defined safety labels and the predefined safety label associated with each interaction of the plurality of interactions; and training the language model based on the training data set, wherein the training implements guardrails on an output of the language model such that the language model is restricted from generating responses including unsafe content.
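The label-revision step in the abstract can be illustrated with a short sketch. This is not code from the filing: the function names (`majority_vote`, `revise_labels`) and data shapes are illustrative assumptions, and the ensemble is stubbed with callables standing in for generative safety classifiers. The revised label for each interaction is the majority vote over the ensemble's machine-defined labels plus the interaction's predefined label.

```python
from collections import Counter

def majority_vote(labels):
    """Return the most frequent label; ties resolve to the earliest-listed label."""
    counts = Counter(labels)
    top = counts.most_common(1)[0][1]
    for label in labels:
        if counts[label] == top:
            return label

def revise_labels(dataset, ensemble):
    """Revise each interaction's label by majority vote.

    dataset:  list of (interaction_text, predefined_label) pairs
    ensemble: list of callables, each mapping interaction text -> safety label
    """
    revised = []
    for text, predefined in dataset:
        # Each ensemble model emits a machine-defined safety label.
        machine_labels = [model(text) for model in ensemble]
        # The predefined label votes alongside the machine-defined labels.
        voted = majority_vote(machine_labels + [predefined])
        revised.append((text, voted))
    return revised
```

For example, if two of three stub models label an interaction "unsafe" while the third and the predefined label say "safe", the vote is tied 2-2 and this sketch falls back to the earliest label; a real implementation would need an explicit tie-breaking policy, which the abstract does not specify.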
CPC Classifications
G06N 3/08, G06N 20/20
Filing Date
2025-09-17
Application No.
19331849
About this page
Source document text, dates, docket IDs, and authority are extracted directly from USPTO.
The summary, classification, recommended actions, deadlines, and penalty information are AI-generated from the original text and may contain errors. Always verify against the source document.