Methods and Apparatus to Process Training Data for an AI-Based Model
Summary
The USPTO published patent application US20260099759A1 by Niall Fitzgerald, covering methods and apparatus for processing AI training data using feature transformation, hash signature generation, and clustering techniques. The application relates to apparatus comprising interface circuitry and programmable circuits to filter training data clusters and train AI-based models. The application was filed on October 4, 2024, and published on April 9, 2026.
What changed
The USPTO published patent application US20260099759A1 on April 9, 2026, covering methods and apparatus for processing AI training data. The invention involves transforming data samples into features, generating hash signatures for those features, grouping the features into clusters based on the signatures, and filtering out features in any cluster that contains more than a threshold number of features, producing a filtered data set for training AI models. A published patent application does not impose compliance obligations on third parties, and it becomes enforceable intellectual property only if a patent is granted. Technology companies and AI developers building similar training data processing systems may want to take the application into account.
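To make the claimed pipeline easier to picture, here is a minimal Python sketch of the four steps. It is an illustration under stated assumptions, not the applicant's implementation: the application text captured here does not specify the feature transform, the hash function, or how an oversized cluster is trimmed. The names `to_feature`, `signature`, and `filter_samples` are hypothetical, the feature transform is simple text normalization, the signature is a SHA-256 digest, and "filtering out" is read as keeping at most `threshold` features per cluster.

```python
import hashlib
from collections import defaultdict


def to_feature(sample: str) -> str:
    # Assumed feature transform: lowercase and collapse whitespace.
    return " ".join(sample.lower().split())


def signature(feature: str) -> str:
    # Assumed hash signature: SHA-256 digest of the feature text.
    return hashlib.sha256(feature.encode("utf-8")).hexdigest()


def filter_samples(samples: list[str], threshold: int) -> list[str]:
    # Group features into clusters keyed by their hash signature.
    clusters: dict[str, list[str]] = defaultdict(list)
    for sample in samples:
        feature = to_feature(sample)
        clusters[signature(feature)].append(feature)

    # Assumed filtering rule: a cluster with more than `threshold`
    # features is trimmed to its first `threshold` members.
    filtered: list[str] = []
    for members in clusters.values():
        filtered.extend(members[:threshold])
    return filtered


if __name__ == "__main__":
    samples = ["Hello  world", "hello world", "HELLO WORLD", "goodbye"]
    # The three "hello world" variants share one signature, so one is dropped.
    print(filter_samples(samples, threshold=2))
```

An exact digest only groups features that are identical after normalization. A locality-sensitive scheme would also group near-duplicates, but whether the application uses one is not stated in the text above. The filtered list would then be passed to whatever training routine the model uses.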
What to do next
- Monitor for updates
Archived snapshot
Apr 14, 2026: GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.
METHODS AND APPARATUS TO PROCESS TRAINING DATA FOR AN AI-BASED MODEL
Application US20260099759A1 Kind: A1 Apr 09, 2026
Inventors
Niall Fitzgerald
Abstract
An example apparatus includes interface circuitry to obtain data samples to train an AI-based model; machine readable instructions; and at least one programmable circuit to at least one of instantiate or execute the machine readable instructions to: transform the data samples into features; generate hash signatures for corresponding ones of the features; group the features into clusters based on the hash signatures; generate a filtered data set by filtering out features within a cluster of features having more than a threshold number of features; and train the AI-based model based on the filtered data set.
CPC Classifications
G06N 20/00
Filing Date
2024-10-04
Application No.
18907233
About this page
Source document text, dates, docket IDs, and authority are extracted directly from USPTO.
The summary, classification, recommended actions, deadlines, and penalty information are AI-generated from the original text and may contain errors. Always verify against the source document.