Fine-Tuning a Target Generative Neural Network Using an Improvement Generative Neural Network
Summary
Six inventors (Wang, Fan, Zhao, Ramachandran, Yang, Jain) filed USPTO patent application US20260099906A1 for methods to train generative neural networks. The technique uses an improvement neural network to process outputs from a target network and generate preferred training examples iteratively, reducing reliance on static datasets and external human annotation. The application was published April 9, 2026, following an October 3, 2025 filing date.
What changed
USPTO published patent application US20260099906A1 disclosing methods for training generative neural networks through iterative preference learning. The target network generates a data item, the improvement network processes it to create a preferred version, and a training example pairing the two is used to update the target network. This eliminates reliance on static offline datasets and computationally expensive reward models.
For technology companies developing AI/ML systems, this patent describes techniques that could affect approaches to training generative models. Patent applications represent an early stage of IP protection with no immediate compliance obligations. However, companies in AI development should monitor published applications in this space for freedom-to-operate considerations. The filing has no stated deadlines, fees, or regulatory obligations at this stage.
Archived snapshot
Apr 18, 2026GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.
FINE-TUNING A TARGET GENERATIVE NEURAL NETWORK USING AN IMPROVEMENT GENERATIVE NEURAL NETWORK
Application US20260099906A1 Kind: A1 Apr 09, 2026
Inventors
Qifei Wang, Ying Fan, Yang Zhao, Deepak Ramachandran, Feng Yang, Rahul Anant Jain
Abstract
Methods, systems, and apparatuses, including computer programs encoded on computer storage media, for training a target generative neural network over a plurality of training iterations. At each iteration, a first data item is generated by processing a conditioning input using the target generative neural network. An improvement generative neural network then processes the first data item and the conditioning input to generate a second, preferred data item. A training example is generated that includes the first and second data items and indicates that the second data item is preferred over the first. The target generative neural network is then trained on this training example. By using this iterative process to dynamically generate preference data, the described techniques improve the performance of the generative neural network beyond the limitations of static, offline datasets without requiring computationally expensive reward models or external human annotation.
CPC Classifications
G06T 5/70 G06N 3/0475 G06T 5/60 G06T 2207/20081
Filing Date
2025-10-03
Application No.
19349179
Related changes
Get daily alerts for USPTO Patent Applications - AI & Computing (G06N)
Daily digest delivered to your inbox.
Free. Unsubscribe anytime.
Source
About this page
Every important government, regulator, and court update from around the world. One place. Real-time. Free. Our mission
Source document text, dates, docket IDs, and authority are extracted directly from USPTO.
The summary, classification, recommended actions, deadlines, and penalty information are AI-generated from the original text and may contain errors. Always verify against the source document.
Classification
Who this affects
Taxonomy
Browse Categories
Get alerts for this source
We'll email you when USPTO Patent Applications - AI & Computing (G06N) publishes new changes.
Subscribed!
Optional. Filters your digest to exactly the updates that matter to you.