Self-Learning RL Agent for Supply Chain Management by Kinaxis
Summary
The USPTO published Kinaxis Inc.'s patent application (US20260099785A1) for a self-learning reinforcement learning agent designed for enterprise production scheduling. The system uses dynamic graph modeling and iterative training to generate optimal production schedules autonomously. Competitors developing similar AI-driven supply chain solutions should monitor this publication for potential licensing implications.
What changed
The USPTO published Kinaxis Inc.'s patent application US20260099785A1 for a self-learning reinforcement learning (RL) agent used in enterprise production scheduling. The invention enables the RL agent to interact with a simulated production environment modeled as a dynamic graph, allowing efficient handling of multi-stage scheduling dependencies. The system autonomously learns optimal scheduling policies through iterative training, inference, and continuous learning modes while incorporating user preferences. Key components include a data profiler for historical analysis, a synthesizer for training data generation, and an initializer for environment setup.
Manufacturers and supply chain software developers should review the patent claims to assess potential licensing needs or design-around opportunities. The disclosed methodology using dynamic graph modeling and RL-based policy learning for production scheduling represents a potentially significant advancement over traditional heuristics and genetic algorithms. Companies developing competing AI-driven supply chain optimization solutions should evaluate whether their systems fall within the scope of the disclosed claims.
What to do next
- Review patent claims for potential licensing implications
- Assess whether your AI-driven scheduling systems may infringe disclosed claims
- Monitor related patent publications from Kinaxis for continuation applications
Archived snapshot
Apr 9, 2026GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.
SYSTEMS AND METHODS FOR A SELF-LEARNING, RESILIENT REINFORCEMENT-LEARNING AGENT
Application US20260099785A1 Kind: A1 Apr 09, 2026
Assignee
Kinaxis Inc
Inventors
Saju Peter, Loganathan Balasubramani, Sudhan MANI
Abstract
Systems and methods for enterprise production scheduling using a self-learning, resilient Reinforcement Learning (RL) agent. The RL agent interacts with a simulated production environment modeled as a dynamic graph, enabling efficient handling of complex multi-stage scheduling dependencies. Through iterative training, inference, and continuous learning modes, the agent autonomously learns optimal scheduling policies, adapts to evolving production conditions, and incorporates user preferences. The system includes components such as a data profiler for historical analysis, a synthesizer for training data generation, and an initializer for environment setup. The RL agent generates multiple feasible schedules, refines its policy based on feedback, and significantly reduces computational overhead compared to traditional heuristics and genetic algorithms.
CPC Classifications
G06Q 10/06314 G06N 20/00
Filing Date
2025-10-06
Application No.
19350105
Related changes
Get daily alerts for USPTO Patent Applications - Business Methods (G06Q)
Daily digest delivered to your inbox.
Free. Unsubscribe anytime.
Source
About this page
Every important government, regulator, and court update from around the world. One place. Real-time. Free. Our mission
Source document text, dates, docket IDs, and authority are extracted directly from USPTO.
The plain-English summary, classification, and "what to do next" steps are AI-generated from the original text. Cite the source document, not the AI analysis.
Classification
Who this affects
Taxonomy
Browse Categories
Get alerts for this source
We'll email you when USPTO Patent Applications - Business Methods (G06Q) publishes new changes.