Expert Selection System for MoE Large Language Models
Summary
USPTO published patent application US20260099697A1 for an expert selection system in Mixture of Experts (MoE) large language models. The system includes a global selector that manages selection between global and local modes, and a pre-fetcher that pre-loads selected global expert sets from main memory into faster cache memory. Inventors include Usman Sajid, Marie Mai Nguyen, Shuyi Pei, Younghoon Kim, and Rekha Pitchumani. The filing date was October 1, 2025, under application number 19347703.
What changed
USPTO published patent application US20260099697A1 disclosing a system and method for expert selection in Mixture of Experts (MoE) large language models. The invention comprises a global selector configured to operate in either global mode (selecting a global expert set for each layer prior to inference) or local mode, along with a pre-fetcher that moves selected global expert sets from main memory into faster cache memory. The system identifies and pre-loads "global hot experts" expected to be frequently accessed.
Affected parties include AI researchers, LLM developers, semiconductor manufacturers designing AI accelerators, and cloud computing providers deploying large language models. The patent addresses computational efficiency challenges in MoE architectures where multiple specialized expert networks must be selectively activated. Organizations developing or deploying MoE-based AI systems may wish to monitor this application for potential licensing considerations or design-around opportunities.
What to do next
- Monitor for patent grant status
Archived snapshot
Apr 12, 2026GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.
EXPERT SELECTION FROM MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS
Application US20260099697A1 Kind: A1 Apr 09, 2026
Inventors
Usman SAJID, Marie Mai NGUYEN, Shuyi PEI, Younghoon KIM, Rekha PITCHUMANI
Abstract
A system and a method for a machine learning (ML) model with expert selection are disclosed. The model includes a global selector and a pre-fetcher. The global selector is configured to manage a selection scheme having at least one of a global mode or a local mode. In the global mode, the global selector selects a global expert set from a mixture of experts (MoE) to generate a selected global expert set for each layer prior to an inference phase in the ML model. The pre-fetcher is configured to pre-fetch in the global mode the selected global expert set from a first memory into a second memory. The selected global expert set includes one or more global hot experts.
CPC Classifications
G06N 3/045 G06N 3/0499
Filing Date
2025-10-01
Application No.
19347703
Related changes
Get daily alerts for USPTO Patent Applications - AI & Computing (G06N)
Daily digest delivered to your inbox.
Free. Unsubscribe anytime.
Source
About this page
Every important government, regulator, and court update from around the world. One place. Real-time. Free. Our mission
Source document text, dates, docket IDs, and authority are extracted directly from USPTO.
The summary, classification, recommended actions, deadlines, and penalty information are AI-generated from the original text and may contain errors. Always verify against the source document.
Classification
Who this affects
Taxonomy
Browse Categories
Get alerts for this source
We'll email you when USPTO Patent Applications - AI & Computing (G06N) publishes new changes.
Subscribed!
Optional. Filters your digest to exactly the updates that matter to you.