Natural language processing with contextual data associated with content displayed by a computing device
Grant
US12579973B2
Kind: B2
Mar 17, 2026
Assignee
Amazon Technologies, Inc.
Inventors
Angeliki Metallinou, Rahul Goel, Vishal Ishwar
Abstract
Multi-modal natural language processing systems are provided. Some systems are context-aware systems that use multi-modal data to improve the accuracy of natural language understanding as it is applied to spoken language input. Machine learning architectures are provided that jointly model spoken language input (“utterances”) and information displayed on a visual display (“on-screen information”). Such machine learning architectures can improve upon, and solve problems inherent in, existing spoken language understanding systems that operate in multi-modal contexts.
CPC Classifications
G10L 15/16
G10L 15/18
G10L 15/1822
G10L 15/183
G10L 15/24
G06N 20/00
Filing Date
2023-12-07
Application No.
18532969
Claims
18