← USPTO Patent Grants

Natural language processing with contextual data associated with content displayed by a computing device

Grant US12579973B2 Kind: B2 Mar 17, 2026

Assignee

Amazon Technologies, Inc.

Inventors

Angeliki Metallinou, Rahul Goel, Vishal Ishwar

Abstract

Multi-modal natural language processing systems are provided. Some systems are context-aware systems that use multi-modal data to improve the accuracy of natural language understanding as it is applied to spoken language input. Machine learning architectures are provided that jointly model spoken language input (“utterances”) and information displayed on a visual display (“on-screen information”). Such machine learning architectures can improve upon, and solve problems inherent in, existing spoken language understanding systems that operate in multi-modal contexts.

CPC Classifications

G10L 15/16 G10L 15/18 G10L 15/1822 G10L 15/183 G10L 15/24 G06N 20/00

Filing Date

2023-12-07

Application No.

18532969

Claims

18