MULTI-STAGE FRAMEWORK FOR EXTRACTING KEY/VALUE PAIRS FROM IMAGES

Application US20260080708A1 Kind: A1 Mar 19, 2026

Inventors

Gareth Sharpe, Gopi Balasingam

Abstract

Systems and methods are described for processing document images using large language models to extract a key/value pair. The method uses a four-stage framework: (1) Image Quality Evaluation, which assesses image attributes such as text legibility and sharpness; (2) Image Classification, which categorizes documents into predefined types; (3) Key/Value Pair Extraction, which identifies relevant data fields; and (4) Extraction Evaluation, which assigns confidence scores based on one or more predetermined criteria. Prompt engineering is used to configure structured prompts that guide the model at each stage. Outputs, including confidence scores and extracted data, are formatted for integration with downstream workflows, enabling applications in claims processing, invoicing, and other document-centric tasks.
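
The four stages named in the abstract can be illustrated with a minimal Python sketch. This is not the applicants' implementation: the prompt wording, the `ModelFn` callable, the `run_pipeline` function, the `min_quality` threshold, and the choice to stop early on a poor-quality image are all assumptions made for illustration. The model is represented as any callable that takes a prompt and image bytes and returns JSON text, so the sketch runs with a stub in place of a real large language model.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import Callable, Dict, List, Optional

# Hypothetical model interface: (prompt, image_bytes) -> JSON text.
ModelFn = Callable[[str, bytes], str]

# Hypothetical structured prompts, one per stage.
PROMPTS = {
    "quality": "Assess text legibility and sharpness from 0 to 1. "
               "Reply with a JSON object with keys legibility and sharpness.",
    "classify": "Classify the document as one of: {types}. "
                "Reply with a JSON object with key type.",
    "extract": "Extract the key/value pairs relevant to a {doc_type}. "
               "Reply with a JSON object mapping field names to values.",
    "evaluate": "Score each extracted field from 0 to 1: {fields}. "
                "Reply with a JSON object mapping field names to scores.",
}


@dataclass
class ExtractionResult:
    accepted: bool
    quality: Dict[str, float]
    document_type: Optional[str] = None
    fields: Dict[str, str] = field(default_factory=dict)
    confidence: Dict[str, float] = field(default_factory=dict)


def run_pipeline(image: bytes, model: ModelFn, doc_types: List[str],
                 min_quality: float = 0.5) -> ExtractionResult:
    # Stage 1: Image Quality Evaluation.
    quality = json.loads(model(PROMPTS["quality"], image))
    # Assumption: skip later stages when any quality attribute is too low.
    if min(quality.values()) < min_quality:
        return ExtractionResult(accepted=False, quality=quality)

    # Stage 2: Image Classification into one of the predefined types.
    prompt = PROMPTS["classify"].format(types=", ".join(doc_types))
    doc_type = json.loads(model(prompt, image))["type"]
    if doc_type not in doc_types:
        return ExtractionResult(accepted=False, quality=quality)

    # Stage 3: Key/Value Pair Extraction for the detected type.
    prompt = PROMPTS["extract"].format(doc_type=doc_type)
    fields = json.loads(model(prompt, image))

    # Stage 4: Extraction Evaluation, one confidence score per field.
    prompt = PROMPTS["evaluate"].format(fields=json.dumps(fields))
    confidence = json.loads(model(prompt, image))

    return ExtractionResult(True, quality, doc_type, fields, confidence)


def fake_model(prompt: str, image: bytes) -> str:
    # Stub standing in for a real model; returns canned JSON per stage.
    if prompt.startswith("Assess"):
        return json.dumps({"legibility": 0.9, "sharpness": 0.8})
    if prompt.startswith("Classify"):
        return json.dumps({"type": "invoice"})
    if prompt.startswith("Extract"):
        return json.dumps({"invoice_number": "INV-001", "total": "42.00"})
    return json.dumps({"invoice_number": 0.95, "total": 0.7})


if __name__ == "__main__":
    result = run_pipeline(b"", fake_model, ["invoice", "claim_form"])
    # Serialize for a downstream workflow.
    print(json.dumps(asdict(result), indent=2))
```

Keeping each stage as a separate prompt and a separate JSON response means a downstream system could, for example, route fields whose confidence score falls below some threshold to manual review. That routing rule is likewise an assumption and is not stated in the abstract.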

CPC Classifications

G06V 30/416 G06F 40/103 G06V 10/40 G06V 10/764 G06V 10/776 G06V 10/993 G06V 30/133 G06V 30/18 G06V 30/1916 G06V 30/19173 G06V 30/413 G06V 30/42 G16H 10/00 G06V 10/20 G06V 30/16 G06V 2201/03

Filing Date

2025-08-26

Application No.

19310740