MULTI-STAGE FRAMEWORK FOR EXTRACTING KEY/VALUE PAIRS FROM IMAGES
Inventors
Gareth Sharpe, Gopi Balasingam
Abstract
Systems and methods are described for processing document images using large language models to extract a key/value pair. The method uses a four-stage framework: (1) Image Quality Evaluation, which assesses image attributes such as text legibility and sharpness; (2) Image Classification, which categorizes each document into one of a set of predefined types; (3) Key/Value Pair Extraction, which identifies the relevant data fields; and (4) Extraction Evaluation, which assigns confidence scores based on one or more predetermined criteria. Prompt engineering is used to configure structured prompts that guide the model at each stage. Outputs, including the extracted data and confidence scores, are formatted for integration with downstream workflows, enabling applications in claims processing, invoicing, and other document-centric tasks.
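The four stages can be illustrated with a minimal Python sketch. It is not the claimed implementation: the `Model` callable, the prompt wording, the `FIELDS_BY_TYPE` table, the `min_sharpness` threshold, and the choice to stop early on a low-quality image are all assumptions introduced for illustration, and `fake_model` is a stand-in that returns canned JSON instead of calling a real large language model.

```python
import json
from dataclasses import dataclass, asdict
from typing import Callable, Dict, Optional

# Hypothetical model interface: (prompt, image bytes) -> JSON string.
Model = Callable[[str, bytes], str]

# Hypothetical document types and the fields expected for each.
FIELDS_BY_TYPE = {
    "invoice": ["invoice_number", "total"],
    "claim": ["claim_id", "claimant"],
}

# Hypothetical structured prompts, one per stage (wording is illustrative).
PROMPTS = {
    "quality": 'Rate text legibility and sharpness. Reply as JSON: {"legible": bool, "sharpness": 0-1}',
    "classify": 'Classify the document as one of {types}. Reply as JSON: {{"type": str}}',
    "extract": 'Extract the fields {fields} from this {doc_type}. Reply as JSON: {{field: value}}',
    "evaluate": 'Score each extracted value in {pairs} from 0 to 1. Reply as JSON: {{field: score}}',
}


@dataclass
class Result:
    accepted: bool
    doc_type: Optional[str] = None
    pairs: Optional[Dict[str, str]] = None
    confidence: Optional[Dict[str, float]] = None
    reason: str = ""


def run_pipeline(image: bytes, model: Model, min_sharpness: float = 0.5) -> Result:
    # Stage 1: image quality evaluation (early exit is an assumption).
    quality = json.loads(model(PROMPTS["quality"], image))
    if not quality["legible"] or quality["sharpness"] < min_sharpness:
        return Result(accepted=False, reason="image quality too low")

    # Stage 2: classification into a predefined document type.
    prompt = PROMPTS["classify"].format(types=sorted(FIELDS_BY_TYPE))
    doc_type = json.loads(model(prompt, image))["type"]
    if doc_type not in FIELDS_BY_TYPE:
        return Result(accepted=False, reason=f"unknown document type: {doc_type}")

    # Stage 3: key/value pair extraction for the fields of that type.
    prompt = PROMPTS["extract"].format(fields=FIELDS_BY_TYPE[doc_type], doc_type=doc_type)
    pairs = json.loads(model(prompt, image))

    # Stage 4: extraction evaluation, one confidence score per field.
    prompt = PROMPTS["evaluate"].format(pairs=json.dumps(pairs))
    confidence = json.loads(model(prompt, image))
    return Result(accepted=True, doc_type=doc_type, pairs=pairs, confidence=confidence)


def fake_model(prompt: str, image: bytes) -> str:
    """Stand-in for a real model call; returns canned JSON for each stage."""
    if prompt.startswith("Rate"):
        return json.dumps({"legible": True, "sharpness": 0.9})
    if prompt.startswith("Classify"):
        return json.dumps({"type": "invoice"})
    if prompt.startswith("Extract"):
        return json.dumps({"invoice_number": "INV-001", "total": "120.00"})
    return json.dumps({"invoice_number": 0.98, "total": 0.91})


result = run_pipeline(b"<image bytes>", fake_model)
# Serialize the structured result for a downstream workflow.
print(json.dumps(asdict(result), indent=2))
```

Passing the model in as a plain callable keeps the stage logic independent of any particular provider, and returning a single dataclass gives downstream systems one structured record that carries both the extracted pairs and their confidence scores.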
CPC Classifications
Filing Date
2025-08-26
Application No.
19310740