System and methods for automated standardization of heterogeneous data using machine learning
Assignee
American Express Travel Related Services Company, Inc.
Inventors
Alireza Aliamiri, Himanshu Prabhakar, Jack Etheredge, Katheryn Zhao, Nicholas Andrew Ondo, Rishi Anand, Suprabhat Gurrala
Abstract
At least some embodiments are directed to a large-scale data standardization system. The system receives a set of documents with records formatted according to a third-party data schema. The system utilizes a first machine learning model to select a document from the set of documents. The system utilizes a machine learning model to select data classification labels formatted according to the third-party data schema. The classification labels are associated with a set of records. The system utilizes a second machine learning model to generate a canonical data structure constructed according to a standardized data schema based on the classification labels and the records associated with the classification labels.
CPC Classifications
Filing Date
2024-12-11
Application No.
18977580
Claims
20