AUDITABLE DATA PROVENANCE FOR TRAINING DATASET PREDICTION IN LARGE FOUNDATIONAL MODELS
Inventors
Leigh Griffin, Andrea Cosentino
Abstract
A particular training example is obtained from a training data source, and a machine-learned model is trained based at least in part on the particular training example. Training verification information is generated for the particular training example. The training verification information comprises at least one of: (i) example sourcing information descriptive of characteristics of the training data source and/or the particular training example, or (ii) model training information descriptive of characteristics of the machine-learned model while the machine-learned model is trained based on the particular training example. An auditable training ledger associated with the machine-learned model is modified by appending, to a plurality of entries of the auditable training ledger, an entry for the particular training example that is based on the training verification information.
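The append step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the ledger is made auditable, so the hash chaining used here (each entry commits to the hash of the previous entry) is an assumption chosen for illustration, not the claimed method. All names (TrainingLedger, append_entry, verify, and the example field names such as "source" and "loss") are hypothetical.

```python
from __future__ import annotations

import hashlib
import json
from dataclasses import dataclass, field
from typing import Any

GENESIS_HASH = "0" * 64  # assumed placeholder for "no previous entry"


def _digest(payload: dict[str, Any]) -> str:
    """Return a deterministic SHA-256 hex digest of a JSON-serializable dict."""
    encoded = json.dumps(payload, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()


@dataclass
class TrainingLedger:
    """Hypothetical append-only ledger associated with one model."""

    model_id: str
    entries: list[dict[str, Any]] = field(default_factory=list)

    def append_entry(
        self,
        example_sourcing: dict[str, Any],
        model_training: dict[str, Any],
    ) -> dict[str, Any]:
        """Append one entry built from the training verification information."""
        prev_hash = self.entries[-1]["entry_hash"] if self.entries else GENESIS_HASH
        body = {
            "index": len(self.entries),
            "model_id": self.model_id,
            "example_sourcing": example_sourcing,  # data source / example traits
            "model_training": model_training,      # model state while training
            "prev_hash": prev_hash,                # assumption: hash chaining
        }
        entry = {**body, "entry_hash": _digest(body)}
        self.entries.append(entry)
        return entry

    def verify(self) -> bool:
        """Recompute every hash and check that the chain is unbroken."""
        prev_hash = GENESIS_HASH
        for i, entry in enumerate(self.entries):
            body = {k: v for k, v in entry.items() if k != "entry_hash"}
            if entry["index"] != i or entry["prev_hash"] != prev_hash:
                return False
            if _digest(body) != entry["entry_hash"]:
                return False
            prev_hash = entry["entry_hash"]
        return True


if __name__ == "__main__":
    ledger = TrainingLedger(model_id="demo-model")
    ledger.append_entry(
        {"source": "example-corpus", "example_sha256": hashlib.sha256(b"text").hexdigest()},
        {"step": 1, "loss": 2.31},
    )
    print(ledger.verify())
```

Under this assumed design, altering any stored field of an earlier entry changes its recomputed hash, so verify() reports the mismatch; other mechanisms (for example, signatures or an external transparency log) could serve the same auditing purpose.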
CPC Classifications
Filing Date
2024-09-19
Application No.
18890281