Capturing data properties to recommend machine learning models for datasets
Assignee
International Business Machines Corporation
Inventors
Manjit Singh Sodhi, Suja Mohandas, Nitin Gupta, Kalapriya Kannan, Prerna Agarwal
Abstract
Recommending machine learning models is provided. The method comprises training machine learning models, wherein each machine learning model is trained with a unique respective dataset. Metadata associated with each machine learning model is extracted, wherein the metadata includes properties of the respective dataset used to train the machine learning model. The machine learning models and metadata are stored in a model catalog. Upon receiving a new dataset, similarity scores are calculated between the new dataset and the machine learning models in the model catalog according to the properties of the datasets in the metadata of the machine learning models. A closest match machine learning model is identified from the model catalog for the new dataset according to similarity score. Responsive to a determination that the closest match machine learning model exceeds a similarity threshold, predictions for the new dataset are generated with the closest match machine learning model.
CPC Classifications
Filing Date
2022-09-28
Application No.
17936045
Claims
20