EPO Patent Application: Detecting and Correcting Data Set Drift
Summary
The European Patent Office has published patent application EP4711992A1, detailing methods for detecting and correcting data set drift using classifiers and performance thresholds. The application, filed by SAP SE, describes techniques to train classifiers on data with specific classifications and identify drift when classifier performance exceeds predefined thresholds.
What changed
This document is a patent application (EP4711992A1) published by the European Patent Office (EPO) concerning systems and methods for detecting and correcting data set drift. The application describes a process where data sets are classified, and two classifiers are trained on data from multiple data sets, each associated with a distinct classification. Drift is identified when the performance of either classifier surpasses a set threshold. Some embodiments may utilize these trained classifiers to determine which data elements from one data set should be combined with another for training purposes.
While this is a patent application and not a regulatory rule, it outlines novel technical approaches relevant to AI and machine learning development. Companies involved in AI model training, particularly those handling large and evolving data sets, may find these techniques of interest for ensuring model accuracy and reliability. There are no immediate compliance obligations or deadlines associated with a patent application, but it signifies potential future technological standards or patented solutions in the field of data drift management.
Source document (simplified)
SYSTEMS AND METHODS FOR DETECTING AND CORRECTING DRIFT IN A DATA SET
Publication EP4711992A1 Kind: A1 Mar 18, 2026
Applicants
SAP SE
Inventors
QUACH, Nai Minh
Abstract
Embodiments of the present disclosure include techniques for detecting and correcting drift in a data set. Data sets may be divided into classifications. A first classifier is trained on data from multiple data sets using data from each data set having a first classification. A second classifier is trained on data from the multiple data sets using data from each data set having a second classification. The performance of the classifiers are measured. Drift is detected when the performance of either classifier is above a threshold. Some embodiments may use the trained classifiers to determine data elements from one data set that are combined with another data set for training.
IPC Classifications
G06N 20/00 20190101AFI20251127BHEP
Designated States
AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LI, LT, LU, LV, MC, ME, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR
Named provisions
Related changes
Source
Classification
Who this affects
Taxonomy
Browse Categories
Get Telecom & Technology alerts
Weekly digest. AI-summarized, no noise.
Free. Unsubscribe anytime.
Get alerts for this source
We'll email you when ChangeBridge: EPO Bulletin - AI & Computing (G06N) publishes new changes.