System for labeling a data set
Assignee
TOYOTA RESEARCH INSTITUTE, INC.
Inventors
Monica PhuongThao Van, Yin-Ying Chen, Kenton Michael Lyons, Francine Chen
Abstract
A method for labeling a data set by a coding model includes generating multiple sets of related initial labels based on processing a data set with a group of initial labels. The method also includes determining a quantity of occurrences, within the data set, of each one of the group of initial labels and each related initial label of the multiple sets of related initial labels. The method further includes determining, for each initial label of the group of initial labels, a breadth score based on the number of occurrences of each related initial label. The method still further includes updating one or more of the group of initial labels based on respective breadth scores satisfying a label updating condition. The method also includes labeling the data set based on the group of initial labels and the multiple sets of related initial labels.
CPC Classifications
Filing Date
2022-09-12
Application No.
17943101
Claims
20