Data augmentation and batch balancing methods to enhance negation and fairness

Grant US12579471B2 Kind: B2 Mar 17, 2026

Assignee

ORACLE INTERNATIONAL CORPORATION

Inventors

Duy Vu, Varsha Kuppur Rajendra, Dai Hoang Tran, Shivashankar Subramanian, Poorya Zaremoodi, Thanh Long Duong, Mark Edward Johnson

Abstract

Techniques for augmentation and batch balancing of training data to enhance negation and fairness of a machine learning model. In one particular aspect, a method is provided that includes obtaining a training set of labeled examples for training a machine learning model to classify sentiment, searching the training set of labeled examples or an unlabeled corpus of text on target domains for sentiment examples having negation cues, sentiment laden words, words with sentiment prefixes or suffixes, or a combination thereof, rewriting the sentiment examples to create negated versions thereof and generate a labeled negation pair data set, and training the machine learning model using labeled examples from the labeled negation pair data set.

CPC Classifications

G06N 20/00 G06N 5/022 G06F 40/279 G06F 40/166 G06F 40/49 G06F 40/20

Filing Date

2022-11-10

Application No.

17984768

Claims

View original document →