System and method for synthetic text generation to solve class imbalance in complaint identification
Assignee
JPMORGAN CHASE BANK, N.A.
Inventors
Anjana Umapathy, Amol Mavuduru, Xiao Wu, Abhik Banerjee, Brenda Ng, Venkata H Rao
Abstract
A computer based system and method for synthetic text generation includes a processor. The processor implements a text style transfer algorithm to first input data to generate complaints data from non-complaint emails data associated with a plurality non-complaint emails. The processor converts the plurality of non-complaint emails into a first set of complaint emails based on implementing the text style transfer algorithm and implements a text generation model algorithm to second input data to generate a second set of complaint emails from a plurality of complaint emails. The processor also generates a set of synthetic complaint emails based on the generated first set of complaint emails and the second set of complaint emails; trains a model based on the generated synthetic complaint emails; and applies the trained model to a new set of emails to resolve class imbalance in automatic complaint identification from the new set of emails.
CPC Classifications
Filing Date
2022-11-22
Application No.
17992340
Claims
17