EVALUATING SOURCE CONTRIBUTION TO GENERATED CONTENT
Inventors
Hirenkumar Ashokbhai Thummar, Alfredo Alba, Daniel Gruhl, Shubhi Asthana, Linda Ha Kato, Bing Zhang, Steven R. Welch
Abstract
The presence of training data in generated content is evaluated by defining a first signature of a portion of the generated content, which involves loading the generated content into memory registers, dividing the generated content into tokens, designating a sequential group of tokens as a signature shingle, and defining the first signature as a hash function value for the signature shingle. The evaluation also includes matching the first signature to a training data signature in a training data signature database, updating a database record for the generated content to include the training data associated with the training data signature, and providing an output comprising data associated with the training data signature over a network.
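The shingle-and-hash steps described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: whitespace tokenization, a shingle width of four tokens, SHA-256 as the hash function, and an in-memory dictionary standing in for the training data signature database are all assumptions made for illustration, and the function names (`tokenize`, `shingle_signatures`, `build_signature_db`, `match_sources`) are hypothetical.

```python
import hashlib


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the abstract does not
    # specify how content is divided into tokens.
    return text.lower().split()


def shingle_signatures(text, k=4):
    # Each run of k consecutive tokens is a "signature shingle"; its
    # signature is the hash value of the shingle (SHA-256 assumed here).
    tokens = tokenize(text)
    signatures = []
    for i in range(len(tokens) - k + 1):
        shingle = " ".join(tokens[i:i + k])
        signatures.append(hashlib.sha256(shingle.encode("utf-8")).hexdigest())
    return signatures


def build_signature_db(training_docs, k=4):
    # Hypothetical stand-in for the training data signature database:
    # maps each signature to the set of training sources that contain it.
    db = {}
    for source_id, text in training_docs.items():
        for sig in shingle_signatures(text, k):
            db.setdefault(sig, set()).add(source_id)
    return db


def match_sources(generated, db, k=4):
    # Look up every signature of the generated content and collect the
    # training sources associated with any matching signature.
    matched = set()
    for sig in shingle_signatures(generated, k):
        matched |= db.get(sig, set())
    return matched
```

In this sketch, a record for the generated content could be updated by storing the result of `match_sources` alongside the content, for example `record = {"content": generated, "sources": sorted(match_sources(generated, db))}`. Sending that output over a network is omitted here.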
CPC Classifications
Filing Date
2024-09-19
Application No.
18889444