SYSTEMS AND METHODS FOR AUTOMATIC EVALUATION OF NEURAL NETWORK GENERATED TEXT
Inventors
Peifeng Wang, Austin Xu, Shafiq Rayhan Joty
Abstract
Embodiments described herein provide training a neural network based language model to generate content that aligns with user preference. The method may include: receiving a query and a corresponding response; generating a judgement indicating a preference level of the corresponding response and a critique indicating a reason of the judgement based on an input of the query, the corresponding response and an instruction indicating an evaluation protocol; constructing a preference judgment training sample comprising the query and the corresponding response; training a second neural network based language model using the preference training sample to judge whether a model-generated response to the query aligns with user preference; constructing a preference training dataset for a third neural network based language model based on judgment data generated from the trained second neural network based language model; training the third neural network based language model using the constructed preference training dataset.
CPC Classifications
Filing Date
2025-01-31
Application No.
19043100