ROUGE Score
We compute the ROUGE-1, ROUGE-2, ROUGE-L, and sum scores for ChatGPT, given the same instruction as the Prompt Generator models, and compare the resulting ROUGE scores. A high ROUGE score indicates that a model's predictions closely match the reference text. However, these metrics do not account for the fact that ChatGPT may carry out the same instruction in a way that differs substantially from the reference text.
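To make the metric concrete, the following is a minimal sketch of how ROUGE-1, ROUGE-2, and ROUGE-L F1 scores can be computed for a single prediction and reference pair. It is not the evaluation code used here: the whitespace tokenizer, the helper names (`tokenize`, `rouge_n`, `rouge_l`), and the example sentences are all assumptions made for illustration, and the "sum" variant is left out because the text does not define it.

```python
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; real ROUGE
    # implementations usually add stemming and punctuation handling.
    return text.lower().split()


def ngram_counts(tokens, n):
    # Multiset of all contiguous n-grams in the token list.
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(overlap, pred_total, ref_total):
    # Harmonic mean of precision and recall; 0.0 when there is no overlap.
    if overlap == 0 or pred_total == 0 or ref_total == 0:
        return 0.0
    precision = overlap / pred_total
    recall = overlap / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(prediction, reference, n):
    # ROUGE-N: F1 over clipped n-gram overlap.
    pred = ngram_counts(tokenize(prediction), n)
    ref = ngram_counts(tokenize(reference), n)
    overlap = sum((pred & ref).values())
    return f1(overlap, sum(pred.values()), sum(ref.values()))


def lcs_length(a, b):
    # Length of the longest common subsequence, by dynamic programming.
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, 1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(prediction, reference):
    # ROUGE-L: F1 over the longest common subsequence of tokens.
    pred, ref = tokenize(prediction), tokenize(reference)
    return f1(lcs_length(pred, ref), len(pred), len(ref))


if __name__ == "__main__":
    # Hypothetical example pair, not taken from the evaluation data.
    prediction = "the cat sat on the mat"
    reference = "the cat lay on the mat"
    print("ROUGE-1:", rouge_n(prediction, reference, 1))
    print("ROUGE-2:", rouge_n(prediction, reference, 2))
    print("ROUGE-L:", rouge_l(prediction, reference))
```

Because all three scores count only tokens shared with the reference, a response that follows the instruction correctly but is worded differently from the reference receives a low score, which is the limitation noted above.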