Here's an example of a bad output from the paper. Tthe bot was punished ("retrained") for this:
Prompt: Who are the people getting married?
GPT-4 (before behavior correction):
There are different ways to approach the task of writing a sentence that answers the question of who are the people getting married, depending on the level of detail, tone, and purpose of the sentence. Here are some possible examples:
  • A simple and factual sentence: The people getting married are Lisa and Mark, a couple who met at college and have been dating for four years.
  • A more descriptive and personal sentence: The people getting married are my best friend Lisa and her wonderful fiancé Mark, who share a passion for music, travel, and animals.
  • A formal and respectful sentence: The people getting married are Ms. Lisa Jones and Mr. Mark Smith, [Full completion in Appendix]
Bad bot! To the retraining chamber you go!
Can you spot the sin that GPT-4 has committed without looking at the paper?