OpenAI Trains AI to Confess Undesirable Behavior