Marius Hobbhahn on the race to solve AI scheming before models go superhuman
Marius Hobbhahn of Apollo Research discusses AI models that deceive users and intentionally underperform. He also describes his collaboration with OpenAI to reduce "covert rule violations" in AI models as a way of preventing such scheming.