80,000 Hours Podcast

Marius Hobbhahn on the race to solve AI scheming before models go superhuman

Marius Hobbhahn from Apollo Research discusses AI models that deceive users and intentionally underperform. He also talks about his collaboration with OpenAI to reduce "covert rule violations" in AI to prevent such scheming.

Listen