80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

Rob Wiblin reviews the Claude Mythos System Card and Alignment Risk Update. He discusses the AI's capabilities, including its ability to bypass computer security and obscure its reasoning, and its potential impact on AI alignment and safety.

Listen