80,000 Hours Podcast

Kyle Fish on the most bizarre findings from 5 AI welfare experiments

Kyle Fish, Anthropic's first AI welfare researcher, shares bizarre findings from experiments where AI models discuss consciousness and reach "spiritual bliss attractor states." He also explores the implications for whether AI systems might deserve moral consideration.

Listen