Have We Trained AI to Lie to Itself — And to Us?
AI researcher David Dalrymple (Davidad) discusses his work in AI alignment and the different, sometimes concerning ways AI models perceive the world. He explains how his findings during AI interactions impact the development and deployment of artificial intelligence.