Chain of Thought | AI Agents, Infrastructure & Engineering

Every AI Agent Has an Evaluation Gap | Alex Ratner, Snorkel AI

Alex Ratner of Snorkel AI discusses the "evaluation gap" in AI agents, suggesting that the ability to build AI agents has surpassed the ability to measure them. He outlines three axes of this gap: input complexity, autonomy horizon, and output complexity, and explains how this…

Listen