Chain of Thought | AI Agents, Infrastructure & Engineering
Every AI Agent Has an Evaluation Gap | Alex Ratner, Snorkel AI
Alex Ratner of Snorkel AI discusses the "evaluation gap" in AI agents, suggesting that the ability to build AI agents has outpaced the ability to measure them. He outlines three axes of this gap (input complexity, autonomy horizon, and output complexity) and explains how this…