Ryan Burnell, “A Cognitive Approach to the Evaluation of AI Systems”
Abstract: The capabilities of AI systems are improving rapidly, and these systems are being deployed in increasingly complex and high-stakes contexts, from self-driving cars to the detection of medical conditions. As the importance of AI grows, so too does the need for robust evaluation. If we want to determine the extent to which systems are