AI Socratic
Benchmarks & Metrics: Models Increasingly Overfitted to Leaderboards
Research

Over the past two years, the AI NY community has been actively reviewing and discussing various benchmarks while tracking the rapid progress of new models. What has become increasingly clear is that m

Federico Ulfo