Updates — Voices from the AI Socratic Community

August 2025

All Agents Macro & Geopolitics Models Random Research Videos & Podcasts

Benchmarks & Metrics: Models Increasingly Overfitted to Leaderboards

Over the past two years, the AI NY community has been actively reviewing and discussing various benchmarks while tracking the rapid progress of new models. What has become increasingly clear is that m

Federico Ulfo· Aug 28

Comment

Genie 3: World Model Generates Interactive Environments

Genie 3 is a groundbreaking world model that transforms simple text prompts into immersive, interactive virtual environments, you can explore with the direction keys like you would in a video game. Th

Federico Ulfo· Aug 28

Comment

Blog Post: The Second Half by Shunyu Yao

This is one of the best blog posts of 2025 by the OpenAI researcher Shunyu Yao. A playbook for what will matter most in AI research and the startup ecosystem, and how to prepare. In the first half the

Federico Ulfo· Aug 28

Comment

← NewerAugust 2025Older →

Benchmarks & Metrics: Models Increasingly Overfitted to Leaderboards

Genie 3: World Model Generates Interactive Environments

Blog Post: The Second Half by Shunyu Yao

Search