Updates — Voices from the AI Socratic Community

February 2025

Scaling up test-time compute with latent reasoning

https://x.com/MatthewBerman/status/1890081482104008920

Federico Ulfo · Feb 21

SFT Memorizes, RL Generalizes

SFT Memorizes, RL Generalizes (https://tianzhechu.com/SFTvsRL/). DeepSeek has shown the power of Reinforcement Learning (RL) without Supervised Fine-Tuning (SFT). What does RL learn differently from SFT? Wel

Federico Ulfo · Feb 12

As AIs Get Smarter, They Develop Coherent Value Systems

As AIs get smarter, they develop their own coherent value systems (https://x.com/DanHendrycks/status/1889344074098057439). For example, they value human lives differently by country, ranked from highest to lowest: Pakistan, India, China, US.

Federico Ulfo · Feb 12

Humanity's Last Exam Dataset Released

Humanity's Last Exam (https://x.com/DanHendrycks/status/1882433928407241155) is a dataset of 3,000 questions with known, verifiable answers, developed with hundreds of subject-matter experts to cap

Federico Ulfo · Feb 12
