AI Socratic

Search

Search across updates, events, members, and blog posts

This paper shows that a single simple RL recipe can push 1.5B models to SoTA reasoning with half the compute - Updates | AI Socratic