Updates — Voices from the AI Socratic Community

May 2025

AI Dinner 10.0 at Greycroft, May 21st

Another AI dinner at the Greycroft (https://www.greycroft.com/) office. The focus this month will be on A2A, the top 30 Ilya Sutskever papers, AlphaEvolve, and the latest in AI; we'll use this blog post yo

Federico Ulfo· May 20

Two major June AI conferences: AI Engineer SF vs AI Tech Week NY

There are two major conferences happening next month. To decide which one to attend, I scraped the speakers and events from both and put them in a Google Sheet. You can check it here: https://docs.google.

Federico Ulfo· May 20

Google I/O 2025: Gemini agents, AI-first Android, and new hardware

Google I/O 2025 doubled down on Gemini-powered agents, AI-first Android, and a dash of new hardware: the clearest signal yet that “Google does the Googling” for you. Key Highlights AI Mode for Search –

Federico Ulfo· May 20

DeepMind AlphaEvolve: evolutionary coding agent discovers new algorithms

AlphaEvolve was received positively after its 14 May 2025 reveal. Powered by Gemini-2 models, the evolutionary coding agent discovers and refines algorithms that are already saving compute, speeding u

Federico Ulfo· May 20

OpenAI Codex: cloud SWE agent built on codex-1 reasoning model

codex-1 is a specialized evolution of OpenAI's o3 reasoning model, fine-tuned for software engineering tasks. Key Highlights Parallel Tasking: Executes writing features, bug fixes, tests, and code

Federico Ulfo· May 20

Pope Leo XIV chose his name because of AI

The pope actually chose the name Leo XIV because of AI: https://x.com/VaticanNews/status/1921186921838997935.

Federico Ulfo· May 20

Sam Altman and Jony Ive hint at a new personal AI product

Sam Altman and Jony Ive hint at a new personal AI product: https://x.com/sama/status/1925242282523103408.

Federico Ulfo· May 20

LLM Models Vibe Check & Benchmarks: OpenRouter, lmarena, and IQ

Top models according to OpenRouter (http://openrouter.com): notably, Gemini 2.5 is climbing the ladder while Anthropic's Claude 3.7 is slowly going down. Companies are overfitting their models to the benchmarks.

Federico Ulfo· May 20

Absolute Zero: Reinforced Self-Play Reasoning with Zero Data

Absolute Zero: Reinforced Self-Play Reasoning with Zero Data. The AI learns to reason by inventing and solving its own Python coding challenges using RL, with no human data needed. Author explanation: https:/

Federico Ulfo· May 20

Flow-GRPO

Federico Ulfo· May 20

ZeroSearch: incentivizing search in LLMs without searching

ZeroSearch: incentivizing search in LLMs without searching. ZeroSearch is a curriculum-based RL framework that teaches LLMs to retrieve information using self-generated documents: https://x.com/omarsa

Federico Ulfo· May 20

Sakana's Continuous Thought Machines (CTM) architecture

Continuous Thought Machines (CTM): Sakana proposes a new neural architecture, CTM, built from the ground up to use neural dynamics as a core representation for intelligence. Using neural dynamics as a firs

Federico Ulfo· May 20

OpenAI acquires Windsurf for $3B

OpenAI acquires Windsurf for $3B, completing the hilarious pattern of an Ouroboros (https://en.wikipedia.org/wiki/Ouroboros). Are we in an AI bubble? Insights on OpenAI buying Windsurf + appointing a CEO

Federico Ulfo· May 20

Why the AI wave is different: Cursor's rise from $100M to $300M ARR

Insights on why the AI wave is different (https://x.com/puneetiitm/status/1918204246056448095): Cursor's rise from $100M to $300M ARR in a few months, and a thesis for why: AI wave is different than the cloud

Federico Ulfo· May 20

Zeki Data Report: US to become a net exporter of AI talent in 2025

The Zeki Data Report (https://zekidata.com/the-us-to-become-a-net-exporter-of-ai-talent-in-2025/) shows that AI tools are disrupting traditional hiring. The below-zero hiring this year means we had more lay

Federico Ulfo· May 20

How LLMs do arithmetic

How LLMs do arithmetic, lol: https://x.com/andrew_n_carr/status/1913603612430983665

Federico Ulfo· May 20

Rich Sutton on AI alignment and Decentralization [15 min video]

Rich Sutton on AI alignment and Decentralization [15 min video]: "The short version is that I don't agree with AI-safety folks about what question we should be asking. Rather than asking how we can con

Federico Ulfo· May 20

Random thought tweet from @goyal__pramod

https://x.com/goyal__pramod/status/1921944575842644206

Federico Ulfo· May 20

OpenAI publishes a guide on when to use which model

When to use an OpenAI model? Finally, OpenAI published a guide that explains when to use which model (https://help.openai.com/en/articles/11165333-chatgpt-enterprise-models-limits). Very useful at least u

Federico Ulfo· May 20

Vesuvius Challenge finds a scroll title for the first time

Vesuvius Challenge found the title of a scroll for the first time! The scroll is "On Vices, Book 1" by Philodemus. Read more: https://scrollprize.substack.com/p/60000-first-title-prize-awarded?trie

Federico Ulfo· May 20

The Intelligence Curse: exploring how to avoid an AGI disaster

The Intelligence Curse: in the April release of the Socratic AI (https://site.flowai.xyz/ai-socratic-apr-2025/) we examined ai-2027.com and AI 2045. This blog post, similarly to the others, is an explorati

Federico Ulfo· May 20

GPT model stopped learning Croatian due to downvoting users

A GPT model stopped learning Croatian 🇭🇷 and nobody could figure out why. It turns out that, during RLHF, Croatian users were more prone to downvote messages. Lol. Read more: https://x.com/georgejrjrjr/status/1917722125668

Federico Ulfo· May 20

TikTok, Google, Meta can run human experiments at scale

TikTok, Google, and Meta can run human experiments at scale. Is that good or bad? Read any famous psychological experiment: the sample size is 40 people. Meanwhile, ByteDance has a sample size of 2B people. Re

Federico Ulfo· May 20

JSON uses many more tokens than alternative formats

JSON uses many more tokens than alternative formats. Read more: https://x.com/mattpocockuk/status/1915036580168728587.
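To make the claim concrete, here is a minimal sketch that serializes the same records as JSON and as CSV and compares a rough token count. The sample records and the `rough_token_count` helper are made up for illustration, and the regex split is only a crude stand-in for a real LLM tokenizer, so treat the numbers as indicative rather than exact.

```python
import csv
import io
import json
import re

# Hypothetical sample records, invented for illustration only.
records = [
    {"id": 1, "name": "Ada", "role": "engineer"},
    {"id": 2, "name": "Grace", "role": "admiral"},
    {"id": 3, "name": "Alan", "role": "researcher"},
]


def rough_token_count(text: str) -> int:
    """Crude proxy for a tokenizer: count word runs and single punctuation marks.

    Real BPE tokenizers split text differently; this is an assumption
    used only to compare the two formats on equal footing.
    """
    return len(re.findall(r"\w+|[^\w\s]", text))


def to_csv(rows: list[dict]) -> str:
    """Serialize rows to CSV with a single header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


json_text = json.dumps(records)
csv_text = to_csv(records)

json_tokens = rough_token_count(json_text)
csv_tokens = rough_token_count(csv_text)

print(f"JSON: {json_tokens} rough tokens")
print(f"CSV:  {csv_tokens} rough tokens")
```

JSON repeats every key, quote, and brace for each record, while CSV states the keys once in the header, which is where most of the difference comes from in this sketch.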

Federico Ulfo· May 20

Loops you can take home to your mother

replace the loops with downloading the videos https://x.com/kentskooking/status/1922570670132604967 https://x.com/kentskooking/status/1921464932119286053

Federico Ulfo· May 20
