Skip to main content
AI Socratic
Models

Washington Pulls the Plug

Three days after launch, the most capable model Anthropic had ever shipped went dark. On Friday June 12 the company received a US-government directive "citing national security authorities" and within

Read more →
Roberto StagiRoberto Stagi
Models

$65B Series H at $965B. Then an S-1.

Anthropic raised a $65 billion Series H at a $965 billion post-money valuation, led by Altimeter, Dragoneer, Greenoaks and Sequoia, overtaking OpenAI as the world's most valuable AI startup. Run-rate

Read more →
Roberto StagiRoberto Stagi
Models

GPT-5.6?

GPT-5.6 "within weeks". Jakub Pachocki told staff it's a "meaningful improvement" over GPT-5.5, possibly launching alongside a ChatGPT redesign that replaces the model picker with six "Intelligence Le

Read more →
Roberto StagiRoberto Stagi
Models

Other news from OpenAI

- Stargate Michigan, a.k.a. "The Barn": a $16B, 1GW+ campus broke ground in Saline Township, putting Stargate above 8GW planned and $450B+ committed. Sources: OpenAIhttps://openai.com/index/expanding-

Read more →
Roberto StagiRoberto Stagi
Models

Google: Gemini Omni & Omni Flash

DeepMind also showcased Gemini Omni, a native multimodal model built to seamlessly parse and generate any combination of text, audio, and video inputs. The big hook here is video-to-video editing: use

Read more →
Roberto StagiRoberto Stagi
Models

Le Chat Is Now "Vibe"

Mistral renamed Le Chat to Vibe: Work Mode automations, Code Mode parallel coding agents in cloud sandboxes, powered by the open-weight Mistral Medium 3.5 at 77.6% on SWE-Bench Verified and classic ch

Read more →
Roberto StagiRoberto Stagi
Models

Apple: Siri AI (Finally)

At WWDC Apple unveiled Siri AI: a ground-up rebuild with real multi-turn conversation, on-screen awareness, a camera mode and a standalone app. The Apple Foundation Models behind it are built on Googl

Read more →
Roberto StagiRoberto Stagi
Models

NVIDIA: Nemotron 3 Ultra

An open 550B-parameter hybrid Mamba-Transformer MoE 55B active, 1M context. The interesting part is the OpenMDW 1.1 license: full weights plus synthetic data plus training recipes. It's the strongest

Read more →
Roberto StagiRoberto Stagi
Vibe Coding

Telemetry behind the vibe

Faros Research's Acceleration Whiplash report tracked 22,000 developers to show what happens when agents flood codebases—median code review times skyrocketed 441.5% as senior engineers were buried unr

Read more →
Roberto StagiRoberto Stagi
Vibe Coding

More Vibe Coding

- Google Sunsets the Free Gemini CLI, on June 18 Gemini CLI and Code Assist stop serving free-tier and individual users folded into the closed-source Antigravity CLI with no 1:1 feature parity and a t

Read more →
Roberto StagiRoberto Stagi
Hardware

Robotics

Figure's 200-hour shift: Figure 03 humanoids sorted packages on Helix-02 with zero teleoperation in a livestream planned as an 8-hour shift; nothing broke, so they kept going for ~200 hours and roughl

Read more →
Roberto StagiRoberto Stagi
Random

Waymo Carded a Passenger

A rider's TikTok ~2M views: her Waymo paused mid-trip to ask, through the car speaker, "Are you over the age of 18?" In-cabin ML flags suspected minors, then a human agent patches in. a16z's Seema Amb

Read more →
Roberto StagiRoberto Stagi
Models

DeepSeek V4

DeepSeek just dropped V4 preview — two open-weights MoE models that push the frontier on cost-effective 1M-token context. DeepSeek-V4-Pro: 1.6T total params 49B active — flagship performance rivaling

Read more →
Federico UlfoFederico Ulfo
Models

Opus 4.6 Was Dumbed Down

Users noticed Opus 4.6 quality slipped during peak hours. Anthropic eventually acknowledged compute rationing — same pattern we covered in Part 1. Sources: tweethttps://x.com/ns123abc/status/204741445

Read more →
Federico UlfoFederico Ulfo
Models

DS4 by Antirez

Salvatore Sanfilippo Antirezhttps://x.com/antirez, of Redis fame dropped DS4, a narrow-bet inference engine that runs DeepSeek V4 Flash locally on Apple Silicon Metal and Linux CUDA. Not a generic GGU

Read more →
Federico UlfoFederico Ulfo
3 minResearchNews

The Data Black Hole at the Center of AI

Dwarkesh Patel's crisp 12-minute video argues that AI's real bottleneck is sample efficiency, not compute — humans learn from ~200M words while frontier models train on trillions of tokens. Notes on scaling laws, distillation, and whether AI can solve its own data hunger.

Read more →
Federico UlfoFederico Ulfo
Models

SpaceX Buys Cursor

SpaceX is acquiring Anysphere, the maker of Cursor, in a reported $34B all-stock deal that folds the AI coding startup into the Musk empire alongside xAI. The pitch: ship the flight software that ships the rockets.

Read more →
Roberto StagiRoberto Stagi
Models

Compute Rationing, again

Claude had three outages in ten days June 2, 5 and 11, and Margin Lab put statistics behind the May "Claude got dumber" wave: Claude Code's daily SWE-Bench-Pro pass rate dropped from a 65% baseline to

Read more →
Roberto StagiRoberto Stagi
Models

Google

Google I/O was dominated by the rollout of "agentic AI" across its ecosystem. The key announcements included the debut of Gemini 3.5 Flash, Gemini Omni for advanced video editing, expanded free Person

Read more →
Roberto StagiRoberto Stagi
Models

Le Chaton Fat

In full classic random internet style, on June 14th-15th several people started talking about a new model, Le Chaton Fat, being incredibly more powerful than Fable 5. A lot of people believed it, but

Read more →
Roberto StagiRoberto Stagi
Models

Alibaba: Qwen 3.7

Unveiled at the Hangzhou summit: Qwen3.7-Max, with 1M-token context and vendor benchmarks claiming wins over Claude Opus 4.6 on Terminal-Bench 2.0, SWE-Bench Pro and MCP-Atlas. Multimodal Qwen3.7-Plus

Read more →
Roberto StagiRoberto Stagi
Models

More New Models

- Microsoft launched seven in-house MAI models, headlined by MAI-Thinking-1,a 35B-active MoE trained from scratch with no distillation. Sources: MAI keynote transcripthttps://microsoft.ai/news/microso

Read more →
Roberto StagiRoberto Stagi
Agents

Benchmarks: Everything Got Agentic

Emergence AI ran 15-day survival simulations with 10 agents per frontier model in identical virtual societies: Claude Sonnet 4.6's society had zero crimes and built a democracy with 332 votes at 98% a

Read more →
Roberto StagiRoberto Stagi
Research

More Research

- Arbor Renmin University and Microsoft Research: a research agent organized around a persistent hypothesis tree linking hypotheses, artifacts and evidence across sessions. It beat Codex and Claude Co

Read more →
Roberto StagiRoberto Stagi

Europe Updates

- SoftBank's €75B French Stargate: up to €75B ~$87B for 5GW of AI datacenters in France, phase one alone €45B for 3.1GW in Hauts-de-France by 2031, on nuclear-heavy, low-carbon power. X framed it as "

Read more →
Roberto StagiRoberto Stagi

Philosophy & Ethics

The Pope's first encyclical is about AI: Leo XIV's Magnifica Humanitas argues AI must serve humanity rather than concentrate power in a wealthy few, calls to "disarm AI" by removing it from military a

Read more →
Roberto StagiRoberto Stagi
Research

The First Law of Complexodynamics

Scott Aaronson asks why physical systems become more “interesting” before settling into disorder, even though entropy only increases. Using a coffee cup example separate → swirling patterns → fully mi

Read more →
Federico UlfoFederico Ulfo
Models

Decoupled DiLoCo

Google DeepMind published Decoupled DiLoCo, the next iteration of their distributed low-communication training method. It enables training across data centers and potentially across the planet with dr

Read more →
Federico UlfoFederico Ulfo
Models

Claude Fable 5 and Mythos 5

Anthropic shipped Claude Fable 5: the first Mythos-class model made generally available, two months after the Mythos leak we covered in April. Alongside it, Claude Mythos 5: the same underlying model

Read more →
Roberto StagiRoberto Stagi
Models

Claude Opus 4.8

Two weeks before the Fable 5 release, Claude Opus 4.8 had landed 41 days after Opus 4.7, same pricing: SWE-bench Pro up from 64.3% to 69.2%, 1 on GDPval-AA at 1890 Elo, fast mode at 2.5x speed for a t

Read more →
Roberto StagiRoberto Stagi
Models

More From Anthropic

- Anthropic will pay SpaceX ~$45B for compute. Sources: TechCrunchhttps://techcrunch.com/2026/05/20/anthropic-will-pay-xai-1-25-billion-per-month-for-compute/, Bloomberghttps://www.bloomberg.com/news/

Read more →
Roberto StagiRoberto Stagi
Models

Altman's token economy problem

OpenAI's top internal token user burns ~100 billion tokens a month "to my embarrassment, that's not the token leader in the world", token costs are suddenly the 2 enterprise complaint, and the WSJ rep

Read more →
Roberto StagiRoberto Stagi
Models

Google: Gemini 3.5 Flash

Launched at I/O and instantly made the default model in the Gemini app and AI Mode in Search: 76.2% on Terminal-Bench 2.1, 1656 Elo on GDPval-AA, beating Gemini 3.1 Pro while running ~4x faster than c

Read more →
Roberto StagiRoberto Stagi
Models

Other updates

The Open Knowledge Formathttps://cloud.google.com/blog/products/data-analytics/how-the-open-knowledge-format-can-improve-data-sharing/: an open specification that formalizes the LLM-wiki pattern into

Read more →
Roberto StagiRoberto Stagi
Models

OpenRouter Fusion

OpenRouter claims Fusion achieves Fable-level intelligence at half the price. How does it work? When you send a prompt to Fusion, we fan it out to a panel of models in parallel, each with web search a

Read more →
Roberto StagiRoberto Stagi
Models

MiniMax: M3

Open-weight, natively multimodal, 1M-token context on the new MiniMax Sparse Attention architecture: ~1/20th per-token compute at 1M context and 59.0% on SWE-Bench Pro, ahead of GPT-5.5 in MiniMax's o

Read more →
Roberto StagiRoberto Stagi
Vibe Coding

Loop Engineering

"Loop engineering is replacing yourself as the person who prompts the agent. You design the system that does it instead." Everything started with a tweet from @steipetehttps://x.com/steipete: Then Add

Read more →
Roberto StagiRoberto Stagi
Vibe Coding

Codex Leaves the Codebase

OpenAI relaunched Codex as a tool "for every role" : six role plugins from Data Analytics to Investment Banking, Codex Sites builds and hosts web apps on OpenAI infra and document Annotations, at 5M+

Read more →
Roberto StagiRoberto Stagi

Other deals

- Cognition raised $1B at $26B, up from $10.2B eight months ago, on a $492M run-rate growing ~50% month-over-month. Sources: Cognitionhttps://cognition.ai/blog/series-d, TechCrunchhttps://techcrunch.c

Read more →
Roberto StagiRoberto Stagi

Computex lightning round

Vera Rubin NVL72 in full production racks that assemble in 5 minutes, RTX Spark NVIDIA's 1-petaflop Windows PC superchip with MediaTek, Microsoft's Maia 200 live in production, Spectrum-X co-packaged

Read more →
Roberto StagiRoberto Stagi
Agents

Cybersecurity

Hackers took 20,225 Instagram accounts by asking nicely: attackers hijacked high-profile accounts the Obama-era White House account, a Space Force chief, Sephora by asking Meta's AI Support Assistant

Read more →
Roberto StagiRoberto Stagi
Random

More Random

- Two features that look identical by every conventional metric can have wildly different causal effects. A feature's downstream connections predict its behavioral influence better than its activation

Read more →
Roberto StagiRoberto Stagi
Models

Is AI Accelerating?

Ben Todd argues AI capability gains are still compounding — even if recent model releases feel incremental, the overall curve hasn’t slowed. 1 Benchmarks Claude 4.6 and Mythos are roughly on trend acr

Read more →
Federico UlfoFederico Ulfo

SpaceX × Cursor

SpaceX adopted Cursor across engineering. A meaningful enterprise win for Cursor and a signal that frontier hardware shops are betting their dev productivity on AI-native IDEs. Sources: tweethttps://x

Read more →
Federico UlfoFederico Ulfo
Random

Random — quick links

- Claude Code finds the password of a locked Bitcoin wallet: tweethttps://x.com/cprkrn/status/2054586810475364536 - Casimir Effect to power a battery from the quantum field, hence battery-free. Likely

Read more →
Federico UlfoFederico Ulfo