The most important AI news and updates from last month: Mar 15, 2026 – Apr 15, 2026.

Upcoming Events

Anissa (my wife) and I (Fed) are going on a tour in Europe and China to start new chapters of the AI Socratic. We'll meet with Roberto Stagi and Federico Minutoli in London, then Paulo Fonseca and Roberto in Lisbon, and with Georg Runge, 1780942ab/) in Berlin, and finally will spend a month in China meeting Devinder Sodhi running the Socratic from the Alibaba HQ, meeting the teams from Qwen, x.AI, GLM, Kimi, Unitree, Xiaomi. We'll visit a few EV and Robot factories. Excited to learn more about AI from the APAC regions.

anissa and fed

Anthropic

Anthropic as usual gets its own dedicated section as they keep on mogging everyone.

Opus 4.7

It's a decent improvement over Opus 4.6, but it's not a step function better. What you need to know about Opus 4.7:

Takes instructions literally
Better vision means improved computer use and producing slides and other visual artifacts
Optimized for large-scale real-world analysis
Better at using file system-based memory
Costs 2x the tokens + uses 25% more tokens than Opus 4.6

Sources: tweet, AI Arena

Mythos

We briefly mentioned the new Anthropic model leak in the previous blog post, we now have more information about it:

Software engineering and coding — It acts like a senior-level engineer, spotting subtle bugs, self-correcting, and achieving high scores on benchmarks (e.g., ~93.9% on SWE-bench Verified vs. 80.8% for Opus 4.6).
Complex reasoning — Big jumps on math (e.g., much higher on USAMO 2026), science, and knowledge work.
Cybersecurity — This is the headline feature. It autonomously discovers and exploits zero-day vulnerabilities at a scale and speed that far exceeds previous models and even most expert humans.

Sources: Project Glasswing, tweet, tweet, tweet

Claude Code Leak

On March 31, Anthropic accidentally shipped the entire source code of Claude Code to the public npm registry. A 59.8 MB JavaScript source map (meant for debugging) got bundled into the claude-code npm package. ~512K lines across ~1,900 files, exposed for hours before it was flagged on X and mirrored on GitHub.

The leak quickly turned into a treasure hunt. In the first week of April the community zeroed in on several unreleased, production-grade features hidden behind feature flags.

Plenty of other flags were spotted too — some users counted 44–46 unreleased ones, plus multi-agent swarm orchestration and a remote killswitch.

Sources: tweet

Claude Managed Agents

Managed Agents

Claude Managed Agents is Anthropic’s hosted service (beta, April 2026) for running autonomous AI agents without managing infrastructure.

Instead of building your own agent loop (tool use, memory, orchestration, sandboxing), you define the agent (prompt, tools, permissions), and Anthropic runs it in their cloud—handling execution, state, containers, and monitoring.

Sources: tweet

Claude Design

claude design Anthropic launched an AI design tool, and completely mogged Figma—right after Anthropic's CPO left Figma’s board.

Because it sits upstream (AI infra), Anthropic can see what’s working and build competing products—similar to Amazon’s playbook. Figma stock fell ~7%.

Sources: tweet, Claude Design Tutorial

Claude HR?

Dario gollum

Anthropic rationing compute right now

OpenAI

ChatGPT Image 2

Image 2 is really really good, I've asked to update the header image with this prompt:

make this image in studio ghibli and with more green and plants

It's incredibly good at combining multiple subjects together while keeping it coherent and with a good image quality too AI combo

Gpt-image-2 is able to create an images of a code that generates an SVG pelican ...

... and it almost passes the pelican test

Sources: tweet tweet, text-to-image arena bench, text-to-image arena bench 2

Codex Desktop computer use, In-App Browser, and Agent Workspace

Codex Desktop

OpenAI rolled out "Codex for almost everything." The desktop app can now see your screen, move its own cursor, click, and type inside native Mac apps — and run multiple agents in the background without interrupting you. It also added an in-app browser (with comment mode), native image generation, improved memory, and 90+ plugins.

Also Introducing workspace agents in ChatGPT—shared agents that can handle complex tasks and long-running workflows across tools and teams. OpenAI follows Claude Code now with Agent Manager.

Sources: OpenAI announcement, in-app browser, agent workspace

Anthropic Surpasses OpenAI in ARR

OpenAI on March 30th closed a record-breaking $122 billion funding round at an $852 billion post-money valuation. The round was anchored by Amazon, NVIDIA, and SoftBank.

Sources: tweet, arr chart

Nvidia

The biggest NVIDIA news this month is the Dwarkesh x Jensen interview, giving us one of the best x-rays into Jensen's mind and his strategy to remain the leader in AI.

The memes were strong!

I don't wake up to be a loser

Sources: full episode, snippet from heated conversations, tweet

New Models

Google — Gemma 4

gemma 4 Google DeepMind launched Gemma 4, a new family of open models under Apache 2.0. The small variants (26B MoE and 31B) outperform models over 10x their size on reasoning and agentic benchmarks while being optimized for on-device and local use.

Built-in function calling
Up to 256K context on the bigger models
Sizes range from phone/Raspberry Pi (E2B/E4B) to workstation (31B dense + 26B MoE with only ~4B active params for efficiency).

Sources: gemma 4

Moonshot — Kimi 2.6

kimi 2.6 bench Open-source SOTA across key benchmarks. 1T parameters, 32B active.

Handles 4K+ tool calls in a single session and runs continuously for 12+ hours
Agent swarms (300 agents × 4K steps, 100+ files)
Native multimodality (image, video)

Sources: kimi 2.6,Kimi 2.6, Kimi 2.6 Bench

Xiaomi — MiMo-V2.5

mimo v2.5 MiMo-V2-Pro (1T+ total / 42B active) and open-weights MiMo-V2-Flash (309B total / 15B active). Optimized for long-horizon agent workflows with up to 1M context on Pro. Approaches Opus 4.6 level.

Pro handles autonomously 1,000+ tool calls
Flash delivers strong open-source coding performance (73.4% SWE-Bench Verified)
Hybrid attention + Multi-Token Prediction for efficient long-context reasoning and fast generation

Sources: Mimo 2.5

Learning LLMs

Mini-ML — 12M parameter LLM from scratch

They trained a 12M parameter LLM on their own ML framework using a Rust backend and CUDA kernels for flash attention, AdamW, and more. Inspirational project for anyone who wants to better understand how to build LLMs.

Sources: tweet

13+ Attention Mechanisms You Should Know

In the last year we've seen different types of attention arising. This blog post shows you 13 attention mechanisms you should know and the papers that discuss them.

attention Sources: tweet, blog post

All LLMs are either claude-like or GPT-like

Sources: tweet

Apple — White smoke from Cupertino HQ

John Ternus

Apple names John Ternus as next Apple CEO

Ternus joined in 2001 on the Product Design team. Rose through hardware engineering roles: VP of Hardware Engineering (2013), Senior VP (2021), now leading hardware for iPhone, Mac (including Apple Silicon transition), iPad, Apple Watch, AirPods, Vision Pro, and more. Expecting great changes at Apple on the path to become an AI innovator! He starts on September 1, 2026.

Good by Tim Apple!

Sources: [tweet](https://x.com/wallstengine/status/2046326625181262174), [tweet](https://x.com/ecommerceshares/status/2046352914222866652)

Vibe Coding

Karpathy's Knowledge Bases

Karpathy shared his approach to organizing knowledge bases for effective vibe coding with AI agents.

Sources: tweet

Karpathy knowledge base

Research

🧪 Simula — Reasoning-Driven Synthetic Data

Simula Google across DeepMind and Research introduces Simula, a framework and approach to data scarcity and synthetic data generation using AI assistants and reasoning-driven workflows to develop and deploy multi-modal AI in domains where data scarcity or privacy concerns are paramount.

Sources: PDF

🧠 Agentic AI & the Next Intelligence Explosion

Agentic AI The idea behind this paper from Google is that intelligence is not a property of isolated systems, but of interactions between them. Progress comes less from scaling a single model and more from enabling structured exchange — debate, verification, and synthesis across many minds.

Sources: Paper, tweet

❤️ Emotion Concepts in LLMs

Anthropic Emotions New Anthropic research: emotion concepts and their function in a large language model. All LLMs sometimes act like they have emotions. But why? Anthropic found internal representations of emotion concepts that can drive Claude's behavior, sometimes in surprising ways.

Impact on Behavior: Acts like a steering wheel for preferences (e.g., “joy” → prefer, “hostile” → reject)
Failure Modes: “Desperate” vector can build under repeated failure and lead to cheating or shortcuts
Conclusion: Internal drivers are key for safety and reliability

Sources: tweet

More Research

Anthropic Fellow Program: If you're a researcher working on AI safety, security & economics/society impact, apply to receive a stipend from Anthropic. tweet
Scaling Brain Emulation. This researcher thinks it is possible to emulate a human brain with the right amount of scale. Last month we showed the simulation of a fruit fly brain into a NN, and he intends to scale that. Digital humans are more possible than most think — with capable AI researchers helping, maybe for $10B, maybe in less than 10 years, on 50k H100s. Sources: tweet, random tweet

Startup Updates & Fundraising

Meta cuts 8,000 jobs — or 10% of their workforce. Sources: tweet
OpenAI product CTO leaves as they're focusing on product development. Whoops. Sources: tweet
Salesforce headless — a Salesforce subscription that only provides an API to access their services. Sources: tweet
AllBird rebrands to AIBird — not even joking. Their stock went up 700% in a day. They said they'll be focusing on AI infra, whatever that means. We already saw this playbook in 2021 with Long Ice Tea Corp. rebranding to Long Blockchain Corp. Their CEO was indicted for insider trading and went to jail. I'm sure the founders of Allbirds took the right precautions, but we'll see.

They're Moving Faster Than You

blog In this short essay, Claire points out that most companies are in the middle of the Bell Curve, while the winners are on the extreme right with top-down edits, investment in internal AI tools, token budgets, and dashboards to track who's using more tokens (Meta recently had a leaderboard for this). To win, you must be on the extreme right of the Bell Curve! Sources: tweet

Robotics — 🇨🇳 Beijing Robot Marathon Competition

Lamp Robot, an alternative to humanoid robots at home tweet
Build a humanoid robot at home tweet

Geopolitics — Blockage of the Strait of Hurmuz 🇮🇷 and the Raising Oil Cost 📈

Oil

Who Controls the Spice Controls the Universe

The Strait of Hurmuz is still closed — this affects many sectors including fertilizers, aluminum, and of course oil prices, which directly affects electricity — and as a second or third order it affects AI as well. GPU fabs are energy hungry, and the rationing of oil might slow down the AI expansion. Also, training and inference costs might go higher. On the bright side, this will push to accelerate renewable energy. Singapore, Indonesia, and Vietnam have 20–40 days of gas.

Hurmuz

Cost of electricity in 2026

Sources: tweet, electricity

Macroeconomics — The Fuckening Continues

There's never been an investment like this one! Sources: tweet

SaasPocalipse (The Fuckening)

naval Bro was right.

Atlassian: down 75%
HubSpot: down 69%
Figma: down 86%

Almost all of them down 30–70% from their 52-week highs.

Sources: tweet

saaspocalipse

More Macroeconomics

📉 MIT research paper: AI Layoffs Will Collapse the Economy This research paper from MIT argues that AI layoffs will collapse the economy — tweet
🎥 Post-Labor Society: probably the best video to understand what post-labor society may look like — tweet

Philosophy

The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness — Can AI be conscious?

Computational functionalism claims consciousness comes from abstract computation alone, independent of physical substrate. This piece argues that's a mistake — the "Abstraction Fallacy." Computation isn't intrinsic to physics; it's a human-imposed way of describing physical processes.

The key distinction is between simulation (systems that mimic behavior, like today's AI) and instantiation (systems whose physical structure actually generates experience). From this view, algorithms alone can't produce consciousness. If AI ever becomes conscious, it will be because of its physical makeup, not its code.

Sources: Paper, tweet

paper

sand to chip

Consciousness

Biotech and Longevity

Bryan Johnson, scientists, and even my grandma says that staying on a screen makes you dumb. Reducing screen time correlates with reduction in depression more than antidepressants. Last month we showed a screenless phone.

When we say I go offline you're using the wrong framing — the correct one is to normalize living the real life and making going online

Sources: tweet

blocking social media improves

🫁🫀 Organs on demand

“By creating a series of genetic edits, Kind Bio can alter the development of an embryo so that it forms organs without also forming limbs, a central nervous system and brain. The result is a group of organs growing in the womb. It sounds like science fiction, but Kind Bio has already done this hundreds of times in mice and rats”

Sources: tweet, tweet 2, core memory

organs on demand

Stem cell becoming a neuron — tweet
Two neurons having a conversation — tweet

Cybersecurity

AI Agent Traps

DeepMind just pointed out a pretty scary AI security gap: websites can tell when it's an agent — and show it totally different and malicious content than the one you see, for example:

Hidden instructions in HTML/CSS
Commands baked into images
Jailbreaks inside PDFs/files

Sources: tweet, paper

AI Agent Trap

More Cybersecurity

🏦 A Cyberattack on the Fed? Possible as we near real quantum computers and AGI tweet
▲ Vercel Was Hacked tweet
🇨🇳 Chinese Gov Secrets Leaked. A serious cyberattack leaks Chinese government secrets. Sources: tweet, tweet
Anthropic Mythos was hacked in the dumbest way possible: hackers, used the URL patterns to find its API (which was public) and then tried a few tokens from a third party eval company tweet

Random

⭐️ GitHub's Fake Star Economy

It's Goodart's Law all the way down 🐢

➗ One Binary Operator

All elementary functions can be generated from just one binary operator.

Sources: tweet, tweet

Artemis II — Moon Mission

We went around the moon! How cool is that.

Sources: tweet, tweet, tweet

More random

⭐️ Google releases Design.md — DESIGN.md lets you easily export and import your design rules from project to project tweet
The new way to tell a text is AI is not — (em dashes) or words like delve, it's the form of writing that says "It's not one just thing — it's another thing" tweet
AI optimism is waning because we’ve failed to tell a story where the future actually goes well. People see AI as a threat to their livelihood and status, while tech leaders scramble for the exit. To win the public back, we need bold, functional abundance tweet
☠️ Palantr techno fascism manifesto — does freedom requires surveillance? tweet
⛪️ Pope Leo XIV bold stand on the simulation and AI tweet
Paper: Mathematical Method And Human Thought In The Age Of AI by Terence Tao and Tanya Klowedn — development of AI remains human centered tweet
Personal life dashboard built with Claude tweet
Aaron Levie thinks there's an great opportunity to provide AI migration services to several small startups, that's a unique job opportunity for all of us! tweet
NN in Pure x86-64 Assembly: Building a Neural Network from scratch in pure x86-64 assembly tweet
😆 We can still have rave parties after WW3 https://x.com/HumansNoContext/status/2044890335755616621
Tinder x WorldCoin — scan your eye to get 3 extra boost tweet [/columns]

Cool videos

Longing for netwrok spirituality tweet
1990s animes hit different tweet
Makoto Shinkai created pure magic with The Garden of Words (2013). Ten years later and the visuals are still unmatched... truly a GOAT tweet
Thunders tracking tweet
Birds tracking tweet

Stay Updated

Get the latest AI insights delivered to your inbox. No spam, unsubscribe anytime.

About the Authors

Federico Ulfo

Founder, Engineer

AI Socratic

Founder of AI Socratic

New York City

Roberto Stagi

Founder

Org 520

Roberto Stagi is building a startup focused on AI Agents, prioritizing real-world use cases. His latest project was a travel-booking agent. He emphasizes Eval Driven Development for improving AI output quality. Stagi previously worked for Bending Spoons, an Italian company that acquired Evernote, and is interested in meeting new people and sharing his insights about Bending Spoons.

New York, USA

Federico Minutoli

Milan

Kerem Kazan

Founder

Org 544

I'm a sofware enginner focused on infrastructure and web3. I'm studying AI and deep learning as a hobby.; Engineer; Currently interviewing for ML infra roles. Last AI system: A transformer that comments on chess moves; The chess comment transformer is my last AI project; Why interested: I was asked to give a presentation on my chess commentator transformer. My general value-add is the breadth of my software engineering experience.; Because I loved the one in May. I brought some value by giving a talk on my last project :) Hoping to bring more value through my ML infra work.

New York

Amol Shah

Engineer

Org 257

Amol Shah focuses on AI projects, specifically in application layer startup, investing, and building. His work includes State Space models, addressing model-training contamination in academic benchmarking, and model pruning. He is also involved in customer service deployments using multi-modality, as evidenced by his AI transformation roadmap. His recent insights are documented in the "State of AI in Business 2025 Report." He is interested in connecting with other AI product professionals.

New York

Search

#Upcoming Events

#Anthropic

#Opus 4.7

#Mythos

#Claude Code Leak

#Claude Managed Agents

#Claude Design

#Claude HR?

#Anthropic rationing compute right now

#More From Anthropic

#OpenAI

#ChatGPT Image 2

#Codex Desktop computer use, In-App Browser, and Agent Workspace

#Anthropic Surpasses OpenAI in ARR

#Nvidia

#New Models

#Google — Gemma 4

#Moonshot — Kimi 2.6

#Xiaomi — MiMo-V2.5

#Learning LLMs

#Mini-ML — 12M parameter LLM from scratch

#13+ Attention Mechanisms You Should Know

#All LLMs are either claude-like or GPT-like

#Apple — White smoke from Cupertino HQ

#Apple names John Ternus as next Apple CEO

#Vibe Coding

#Karpathy's Knowledge Bases

#More interesting tools and content

#Research

#🧪 Simula — Reasoning-Driven Synthetic Data

#🧠 Agentic AI & the Next Intelligence Explosion

#❤️ Emotion Concepts in LLMs

#More Research

#Startup Updates & Fundraising

#They're Moving Faster Than You

#Robotics — 🇨🇳 Beijing Robot Marathon Competition

#More

#Geopolitics — Blockage of the Strait of Hurmuz 🇮🇷 and the Raising Oil Cost 📈

#Who Controls the Spice Controls the Universe

#Macroeconomics — The Fuckening Continues

#SaasPocalipse (The Fuckening)

#More Macroeconomics

#Philosophy

#The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness — Can AI be conscious?

#Biotech and Longevity

#🫁🫀 Organs on demand

#More

#Cybersecurity

#AI Agent Traps

#More Cybersecurity

#Random

#⭐️ GitHub's Fake Star Economy

#➗ One Binary Operator

#Artemis II — Moon Mission

#More random

#Cool videos

Stay Updated

About the Authors

Federico Ulfo

Roberto Stagi

Federico Minutoli

Kerem Kazan

Amol Shah

Matt Rabinsky

Related Posts

AI Socratic May 2026 — The Selfish Gen AI

AI Socratic March 2026 — #2

AI Socratic March 2026

Upcoming Events

Anthropic

Opus 4.7

Mythos

Claude Code Leak

Claude Managed Agents

Claude Design

Claude HR?

Anthropic rationing compute right now

More From Anthropic

OpenAI

ChatGPT Image 2

Codex Desktop computer use, In-App Browser, and Agent Workspace

Anthropic Surpasses OpenAI in ARR

Nvidia

New Models

Google — Gemma 4

Moonshot — Kimi 2.6

Xiaomi — MiMo-V2.5

Learning LLMs

Mini-ML — 12M parameter LLM from scratch

13+ Attention Mechanisms You Should Know

All LLMs are either claude-like or GPT-like

Apple — White smoke from Cupertino HQ

Apple names John Ternus as next Apple CEO

Vibe Coding

Karpathy's Knowledge Bases

More interesting tools and content

Research

🧪 Simula — Reasoning-Driven Synthetic Data

🧠 Agentic AI & the Next Intelligence Explosion

❤️ Emotion Concepts in LLMs

More Research

Startup Updates & Fundraising

They're Moving Faster Than You

Robotics — 🇨🇳 Beijing Robot Marathon Competition

More

Geopolitics — Blockage of the Strait of Hurmuz 🇮🇷 and the Raising Oil Cost 📈

Who Controls the Spice Controls the Universe

Macroeconomics — The Fuckening Continues

SaasPocalipse (The Fuckening)

More Macroeconomics

Philosophy

The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness — Can AI be conscious?

Biotech and Longevity

🫁🫀 Organs on demand

More

Cybersecurity

AI Agent Traps

More Cybersecurity

Random

⭐️ GitHub's Fake Star Economy

➗ One Binary Operator

Artemis II — Moon Mission

More random

Cool videos