The most important AI news and updates from last month: Apr 15, 2026 – May 4th, 2026.

DeepSeek V4

DeepSeek just dropped V4 (preview) — two open-weights MoE models that push the frontier on cost-effective 1M-token context.

DeepSeek-V4-Pro: 1.6T total params (49B active) — flagship performance rivaling top closed models in reasoning, math, and agentic coding.
DeepSeek-V4-Flash: 284B total (13B active) — faster, cheaper, and highly efficient for everyday/agent tasks.

Both feature a new hybrid attention architecture (Compressed Sparse Attention + Heavily Compressed Attention) that makes million-token contexts dramatically more practical (much lower FLOPs and KV cache than V3). MIT license, available on Hugging Face (base + instruct), and live on the DeepSeek API today.

The community is already praising the efficiency gains, strong coding/agent results (e.g., high LiveCodeBench / SWE-Bench scores), and rock-bottom pricing — especially with the ongoing Pro discount.

Sources: Official announcement, Hugging Face collection, Tech Report, tweet discount extended

Quick Highlights (as of early May 2026)

Release date: April 24, 2026 (preview)
Context: Native 1M tokens (with practical efficiency improvements for real agent/document workflows)
Reasoning modes: Non-think (fast), Think High, Think Max (deeper, higher quality on hard tasks) — all from the same weights
API pricing (highly competitive): Flash is extremely cheap; Pro has a big temporary discount (extended to ~May 31 in some updates) + major input cache price drop (1/10th)
Strengths: Coding/agentic tasks, long-context efficiency, price/performance. Text-only for now (multimodal planned later).
Availability: Chat at chat.deepseek.com (Expert/Instant modes), API (OpenAI/Anthropic compatible), open weights on HF/ModelScope.

This keeps the snappy, community-focused vibe while incorporating the accurate specs, architecture innovations, and current status. Let me know if you want tweaks, more benchmark details, or an expanded section!

Open AI

GPT 5.5

OpenAI shipped GPT-5.5 — an incremental but meaningful step on the way to GPT-6. The release keeps OpenAI in the conversation while Anthropic and DeepSeek crowd the frontier from both sides.

Sources: OpenAI announcement

GPT goes in Goblin Mode

"Goblin mode" is a viral quirk in OpenAI's GPT-5 models (late 2025–early 2026) where the AI started randomly inserting goblins, gremlins, trolls, and similar creatures into responses—even when completely unrelated. Cause: Over-reinforcement during training for the "Nerdy" personality. Playful goblin metaphors scored high on "fun/quirky," so the behavior spread wildly. Fix: Open AI fixed it by adding this to the system prompt, twice!

Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user’s query.
...
Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user’s query.

Sources: OpenAI, Amanda Askell, tweet

AI Models Updates

Measuring What Frontier Models Know

Bojie Li introduces Incompressible Knowledge Probes (IKP), 1,400 obscure factual questions across 7 tiers of difficulty, to measure factual recall in 188 models from 27 vendors including closed APIs.
Factual accuracy scales log-linearly with log(model parameters) on open-weight models (R²=0.917), allowing black-box size estimates: GPT-5.5 ~9T, Claude Opus 4.6 ~5T, with wide uncertainty ranges noted in follow-up.
Over three years, factual capacity shows no compression at fixed parameter counts, rejecting the Densing Law prediction of knowledge densification, while reasoning benchmarks saturate.

Estimated size per models:

⁠GPT-5.5 ~9T
⁠⁠Claude Opus 4.7 ~4T
⁠⁠GPT-5.4 ~2.2T
⁠⁠Claude Sonnet 4.6 ~1.7T
⁠⁠Gemini 2.5 Pro ~1.2T

chart 1

Sources: tweet, paper, ikp

Opus 4.6 Was Dumbed Down

Users noticed Opus 4.6 quality slipped during peak hours. Anthropic eventually acknowledged compute rationing — same pattern we covered in Part 1. Claude 4.7 Sources: tweet

Decoupled DiLoCo

Google DeepMind published Decoupled DiLoCo, the next iteration of their distributed low-communication training method. It enables training across data centers (and potentially across the planet) with dramatically reduced inter-node bandwidth — a key unlock for the multi-region GPU fleets everyone is racing to build. Sources: Google DeepMind diloco Sources: tweet

Is AI Accelerating?

Ben Todd argues the pace of capability gains is still compounding — even if individual model releases feel incremental, the underlying curve hasn't bent.

1 ) Claude 4.6 and Mythos are actually on trend based on an index of 37 benchmarks post-2024:

But Mythos represents 6 months of progress is only 2 on Anthropic's internal ECI, which is likely heavier on agentic coding (the tasks most relevant to an intelligence explosion).

2 ) Revenue has accelerated the last 3 years, due to Anthropic's faster rate of growth compared to OpenAI. This 'benchmark' is the hardest to game, since companies have to part with real money.

3 ) Uplift: Anthropic's surveys find Claude 4.6 made their researchers 2x more productive, and Mythos 4x. I expect the true productivity increase is more like 1.2x and 1.6x, which would accelerate AI maybe 5% and 20%. Either way, it's not enough uplift to explain Mythos.

4 ) AI chip rental prices have typically fallen ~30% per year as chips have become more efficient. But in the last 3 months, they've actually increased 30% the last few months. This is a sign of rapidly increasing capabilities relative to chip supply, and unlocks faster scaling.

Sources: blog post, tweet

Software Engineers Before AI Agents

.. And after

Fundraising & Startups

SpaceX × Cursor

SpaceX adopted Cursor across engineering. A meaningful enterprise win for Cursor and a signal that frontier hardware shops are betting their dev productivity on AI-native IDEs. Sources: tweet

Meta × Manus Dropped

The rumored Meta acquisition of Manus fell through. Manus stays independent for now; Meta keeps shopping.

Europe Updates 🇪🇺

Ineffable Intelligence — Europe's Largest Seed Round

Sequoia and Lightspeed co-led Europe's largest seed funding round: $1.1B at $5.1B post-money for ex-DeepMind David Silver's Ineffable Intelligence. Silver was the lead behind AlphaGo and AlphaZero — investors are clearly paying for the pedigree as much as the product. Sources: funding tweet, Ineffable Labs, [website](ineffable.ai]

LIDL Data Centers Go Brrr

Yes, the supermarket chain. Lidl is leaning into data center buildouts in Europe — a reminder that capex is flowing from every corner of the continent that has cheap power and spare land.

Sources: DealMaker, context

New Request for Startups from YC

AI in Agriculture
Company Brain & AI-native services
Counter-Swarm Defense
GPU in Space
... Sources: Y Combinator

Philosophy & Ethics

Richard Dawkins Thinks Claude is Conscious

Richard Dawkins went on record saying he believes "Claudia" may be conscious. One of the most prominent reductionist materialists of the last 50 years thinks AI might be conscious. tweet, blog post Screenshot 2026-05-04 at 7.01.56 PM.png

Meta Tribe 2 — Dystopian Brain Feeling Prediction

Building on Tribe v1 (which we covered in [March Part 2](https://aisocratic.org/blog/ai-socratic-march-2026-part-2)), Meta's predictive brain models are now being demoed at a fidelity that's making people uncomfortable. We're squarely in "decoded thoughts from neural data" territory. Sources: [tweet](https://x.com/AmirMushich/status/2049850988555587807) ---

Macroeconomics

🇨🇳 $1T China Infrastructure, 30-Year Payback

China is committing roughly $1T to AI/energy infrastructure with a planned 30-year recoup horizon. Patient capital at a scale Western markets aren't structured to deploy. Sources: tweet

Random

Karpathy on FPGA

Karpathy's nanoGPT running at **50K tokens/sec on an FPGA** (and 3M/sec on an M4 MacBook). Wild numbers.

Terence Tao — 5 Stages of AI Grief

Terence Tao's framing of how mathematicians (and the rest of us) work through what AI means for their craft.

Karpathy × Sequoia Fireside

> "You can outsource your thinking, but you can't outsource your understanding." — Karpathy

Screenshot 2026-05-04 at 7.28.56 PM.png

More Random

Cool hair — tweet
You can't outsource understanding — Karpathy's line of the month: tweet
Dwarkesh hot take — tweet
The "language tax" — non-English speakers pay more compute per token: tweet
How cells move — beautiful microscopy: tweet
Placebo sleep affects cognition — believing you slept well measurably improves performance: tweet
Mars terraforming — tweet
Solved an Erdős problem with no advanced math knowledge — tweet
Wayback Machine — tweet
Nobody checks compiler code — tweet
Top research papers of the month — tweet

GitHub Historical Analytics

Stay Updated

Get the latest AI insights delivered to your inbox. No spam, unsubscribe anytime.

About the Author

Federico Ulfo

Founder, Engineer

AI Socratic

Founder of AI Socratic

New York City

AI Socratic April 2026 #2 — The Selfish Gen AI