DeepSeek V4
DeepSeek just dropped V4 preview — two open-weights MoE models that push the frontier on cost-effective 1M-token context. DeepSeek-V4-Pro: 1.6T total params 49B active — flagship performance rivaling
DeepSeek just dropped V4 preview — two open-weights MoE models that push the frontier on cost-effective 1M-token context. DeepSeek-V4-Pro: 1.6T total params 49B active — flagship performance rivaling
GPT 5.5 OpenAI shipped GPT-5.5 — an incremental but meaningful step on the way to GPT-6. The release keeps OpenAI in the conversation while Anthropic and DeepSeek crowd the frontier from both sides. S
Models don't always say what they think, they instead encode their thinking into tokens that are not human readable. Anthropic introduces a solution to train models to convert internal neural activati
Sakana AI & NVIDIA's ICML 2026 paper introduces TwELL — a new sparse format for LLM feedforward layers that achieves 95% unstructured sparsity via ReLU + light L1 while staying fully compatible with f
Scott Aaronson asks why physical systems become more “interesting” before settling into disorder, even though entropy only increases. Using a coffee cup example separate → swirling patterns → fully mi
~97% of your vector database is mathematically empty. Your RAG system is retrieving from noise. Sources: tweethttps://x.com/anirudhbvce/status/2052532004919361958
@Thariq from Claude Code suggests to use HTML instead of MD files, this to me sounds like the typical "never ask the barber if you need a haircut", but @Karpathy also confirm that HTML are actually an
- Bojie Li introduces Incompressible Knowledge Probes IKP, 1,400 obscure factual questions across 7 tiers of difficulty, to measure factual recall in 188 models from 27 vendors including closed APIs.
Users noticed Opus 4.6 quality slipped during peak hours. Anthropic eventually acknowledged compute rationing — same pattern we covered in Part 1. Sources: tweethttps://x.com/ns123abc/status/204741445
Google DeepMind published Decoupled DiLoCo, the next iteration of their distributed low-communication training method. It enables training across data centers and potentially across the planet with dr
Ben Todd argues AI capability gains are still compounding — even if recent model releases feel incremental, the overall curve hasn’t slowed. 1 Benchmarks Claude 4.6 and Mythos are roughly on trend acr
Salvatore Sanfilippo Antirezhttps://x.com/antirez, of Redis fame dropped DS4, a narrow-bet inference engine that runs DeepSeek V4 Flash locally on Apple Silicon Metal and Linux CUDA. Not a generic GGU
Note: ssh was created in 1995, tmux was created in 2007.
Rankings by category show that frontier models have distinct strengths and tradeoffs. Source: tweethttps://x.com/arena/status/2054223408427372831
China's CXMT RAM is selling for $150, while the global average price is around $300 to $400. Chinese memory giants CXMT and YMTC are aggressively ramping production. Sources: tweethttps://x.com/BullTh
Sources: DHSgov tweethttps://x.com/DHSgov/status/2057817233200418837
🇨🇳 China is committing roughly $1T to AI/energy infrastructure with a planned 30-year recoup horizon. Patient capital at a scale Western markets aren't structured to deploy. Sources: tweethttps://x.
Fiber optics is still happening at the battlefield, although not as much as it used to be. It's extremely pricey now. We used to buy 50km spool for $300, now it's easily $2500. At least a positive sec
Source: tweethttps://x.com/karpathy/status/2056753169888334312
SpaceX adopted Cursor across engineering. A meaningful enterprise win for Cursor and a signal that frontier hardware shops are betting their dev productivity on AI-native IDEs. Sources: tweethttps://x
The rumored Meta acquisition of Manus fell through. Manus stays independent for now; Meta keeps shopping.
Sequoia and Lightspeed co-led Europe's largest seed funding round: $1.1B at $5.1B post-money for ex-DeepMind David Silver's Ineffable Intelligence. Silver was the lead behind AlphaGo and AlphaZero — i
🇪🇺 Yes, the supermarket chain. Lidl is leaning into data center buildouts in Europe — a reminder that capex is flowing from every corner of the continent that has cheap power and spare land. Sources
- AI in Agriculture - Company Brain & AI-native services - Counter-Swarm Defense - GPU in Space ... Sources: Y Combinatorhttps://x.com/ycombinator/status/2048834285197812146
Richard Dawkins went on record saying he believes "Claudia" may be conscious. One of the most prominent reductionist materialists of the last 50 years thinks AI might be conscious. Sources: tweethttps
If LLMs can produce complex behavior from simple rules, then consciousness may not be a mystical add-on to physics. Sources: tweethttps://x.com/realBigBrainAI/status/2053135484554195142
Do you know how hard you have to abuse a mammal for them not to have children? — Connor Leahyhttps://x.com/NPCollapse This quote is from a talk at the Nexus Conference in Amsterdam in 2025: 'Apocalyps
Building on Tribe v1 which we covered in March Part 2https://aisocratic.org/blog/ai-socratic-march-2026-part-2, Meta's predictive brain models are now being demoed at a fidelity that's making people u
Dwarkesh recently started running a new blackboard lectures series with some of the top researchers and engineers in the space.. and we are all here for it 🙌 How GPT, Claude, and Gemini are actually
Anthropic CFO Krishna Rao shared that the company’s revenue run-rate grew from $9B to $30B in one quarter, with 500%+ NDR, 9 of the Fortune 10 as customers, and over 90% of internal code written by Cl
Sequoia Capital's AI Ascent 2026 convened Greg Brockman, Andrej Karpathy, Demis Hassabis, Boris Cherny, Dmitri Dolgov, and more with 150+ leading founders and researchers to discuss the present and fu
Learn collider bias: among elite chess players / NBA players / elite academics, those with the lowest IQ are the best. Sources: tweethttps://x.com/MariosGeorgakis/status/2053017502662246409
Sources: tweethttps://x.com/samsheffer/status/2056820022144905380
- Claude Code finds the password of a locked Bitcoin wallet: tweethttps://x.com/cprkrn/status/2054586810475364536 - Casimir Effect to power a battery from the quantum field, hence battery-free. Likely
Search across events, members, and blog posts