Community Updates
A stream of short-form insights, takes, and news from the AI Socratic community.
GradMem: Writing Context into LLM Memory via Test-Time Gradient Descent
GradMem: Writing Context into LLM Memory via Test-Time Gradient Descenthttps://x.com/yurakuratov/status/2034989528818098615, GradMem introduces writing context into memory using test-time gradient des
100M Token Context Without Collapse on 2×A800 GPUs
100M Token Context Without Collapse: <9% Degradation on 2×A800 GPUshttps://x.com/marmaduke091/status/2034736279884025916, New research achieves 100M token context windows with less than 9% degradation
LLM Internals: By Layer 10, Models Are Language-Agnostic
LLM Internals: By Layer 10, Models Don't Know What Language They're Readinghttps://x.com/dnhkng/status/2036187044704035089, A new blog post reveals that when feeding the same sentence in English and C
LLM Fused with Mini Computer: Switching Between Text and Machine Code
LLM Fused with Mini Computer: Switching Between Text and Machine Code in Single GPUhttps://x.com/EastlondonDev/status/2036759710070645071, A developer demonstrates an LLM brain fused with a mini compu
Columbia Exposes Flaws in Private AI Inference: 280GB per Query
Columbia University Exposes Flaws in Private AI Inference: Prior Methods Used 280GB per Queryhttps://x.com/godofprompt/status/2035697480227266950, Columbia University researchers prove that the entire
ARC-AGI-3 Announced: Humans Score 100%, AI < 1%
This is so far the only unsaturated agentic intelligence benchmark. Unlike benchmarks that test what models already know, ARC-AGI-3 tests how they learn and acquire new skills, providing a formal meas
Quantization Explained
Quantization Explained tweet id="2036844409145512255"