Skip to main content
AI Socratic

Community Updates

A stream of short-form insights, takes, and news from the AI Socratic community.

Apple Opening Up Siri to Other Models

Apple Opening Up Siri To Other tweet id="2037230804942610548"

Federico Ulfo
Federico Ulfo
Mar 30, 2026

Gemini Embedding 2: Natively Multimodal Embedding Model

Gemini Embedding 2 Gemini Embedding 2 is our first natively multimodal embedding model that maps text, images, video, audio and documents into a single embedding space, enabling multimodal retrieval a

Federico Ulfo
Federico Ulfo
Mar 30, 2026

OpenAI GPT-5.4 (xhigh) Released

Offers roughly the same benchmark performance as Gemini 3.1 Pro, but for ~25% $USD/M tokens. Sources: Artificial Analysishttps://artificialanalysis.ai/models/gpt-5-4

Federico Ulfo
Federico Ulfo
Mar 3, 2026

Google Gemini 3.1 Flash-Lite

This is the fastest lightweight model. Google has been releasing the Flash model shortly after releasing the Pro models, Jeff Dean in the Latent Space Pod confirmed that the flash models are a distill

Federico Ulfo
Federico Ulfo
Mar 3, 2026

Alibaba Qwen 3.5 Small Model Series

Introducing Qwen 3.5 Small Model Series: Qwen3.5-0.8B · Qwen3.5-2B · Qwen3.5-4B · Qwen3.5-9B. These small models are built on the same Qwen3.5 foundation — native multimodal, improved architecture, sc

Federico Ulfo
Federico Ulfo
Mar 3, 2026

xAI Grok 4.20 with Parallel Agents

xAI new version of Grok runs 4 Grok4 agents in parallel. The result is not too bad. xAI added a new SuperGrok Heavy tier that runs 16 agents. While Grok is still far from OpenAI and Anthropic level, i

Federico Ulfo
Federico Ulfo
Mar 3, 2026

StepFun's Step 3.5 Flash

Sparse MoE model with 196B total params, but only 11B activated per token, this model was designed to fit into 128 GB memory i.e. it can run on DGX spark or other local setups. It is one of the first

Federico Ulfo
Federico Ulfo
Mar 3, 2026

Search

Search across events, members, and blog posts