Community Updates
A stream of short-form insights, takes, and news from the AI Socratic community.
Apple Opening Up Siri to Other Models
Source: tweet id 2037230804942610548
Gemini Embedding 2: Natively Multimodal Embedding Model
Gemini Embedding 2 is our first natively multimodal embedding model that maps text, images, video, audio and documents into a single embedding space, enabling multimodal retrieval a
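To make the "single embedding space" idea concrete, here is a minimal sketch of cross-modal retrieval by cosine similarity. It does not call the Gemini API: the file names and the three-dimensional vectors are made-up placeholders standing in for whatever an embedding model would return, and `search` is a hypothetical helper.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def search(query_vec, index, top_k=2):
    """Return the ids of the top_k items most similar to the query."""
    scored = [(cosine(query_vec, vec), item_id) for item_id, vec in index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [item_id for _, item_id in scored[:top_k]]

# Assumption: toy vectors, not real model output. In a shared space, items of
# different modalities are compared with the same similarity function.
index = {
    "caption.txt": [0.9, 0.1, 0.0],
    "cat_photo.jpg": [0.8, 0.2, 0.1],
    "podcast.mp3": [0.0, 0.2, 0.9],
}
query = [1.0, 0.0, 0.0]  # placeholder embedding of a text query
results = search(query, index)
print(results)
```

Because every modality lands in the same vector space, a text query can rank an image and an audio clip with one similarity function, which is the property the announcement highlights.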
OpenAI GPT-5.4 (xhigh) Released
Offers roughly the same benchmark performance as Gemini 3.1 Pro, but for ~25% of the price (USD per million tokens). Sources: Artificial Analysis https://artificialanalysis.ai/models/gpt-5-4
Google Gemini 3.1 Flash-Lite
This is the fastest lightweight model. Google has been releasing the Flash models shortly after the Pro models; Jeff Dean confirmed on the Latent Space podcast that the Flash models are a distill
Alibaba Qwen 3.5 Small Model Series
Introducing Qwen 3.5 Small Model Series: Qwen3.5-0.8B · Qwen3.5-2B · Qwen3.5-4B · Qwen3.5-9B. These small models are built on the same Qwen3.5 foundation — native multimodal, improved architecture, sc
xAI Grok 4.20 with Parallel Agents
xAI's new version of Grok runs four Grok 4 agents in parallel. The result is not too bad. xAI also added a new SuperGrok Heavy tier that runs 16 agents. While Grok is still far from OpenAI and Anthropic level, i
StepFun's Step 3.5 Flash
A sparse MoE model with 196B total params but only 11B activated per token. It was designed to fit into 128 GB of memory, i.e. it can run on a DGX Spark or other local setups. It is one of the first
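A rough back-of-the-envelope check of the 128 GB claim, counting weights only. The parameter counts and the memory budget come from the post; the bit widths are assumptions, since the post does not say which precision the model is meant to run at, and the estimate ignores KV cache, activations, and runtime overhead.

```python
TOTAL_PARAMS = 196e9   # total parameters, from the post
ACTIVE_PARAMS = 11e9   # parameters activated per token, from the post
BUDGET_GB = 128        # target memory budget, from the post

def weight_memory_gb(total_params, bits_per_param):
    """Approximate memory for the weights alone, in decimal gigabytes."""
    return total_params * bits_per_param / 8 / 1e9

# Assumption: these precisions are illustrative, not taken from the release.
for bits in (16, 8, 4):
    gb = weight_memory_gb(TOTAL_PARAMS, bits)
    print(f"{bits:>2}-bit weights: {gb:.0f} GB, fits in {BUDGET_GB} GB: {gb <= BUDGET_GB}")

# All weights must be resident in memory, but only a small share does work
# for any one token.
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
print(f"active per token: {active_fraction:.1%}")
```

Under these assumptions only the 4-bit case (about 98 GB of weights) fits inside 128 GB. The total parameter count sets the memory footprint, while the much smaller active count sets the compute per token.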