Skip to main content
AI Socratic
February 2025
Models

xAI launches Grok 3

xAI launched Grok3 this week, it is an order of magnitude more capable than Grok 2, with 10x more computing power thanks to xAI's Colossus 100k H100s.

Grok 3 excels in math, science, coding, and general knowledge, with notable performance in image understanding tasks, achieving a 73.2% score on the MMMU benchmark (xAI Blog).

It's seems to be o1 level but it also introduced DeepResearch (they called it DeepSearch) and Search capabilities. The subscription costs $22/month. The most interesting feature of Grok is still the direct access to the twitter feed.

So while G3 is not full o3 level, which didn't launch yet, it signal xAI entering the arena in a competitive way.

https://x.com/adonis_singh/status/1892109817830851060

Federico UlfoFederico Ulfo
Hardware

Microsoft Launches Quantum Chip Majorana 1

https://x.com/MSFTQuantum/status/1892252249797124598

Microsoft’s Majorana 1, unveiled today, is the world’s first quantum processor using topological qubits. Here are three key insights:

  • Stable Qubits: It harnesses Majorana zero modes for more stable, error-resistant qubits, advancing quantum computing scalability.
  • Extreme Conditions: The processor relies on near-absolute-zero temperatures and magnetic fields to create a rare topological state of matter.
  • Scalable Future: Microsoft aims to scale to a million qubits on one chip, building on its 2024 Quantinuum collaboration for reliable quantum systems.
Federico UlfoFederico Ulfo

Hyperscaler Capex

https://x.com/kimmonismus/status/1889784565935317169

The hardware AI landscape is accelerating at a breathtaking pace. Tech giants like Microsoft, Google, Meta, and Amazon spending $528 billion on AI infrastructure in 2024 to pursue artificial general intelligence (AGI). Leaders like Dario Amodei and Sam Altman predict AGI could arrive within 1-2 years, promising breakthroughs in health and energy but also warning of job losses, inequality, and societal risks. Despite challenges like electricity and chip shortages, the post calls for urgent global discussions to ensure equitable outcomes as humanity approaches this transformative era.

Federico UlfoFederico Ulfo
Models

OpenAI Launches O3, Operator, and DeepResearch

We're just at the beginning of the year and OpenAI launched 3 new products under the pro subscription for $200/month.

  • O3 is a new category of GPT models that score 87% on the ARC challenge. OpenAI released o3-mini and o3-mini-high for coding.

  • Operator is an AI agent mode that can be used with chatgpt-4o to use a desktop simulator and run actions that require browsing a web page and clicking links. Here's Karpathy's take on Operator.

  • DeepResearch enables running long research that collects content across multiple sources and summarizes it into a coherent report. It's a super powerful tool that has been received with a bang. It really makes the OpenAI pro subscription worth it. DeepResearch is currently the highest scoring in the Humanity's Last Exam.

https://x.com/tomaspueyo/status/1887270096013529530

Federico UlfoFederico Ulfo
← NewerFebruary 2025Older →

Search

Search across events, members, and blog posts