The biggest event in January has been the launch of DeepSeek R1, which shook the market pushing NVIDIA stock down by 20% in a few days.
The reason behind this drop is that the team behind DeepSeek said the training costed only $5m, 1/30th of what OpenAI o1 is estimated to cost, with similar benchmark results. The model is open source but the weights are not.

DeepSeek-R1 makes it to the top 3 of the Arena Chatbots. The initial take from the industry and social media, is that DeepSeek performs better than o1, at least anecdotally. Most AI influencers pushed the narrative of R1 with urgency and wow to capture attention. So while several benchmarks are positive for R1, not all of them captured the reality.

DeepSeek-R1 performs well in math but it's performing to a lower level when it comes to other tests, including the AIW test. The reality check only came after the market drop of course.
All the top leaders have some take on DeepSeek to mitigate the damage: Alexandr Wang from Scale AI said DeepSeek must have 50k NVIDIA H100s, but they can't disclose that information because of the chip export restrictions. Satya Nadella mentioned the Jevons paradox, saying that as AI gets more efficient and affordable, we'll see its use skyrocket. Dario Amodei called on harder chip restriction to China.
This blog post circulated few days before the big drop, it's worth a read to understand the feeling of the moment, and the bear narrative for NVIDIA.
How Did We Get Here?
DeepSeek looks like it appeared out of nowhere but their team has been constantly delivering improved LLMs.

This event had a list of winners and losers:
Winners
-
High Fly the edge fund that very likely shorted the $NVIDIA stock, and pushed the market in panic mode. DeepSeek won this trading battle and shoke up the US showing that China is not only catching up but also pushing to become an AI leader.
-
NVIDIA, despite the 20% price drop, is a clear winner since AI is clearly not just a hype moment, and as price drops, we'll need more chips not less.
-
Google, building models and selling inference has a weak moat, distribution is a strong moat and Google has an edge.
-
Hugging Face started reproducing R1 and fully open-sourced everything, including the weights.
Losers
-
OpenAI, which spent billions of dollars on research to build o1 and is currently losing money on the pro subscriptions. With the launch of o3-mini, Operator, and DeepResearch, OpenAI regained the lead shortly after. But open source is catching up, and none of Google, Meta, or OpenAI have no moat against open source.
-
The US: China is now leading across multiple sectors, including drones, robotics, and now they're catching up on AI.
-
Taiwan is probably at higher risk now, a fallout between US and China could result in an invasion.
RL and Research
The impressive part of DeepSeek R1 is that it is able to expand on DeepSeek V3 without Supervised Fine-Tuning (SFT). Discovering some of the core discovery for 🍓Q*, gpt o1 codename. Everything we see coming out of R1 is an emergent property of RL.
Here are 2 classes to better understand how DeepSeek R1 works:
https://www.youtube.com/watch?v=XMnxKGVnEUc&t=329s
Key Insights From This Story
The amount of learning from what happened is outstanding and at best this has been a necessary step to get all of us laser focus on the target.
- Dario Amodei said that R1 is within the expected curve of growth, and it's actually not a step function above the curve.
- Haseeb Qureshi, DragonFly manager: deflationary cost of AI, makes AI moats are to sustain. Google has the strongest distribution and long-term edge, while OpenAI remains competitive despite the shakeup.
- Running DeepSeek from their server is not safe. Teams across the globe to save money started using DeepSeek.
- Since the weights are not disclosed, it's possible that the training set has some malicious content, so it's not safe to run, and a ban proposal is in act.
Full Sources List
- DeepSeek uses similar discovery of o1 https://x.com/markchen90/status/1884303237186216272
- DeepSeek appeared out of nowhere lol https://x.com/osanseviero/status/1884356079217434995
- HuggingFace deep seek open source ⭐ https://x.com/ClementDelangue/status/1883154611348910181
- https://x.com/eliebakouch/status/1883148201257234867, https://x.com/QGallouedec/status/1883143736869413171
- Understand deepseek v3 r1 blog posts https://x.com/cneuralnetwork/status/1883418147379945823
- diagram on how deepseek r1’s GRPO works https://x.com/Hesamation/status/1883992881914077493
- Yet another tale of Rise and fall - DeepSeek R1, benchmark ⭐ https://x.com/JJitsev/status/1883158738661691878
- 1.58bit DeepSeek https://x.com/UnslothAI/status/1883899061893546254
- DeepSeek running on apple Exo https://x.com/alexocheema/status/1884017521985995178
- most underrated aspect of DeepSeek is emergence https://x.com/amasad/status/1883998501300007279
- deep seek visual explainer https://x.com/omarsar0/status/1883994661918028009
- Chess game Xi Sama https://x.com/thekriskay/status/1883706165710016905
- Janus Pro-7b multimodal, https://x.com/rowancheung/status/1883917681642070282
- @SatyaNadella Jevons Paradox https://x.com/amasad/status/1883756969263304756
- Likely DeepSeek parent hedge fund is shorting $NVDA https://x.com/ChrisCamillo/status/1883855442943844727
- Theory on how DeepSeek trained at 1/30 the cost https://x.com/wordgrammer/status/1883712727073607859, https://x.com/wordgrammer/status/1883435405724553346, Emad https://x.com/EMostaque/status/1882965806134514000
- AlexandrWang DeepSeek has 50k H100 and used them secretly https://x.com/protosphinx/status/1882853844448911627
- @emostaque simpler way to understand deepseek https://x.com/EMostaque/status/1883863316688458003
- DeepSeek panic mode https://x.com/TheShortBear/status/1882783200998498542
- @hosseeb DeepSeek is dragging down the NASDAQ: DS is deflationary and intelligence is cheaper than we thought, NASDAQ is an index for producers not consumers. Nobody has a moat. Google is probably the winner out of this because of their distribution and data https://x.com/hosseeb/status/1883862843663237610
- @alexandr_wang DeepSeek is a wake up call for America https://x.com/alexandr_wang/status/1883368885640102092, every previous breakthrough was in the us (response to that https://x.com/JFPuget/status/1883517095465414724)
- Compare DeepSeek with o1 using RAG https://x.com/akshay_pachaar/status/1883497754649047192
- No moat in being closed sourced but only having a talented team that can keep innovating https://x.com/iScienceLuvr/status/1883254324769538075
- Neal Khosla, DeepSeek is a CCP state psyop https://x.com/nealkhosla/status/1882859736737194183, https://x.com/tunguz/status/1882845667942605154
- DeepSeek is as good as o1 https://x.com/RnaudBertrand/status/1882600399657705725
- Meta in panic mode https://x.com/SilverSpookGuy/status/1882525126039916819
- Arnaud Bertrand DS moment is about the world realizing China is leading https://x.com/RnaudBertrand/status/1882732479288942781
- DS r1 is the model of choice of university in the US overnight https://x.com/AnjneyMidha/status/1882669123492368586
- DS r1 top 3 in Arena https://x.com/lmarena_ai/status/1882749951924715578
- Hugging face technical dive into DeepSeek R1 https://x.com/Hesamation/status/1882151628012441663
- ARC eval https://x.com/arcprize/status/1881761987090325517
- Reward is all you need https://x.com/burny_tech/status/1881459096655991146
- DeepSeek r1 is here https://x.com/deepseek_ai/status/1881318130334814301
- ai.com now redirect to deepseek.com
- Demis Hassabis says DS is the best work out of China but there’s no actual new scientific advance https://x.com/tsarnick/status/1888681884647067981
- DeepSeek lied about costs https://x.com/7etsuo/status/1886770680794104221
- DeepSeek had rounding error https://x.com/wordgrammer/status/1885865920343466188
- open r1 https://x.com/QGallouedec/status/1885986730341200364
- semi analysis on deepseek https://x.com/SemiAnalysis_/status/1885192148037112023
- illustration of r1 https://x.com/himanshustwts/status/1885046490395029569
- r1 https://x.com/chamath/status/1885021294678122884
- deepseek came from nowhere https://x.com/iScienceLuvr/status/1884736091619537346