Mini-ML: 12M Parameter LLM Trained From Scratch in Rust
April 21, 2026
They trained a 12M parameter LLM on their own ML framework using a Rust backend and CUDA kernels for flash attention, AdamW, and more. Inspirational project for anyone who wants to better understand how to build LLMs.
Sources: tweet
Sign in as a member to join the conversation.
Loading comments…
Get the latest AI insights delivered to your inbox. No spam, unsubscribe anytime.
Search across events, members, and blog posts