Not that it ever left, but it surely seems Chinese language AI startup DeepSeek is again within the information — this time with an up to date model of its V3 mannequin, launched in December.
On Tuesday, the corporate formally introduced V3-0324, named after its launch month and day. A day earlier, folks observed DeepSeek had uploaded the brand new mannequin to HuggingFace, however with little extra data.
What’s new in DeepSeek’s V3-0324 mannequin?
Like R1 — DeepSeek’s top-performing mannequin launched in January and an OpenAI competitor — the brand new model is open supply (in that its weights are public, not its precise code) underneath an MIT license.
🚀 DeepSeek-V3-0324 is out now!
🔹 Main enhance in reasoning efficiency
🔹 Stronger front-end improvement abilities
🔹 Smarter tool-use capabilities
✅ For non-complex reasoning duties, we advocate utilizing V3 — simply flip off “DeepThink”
🔌 API utilization stays unchanged
📜 Fashions are… pic.twitter.com/QVuPwCODne— DeepSeek (@deepseek_ai) March 25, 2025
In a put up on X, DeepSeek famous that the replace reveals higher coding abilities for internet improvement and a “main enhance in reasoning efficiency,” but it surely nonetheless recommends it’s used for much less advanced reasoning duties. R1 stays the lab’s high reasoning mannequin, rating in fourth place on the Chatbot Enviornment.
DeepSeek stated the replace reveals improved efficiency over V3 on a number of industry-standard benchmarks, most notably the AIME (American Invitational Arithmetic Examination) math benchmark, scoring almost 20 factors increased.
Whereas benchmarks have turn out to be too simple for many fashions, an issue often called benchmark saturation, AIME remains to be thought of tougher than most. In January, Scale AI and the Middle for AI Security (CAIS) launched Humanity’s Final Examination to fight saturation.
That stated, as a result of it’s based mostly on highschool math content material, AIME’s solutions are publicly accessible on-line, that means they are often included in coaching information.
In accordance with DeepSeek, different enhancements embrace “enhanced” writing model and improved high quality, particularly for longer-form content material. Some Reddit commenters are speculating that the discharge of the improve might foreshadow the arrival of R2, which is anticipated to be as disruptive as R1.
Methods to strive DeepSeek’s V3-0324 mannequin
You possibly can entry V3-0324 now by way of HuggingFace or instantly by means of DeepSeek’s web site and app, although it’s possible you’ll wish to think about the most important safety holes and consumer privateness issues first. Whereas V3 and R1 proved to be very simply and dangerously jailbroken, it is unclear as of now whether or not DeepSeek added any layers of safety in V3-0324.
Need extra tales about AI? Join Innovation, our weekly e-newsletter.