Jia Li
@JiaLi52524397
Co-founder at Numina; former AI scientist at Mistral AI; Co-founder & ex CSO at Cardiologs
Happy to introduce Kimina-Prover-72B ! Reaching 92.2% on miniF2F using Test time RL. It can solve IMO problems using more than 500 lines of Lean 4 code ! Check our blog post here: huggingface.co/blog/AI-MO/kim… And play with our demo ! demo.projectnumina.ai

Impressive result. Absolutely great team work, given the resources compared to OAI and GDM. Congrats @huajian_xin
Lovely to see the impressive performance of the Seed Prover developed by the ByteDance Seed team at IMO 2025 — achieving a silver-level score (30 out of 42) within three days, and reaching (35 out of 42) with extended compute time. leanprover.zulipchat.com/#narrow/channe…
Impressive result ! High performance with low pass rate. Congrats to the Goedel prover team
(1/4)🚨 Introducing Goedel-Prover V2 🚨 🔥🔥🔥 The strongest open-source theorem prover to date. 🥇 #1 on PutnamBench: Solves 64 problems—with far less compute. 🧠 New SOTA on MiniF2F: * 32B model hits 90.4% at Pass@32, beating DeepSeek-Prover-V2-671B’s 82.4%. * 8B > 671B: Our 8B…
Hello World! 👋 We're thrilled to officially launch the X account for Numina, dedicated to advancing frontier AI in mathematics. Stay tuned for updates on our research, achievements, and the future of mathematical AI! #AI4Math #FormalMath #LeanProver #AutomatedReasoning…
Combinatorics are the two last problems unsolved by AlphaProof at last year's IMO. Introducing CombiBench @Kimi_Moonshot , a benchmark focusing on combinatorics problems ! 🔥 🏆moonshotai.github.io/CombiBench/ 📘Dataset -> huggingface.co/datasets/AI-MO…

The Kimina Team (Numina & @Kimi_Moonshot collaboration) recently released Kimina-Prover Preview, achieving SOTA on miniF2F. We're now releasing a demo of our 72B model, and open-sourcing the Kimina Lean Server, which we used for our training pipeline!
Just released all the correct proofs and full thinking traces from Kimina-Prover Preview! 🧠📜 Explore them on GitHub: 🔗 github.com/MoonshotAI/Kim… Also, our arXiv preprint is live! If you find it helpful, consider citing us 🙏 📄 arxiv.org/abs/2504.11354
We believe formal math is the future. 🔥Introducing Kimina-Prover Preview, a Numina & @Kimi_Moonshot collaboration, the first large formal reasoning model for Lean 4, achieving 80.78% miniF2F. github.com/MoonshotAI/Kim…
Very proud of our new model, Kimina-Prover Preview! It’s the first large reasoning model for theorem proving, and achieves a SOTA on miniF2F (80%). I strongly believe in RL for formal mathematics. Here is why. 🧵
🔬 Sharing an early look at Kimina-Prover, our new Lean theorem proving model from our collaboration with Numina! @JiaLi52524397 🏆 Using an RL pipeline for proof exploration, Kimina-Prover Preview achieved 80.7% on the miniF2F — currently SOTA on this benchmark. We see promise…
We just published the second OpenR1 update with OpenR1-220k-Math, our new large-scale dataset for mathematical reasoning generated by DeepSeek R1. We generated 800k+ reasoning traces on 512 H100s in 3 days 🚀
🚀 NuminaMath 1.5 is here! 🚀 900k+ high-quality competition math problems with CoT solutions, new problem metadata, manually verified Olympiad problems, and more! 📚🏅 Check it out: 🔗 huggingface.co/datasets/AI-MO… Thanks to @Will424408 @dsleo
🚀 Introducing Goedel-Prover: A 7B LLM achieving SOTA open-source performance in automated theorem proving! 🔥 ✅ Improving +7% over previous open source SOTA on miniF2F 🏆 Ranking 1st on the PutnamBench Leaderboard 🤖 Solving 1.9X total problems compared to prior works on Lean…
Project Numina is thrilled to announce a €3m research grant from XTX Markets to support the development of open-source AI tools for mathematicians and general progress in AI reasoning! businesswire.com/news/home/2024…
magnet:?xt=urn:btih:7278e625de2b1da598b23954c13933047126238a&dn=pixtral-12b-240910&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce&tr=udp%3A%2F%https://t.co/2UepcMHjvL%3A1337%2Fannounce&tr=http%3A%2F%https://t.co/NsTRgy7h8S%3A80%2Fannounce