Ricardo Rei
@RicardoRei7
Head of AI Research @swordhealth
EuroBERT is going to @COLM_conf 2025! Can’t wait to be in Montreal with @gisship and @DuarteMRAlves to see all the great research everyone’s bringing!
We just released M-Prometheus, a suite of strong open multilingual LLM judges at 3B, 7B, and 14B parameters! Check out the models and training data on Huggingface: huggingface.co/collections/Un… and our paper: arxiv.org/abs/2504.04953
Happy to share our new work: ZSB is a framework to create benchmarks for any task and automatically evaluate any LLM. We show that it correlates highly with human evaluation on some of the most widely used benchmarks, such as Chatbot Arena.
New paper out 🚀 Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models: arxiv.org/abs/2504.01001. We present a framework and release a repository for creating reliable benchmarks for (V)LM tasks quickly and fully automatically.
🧵 (3/7) 🌐 EuroBERT is open-source: 👉 Models (210M, 610M, 2.1B params) 👉 Training snapshots 👉 Full training framework Explore here: huggingface.co/EuroBERT Code coming soon! github.com/Nicolas-BZRD/E…
🇪🇺 One month after the AI Action Summit 2025 in Paris, I am thrilled to announce EuroBERT, a family of multilingual encoders exhibiting the strongest multilingual performance for tasks such as retrieval, classification and regression over 15 languages, mathematics and code. ⬇️ 1/6
🚨 New machine translation dataset alert! 🚨 We expanded the language coverage of WMT24 from 9 to 55 en->xx language pairs by collecting new reference translations for 46 languages in a dataset called WMT24++. Paper: arxiv.org/abs/2502.12404… Data: huggingface.co/datasets/googl…
Good to see @EU_Commission promoting OS LLMs in Europe. However (1) "OpenEuroLLM" is appropriating a name (#EuroLLM) which already exists, (2) it is certainly *not* the "first family of open-source LLMs covering all EU languages" 🧵
AI made in 🇪🇺 OpenEuroLLM, the first family of open source Large Language Models covering all EU languages, has earned the first STEP Seal for its excellence. It brings together EU startups, research labs and supercomputing hosts to train AI on European supercomputers ↓
EuroLLM-9B is now ranking as the best LLM of its size (tied with Gemma) on the European LLM Leaderboard! Check it here: huggingface.co/spaces/openGPT…
Today we release EuroLLM-9B: the best EU-made multilingual LLM of its size! Check the blog post for more info and results: huggingface.co/blog/eurollm-t…. Stay tuned for the technical report and bigger and more powerful models!
I'm also happy to talk about widn.ai and #EuroLLM, feel free to DM me! 🚀
3) We have another poster in the main conference, “QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation” with @goncalorafaria @swetaagrawal20 @tozefarinhas @RicardoRei7 @accezz (Thursday Dec 12 16:30-19:30).
Extremely happy with the release of EuroLLM 9B and with its results! This model is truly Multilingual, supporting 35 languages and covering all European Union official languages! Follow the 🧵 below for more details..👇👇
💻 Thank you @slatornews for inviting Unbabel to speak on the current state of AI in translations. Key takeaways: 🔶 LLMs will continue to outperform NMT 🔶 LLM research will become evaluation centric 🔶 MT 2.0 will focus more on personalization and transcreation
Super excited to announce the launch of Widn.ai! 🎉 This results from several years of work in Translation Evaluation and the application of that knowledge to LLM research, leading to the creation of Tower. Now, Tower is accessible and ready for anyone!
💥 Today we’re excited to announce the launch of hubs.li/Q02Y2GpL0 - our new standalone AI solution built for businesses looking to scale quickly with cost-effective translations you can trust. 👇 Learn more about Widn and try it for free. hubs.li/Q02Y2G4q0
In the morning, we will also present our work on Tower v2, which ranked first in 8/11 language pairs on the WMT 2024 General shared task! "Tower-v2: Unbabel-IST 2024 Submission for the General MT Shared Task", WMT, Nov 15, 11:00-12:00
@nunonmg and I will be giving a keynote at WMT this Friday at 14:00 about why research on MT is still fascinating in the era of LLMs! Don’t miss it!
Also, don't miss @RicardoRei7 and @nunonmg keynote at WMT Friday Nov 15 14:00-15:00: "What Makes MT Research Special in the LLM Age?"