Manos Zaranis

@ManosZaranis

PhD student @istecnico | 2020 Alumni ECE NTUA

Joined March 2018

385Following

103Followers

Pinned

Manos Zaranis@ManosZaranis · Jun 23

🚨Meet MF²: Movie Facts & Fibs: a new benchmark for long-movie understanding! 🤔Do you think your model understands movies? Unlike existing benchmarks, MF² targets memorable events, emotional arcs 💔, and causal chains 🔗 — things humans recall easily, but even top models like…

ManosZaranis's tweet image. 🚨Meet MF²: Movie Facts &amp; Fibs: a new benchmark for long-movie understanding!
🤔Do you think your model understands movies?

Unlike existing benchmarks, MF² targets memorable events, emotional arcs 💔, and causal chains 🔗 — things humans recall easily, but even top models like…

9.0K

Pinned

Manos Zaranis@ManosZaranis · Dec 2

The second, even better and bigger model is now out: EuroLLM-9B 🇪🇺 Ranks as the best open EU-made LLM of its size, proving competitive or superior when going up against models like Meta's Llama 3.1, Qwen 2.5, and Google's Gemma-2. Blog post & models: lnkd.in/d9JJvmd7

AAK@_akhaliq · Sep 25

EuroLLM Multilingual Language Models for Europe The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs…

2.0K

Manos Zaranis@ManosZaranis · Jun 23

🚀 Big news! Tower+ is here — our strongest open-weight multilingual model yet!

RRicardo Rei@RicardoRei7 · Jun 23

🚀 Tower+: our latest model in the Tower family — sets a new standard for open-weight multilingual models! We show how to go beyond sentence-level translation, striking a balance between translation quality and general multilingual capabilities. 1/5 arxiv.org/pdf/2506.17080

120

Manos Zaranis@ManosZaranis · Jun 21

Check out TREQA! TL;DR: We evaluate translation quality of complex content through QA using LLMs.

PPatrick Fernandes@psanfernandes · May 16

MT metrics excel at evaluating sentence translations, but struggle with complex texts We introduce *TREQA* a framework to assess how translations preserve key info by using LLMs to generate & answer questions about them arxiv.org/abs/2504.07583 (co-lead @swetaagrawal20) 1/15

101

Manos Zaranis Retweeted

José Maria Pombal@zmprcp · Apr 2

New paper out 🚀 Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models: arxiv.org/abs/2504.01001. We present a framework and release a repository for creating reliable benchmarks for (V)LM tasks quickly and fully automatically.

1.0K

Manos Zaranis@ManosZaranis · Feb 5

The position is advertised for 12 months, but it has the possibility of a further 2-year extension

NNafise Sadat Moosavi@NafiseSadat · Feb 4

🚀 Postdoc Opportunity! 🚀 With @benbenhh & in collaboration with @inuikentaro, we’re hiring a Postdoc at @sheffieldNLP for Uncertainty Quantification in Foundation Models, with a chance to spend time at RIKEN, Japan! 📅 Deadline: 2 March 🔗 More info: jobs.ac.uk/job/DLS026/res…

2.0K

Manos Zaranis Retweeted

Saul Santos@Saul_Santos1997 · Feb 3

🚀 New paper alert! 🚀 Ever tried asking an AI about a 2-hour movie? Yeah… not great. Check: ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation! 🔗 arxiv.org/abs/2501.19098 w/ @tozefarinhas , @mcneural_ , @andre_t_martins

944

Manos Zaranis@ManosZaranis · Dec 2

We built the best EU-made LLM of its size! It supports all EU languages (and more), and beats Meta's Llama-3.1 on multilingual benchmarks. Congrats to everyone involved; super proud of this work.

UUnbabel@Unbabel · Dec 2

🚀We’re proud to launch EuroLLM, a multilingual model supporting all 24 EU languages! Developed with @istecnico, @EdinburghUni, and @UnivParisSaclay on the MareNostrum supercomputer, its set to be a game changer for AI innovation. 👉Learn more - hubs.li/Q02ZZRkl0

453