Patrick Fernandes
@psanfernandes
PhD Student @LTIatCMU & @istecnico. Previously: research @Google, @Microsoft & @Unbabel
*Human feedback* was the necessary secret sauce in making #chatgpt so human-like. But what exactly is feedback? And how can we leverage it to improve our models? Check out our new survey on the use of (human) feedback in Natural Language Generation! arxiv.org/abs/2305.00955 1/16

When it comes to text prediction, where does one LM outperform another? If you've ever worked on LM evals, you know this question is a lot more complex than it seems. In our new #acl2025 paper, we developed a method to find fine-grained differences between LMs: 🧵1/9
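
A minimal sketch of the underlying comparison (not the paper's method; the model names are placeholders): score the same text with two LMs and look at per-token log-probability gaps to see where one predicts better than the other.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def per_token_logprobs(model_name: str, text: str) -> list[float]:
    # log-probability the model assigns to each token given its prefix
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits
    logps = torch.log_softmax(logits[0, :-1], dim=-1)
    return logps.gather(1, ids[0, 1:, None]).squeeze(1).tolist()

text = "The committee adjourned the hearing until further notice."
gaps = [a - b for a, b in zip(per_token_logprobs("gpt2", text),
                              per_token_logprobs("distilgpt2", text))]
# a positive gap at position i means the first model predicts token i+1 better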
Very proud of this work with @psanfernandes @swetaagrawal20 @ManosZaranis @gneubig at @sardine_lab_it and @LTIatCMU. TL;DR: We evaluate the translation quality of complex content by checking question-answering invariance: a good translation should support the same answers as the original text.
MT metrics excel at evaluating sentence translations, but struggle with complex texts. We introduce *TREQA*, a framework to assess how translations preserve key info by using LLMs to generate & answer questions about them. arxiv.org/abs/2504.07583 (co-lead @swetaagrawal20) 1/15
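
A simplified sketch of the TREQA recipe (the prompts, judge model, and answer matching below are illustrative placeholders, not the paper's exact pipeline): generate questions from the reference, answer them from the candidate translation, and score agreement.

from openai import OpenAI

client = OpenAI()

def ask(prompt: str) -> str:
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed judge model, swap as needed
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

def treqa_style_score(reference: str, candidate: str, n_questions: int = 3) -> float:
    # 1) generate questions answerable from the reference
    raw = ask(f"Write {n_questions} short factual questions answerable from this text, "
              f"one per line:\n{reference}")
    qs = [q for q in raw.splitlines() if q.strip()][:n_questions]
    agree = 0
    for q in qs:
        # 2) answer each question from reference and from candidate
        gold = ask(f"Answer briefly using only this text:\n{reference}\n\nQ: {q}")
        pred = ask(f"Answer briefly using only this text:\n{candidate}\n\nQ: {q}")
        # 3) check whether the answers convey the same information
        verdict = ask("Do these answers convey the same information? Reply yes or no.\n"
                      f"A1: {gold}\nA2: {pred}")
        agree += verdict.strip().lower().startswith("yes")
    return agree / max(len(qs), 1)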
👉 slator.ch/QuestionAnswer… Two new research papers propose using question answering ❓ to evaluate #AI #translation, 🤖 challenging 🧐 how the language industry evaluates translation quality. @zoeykii @umdclip @MarineCarpuat @JohnsHopkins @psanfernandes @LTIatCMU @istecnico…
Come and chat with us about a powerful (but surprisingly underused) *test-time compute* scaling technique to improve your LLMs!
I will be presenting my #ICLR2025 Spotlight work “Better Instruction-Following Through Minimum Bayes Risk” today (Sat) at 3pm! Swing by #205 in Hall 3 to chat with me and @psanfernandes
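
A minimal sketch of MBR decoding (the paper's utility is an LLM judge; a crude string similarity stands in as a placeholder here): sample several outputs at temperature > 0, then return the one with the highest average utility against the rest, i.e. the "consensus" candidate.

from difflib import SequenceMatcher

def utility(hyp: str, ref: str) -> float:
    # placeholder utility; swap in an LLM judge or a learned metric
    return SequenceMatcher(None, hyp, ref).ratio()

def mbr_select(candidates: list[str]) -> str:
    # expected utility of a candidate = its average utility vs. the other samples
    def expected_utility(c: str) -> float:
        others = [o for o in candidates if o is not c]
        return sum(utility(c, o) for o in others) / max(len(others), 1)
    return max(candidates, key=expected_utility)

samples = ["The cat sat on the mat.", "A cat sat on a mat.", "Dogs bark."]
print(mbr_select(samples))  # picks the candidate closest to the consensus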
We just released M-Prometheus, a suite of strong open multilingual LLM judges at 3B, 7B, and 14B parameters! Check out the models and training data on Huggingface: huggingface.co/collections/Un… and our paper: arxiv.org/abs/2504.04953
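
A hedged usage sketch with transformers; the model ID and the free-form prompt below are assumptions, so check the Huggingface collection for the exact identifiers and the intended judge template.

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Unbabel/M-Prometheus-7B"  # assumed name; see the collection
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")  # needs accelerate

# illustrative prompt only; the released judges expect their own template
prompt = ("Evaluate the following response on a 1-5 scale and explain.\n"
          "Instruction: Translate 'Bonjour' to English.\n"
          "Response: Hello.")
inputs = tok(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=200)
print(tok.decode(out[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))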