Tyler Chang
@tylerachang
Research scientist @GoogleDeepMind. He/him/his.
We're organizing a shared task to develop a multilingual physical commonsense reasoning evaluation dataset! Details on how to submit are at: sigtyp.github.io/st2025-mrl.html
As part of the workshop, we are also organizing a shared task to collaboratively develop a multilingual physical commonsense reasoning evaluation dataset. See the shared task page for more information: sigtyp.github.io/st2025-mrl.html
Excited to announce the call for papers for the Multilingual Representation Learning workshop #EMNLP2025 sigtyp.github.io/ws2025-mrl.html with @_dataman_ @linguist_cat Jiayi Wang @fdschmidt @tylerachang @hila_gonen and amazing speakers: Alice Oh, Kelly Marchisio, & Pontus Stenetorp
The call for papers is out for the 5th edition of the Workshop on Multilingual Representation Learning, which will take place in Suzhou, China, co-located with EMNLP 2025! See details below!
Presenting our work on training data attribution for pretraining this morning: iclr.cc/virtual/2025/p… -- stop by Hall 2/3, poster #526, if you're here at ICLR!
We scaled training data attribution (TDA) methods ~1000x to find influential pretraining examples for thousands of queries in an 8B-parameter LLM over the entire 160B-token C4 corpus! medium.com/people-ai-rese…
One of the major pieces of feedback that we got on the last Turing test is that it was "too easy" because it used a 2-player format where you just speak to *either* a human or a model. We've revamped the site to make it more similar to Turing's original setup:
turingtest.live now uses a 3-party format, where you chat with a human and an AI simultaneously. Can you tell them apart? Live now, and every day from 1–2 PM & 8–9 PM GMT, at turingtest.live.
✨New pre-print!✨Successful language technologies should work for a wide variety of languages. But some languages have systematically worse performance than others. In this paper we ask whether performance differences are due to morphological typology. Spoiler: I don’t think so!
My paper with @tylerachang, “When is Multilinguality a Curse?”, was awarded outstanding paper! Thank you @emnlpmeeting ❤️
Announcing the 20 **Outstanding Papers** for #EMNLP2024
Our paper “When is Multilinguality a Curse?” will be presented at #EMNLP2024! We found that multilingual data hurts high-resource language performance, but improves low-resource performance as much as increasing training data by 33% @tylerachang arxiv.org/pdf/2311.09205
Super excited to finally release the Goldfish models, joint work with @tylerachang: small, comparable language models for 350 languages, including the first dedicated monolingual models for many of these languages. huggingface.co/goldfish-models
New preprint with @tylerachang and Benjamin Bergen! We find that some languages need up to five times as many bytes to convey the same amount of information: arxiv.org/pdf/2403.00686…