Teddi Worledge
@TeddiWorledge
(she/her) Computer Science PhD Student @Stanford. Formerly @Berkeley.
🧵LLMs are great at synthesizing info, but unreliable at citing sources. Search engines are the opposite. What lies between them? Our new paper runs human evals on 7 systems across the ✨extractive-abstractive spectrum✨ for utility, citation quality, time-to-verify, & fluency!

Prior work has used LLMs to simulate survey responses, yet their ability to match the distribution of views remains uncertain. Our new paper [arxiv.org/pdf/2411.05403] introduces a benchmark to evaluate how distributionally aligned LLMs are with human opinions. 🧵
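For intuition on what "distributionally aligned" means here, a toy sketch (hypothetical numbers, not the paper's benchmark or metric): compare an LLM's distribution over survey answer options to the human response distribution, e.g. with total variation distance.

```python
import numpy as np

def total_variation(p, q):
    """Total variation distance between two discrete distributions."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    return 0.5 * np.abs(p / p.sum() - q / q.sum()).sum()

# Hypothetical example: human vs. LLM answer shares for a 4-option survey question.
human_dist = [0.10, 0.35, 0.40, 0.15]   # fraction of human respondents per option
model_dist = [0.05, 0.20, 0.60, 0.15]   # LLM's probability mass per option

print(f"TV distance: {total_variation(human_dist, model_dist):.3f}")  # 0.200
```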
Can interpretability help defend LLMs? We find we can reshape activations while preserving a model’s behavior. This lets us attack latent-space defenses, from SAEs and probes to Circuit Breakers. We can attack so precisely that we make a harmfulness probe output this QR code. 🧵
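A toy illustration of the underlying idea, not the paper's actual attack: optimize a small perturbation to a hidden activation so a linear harmfulness probe scores it as benign, while a retention term keeps the activation close to the original so downstream behavior changes little. All tensors below are random placeholders.

```python
import torch

torch.manual_seed(0)
d = 512
h_orig = torch.randn(d)                    # original hidden activation
probe_w = torch.randn(d); probe_b = 0.0    # linear probe: sigmoid(w·h + b)

delta = torch.zeros(d, requires_grad=True)
opt = torch.optim.Adam([delta], lr=1e-2)

for step in range(500):
    h = h_orig + delta
    probe_score = torch.sigmoid(probe_w @ h + probe_b)   # drive this toward 0 ("benign")
    retention = (delta ** 2).mean()                      # keep activations close to original
    loss = probe_score + 10.0 * retention
    opt.zero_grad(); loss.backward(); opt.step()

print(f"probe score after attack: {torch.sigmoid(probe_w @ (h_orig + delta)).item():.3f}")
```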
Many providers offer inference APIs for the same models: for example, there were over nine Llama-3 8B APIs in Summer 2024. Do all of these APIs serve the same completion distribution as the original model? In our new paper, ✨Model Equality Testing: Which Model is This API…
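To make the question concrete, a hedged sketch of the two-sample framing: collect some per-completion statistic from the reference model and from the API, then test whether the two samples could come from the same distribution. The paper's actual test statistic differs; this permutation test on synthetic numbers is only for illustration.

```python
import numpy as np

def permutation_test(samples_a, samples_b, stat=np.mean, n_perm=10_000, seed=0):
    """Two-sample permutation test: p-value under H0 that both samples
    come from the same distribution, using |stat(a) - stat(b)|."""
    rng = np.random.default_rng(seed)
    a, b = np.asarray(samples_a, float), np.asarray(samples_b, float)
    observed = abs(stat(a) - stat(b))
    pooled = np.concatenate([a, b])
    count = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        perm_a, perm_b = pooled[:len(a)], pooled[len(a):]
        count += abs(stat(perm_a) - stat(perm_b)) >= observed
    return (count + 1) / (n_perm + 1)

# Hypothetical: per-completion log-probabilities sampled from the reference
# model vs. from an API claiming to serve the same model.
ref_stats = np.random.default_rng(1).normal(-40.0, 5.0, size=200)
api_stats = np.random.default_rng(2).normal(-38.0, 5.0, size=200)
print(f"p-value: {permutation_test(ref_stats, api_stats):.4f}")
```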
Announcing a deadline extension for the ATTRIB workshop! Submissions are now due September 25th, with an option to submit October 4th if at least one paper author volunteers to be an emergency reviewer. More info here: attrib-workshop.cc
come to twitter for the presidential debate gossip. stay for the perplexity correlations.
Do you want to select great LLM pretraining data but don’t have 1000 H100s for a ton of mixture experiments? What about a method that requires none of your own training, matches the best known existing method, and has some nice theory? New preprint: Perplexity Correlations
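Rough sketch of the idea as stated in the thread, with made-up data (the paper's actual estimator and selection rule differ): correlate each domain's log-perplexity, measured across existing public models, with those models' benchmark scores, then keep the domains where lower perplexity tracks better downstream performance.

```python
import numpy as np
from scipy.stats import spearmanr

# Hypothetical setup: for N existing public models we have (a) per-domain
# log-perplexities and (b) a downstream benchmark score per model.
rng = np.random.default_rng(0)
n_models, n_domains = 30, 100
domain_logppl = rng.normal(size=(n_models, n_domains))   # models x domains
bench_score = rng.normal(size=n_models)                  # one score per model

# Rank correlation between each domain's perplexity and benchmark performance.
corr = np.array([
    spearmanr(domain_logppl[:, j], bench_score)[0]
    for j in range(n_domains)
])

# Most negative correlations: domains where lower perplexity tracks higher scores.
selected = np.argsort(corr)[:20]
print("selected domain indices:", selected)
```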
Automating AI research is exciting! But can LLMs actually produce novel, expert-level research ideas? After a year-long study, we obtained the first statistically significant conclusion: LLM-generated ideas are more novel than ideas written by expert human researchers.
I’m fighting… against vague notions of LLM attributions. 😤 Check out our paper (w. @TeddiWorledge, Nicole, Caleb and Carlos) here: arxiv.org/abs/2311.12233
In today's last session on forensic analysis of ML systems, @judyhshen kicks it off by presenting their SoK on attribution in LLMs.
What happens when RAG models are provided with documents that have conflicting information? In our new paper, we study how LLMs answer subjective, contentious, and conflicting queries in real-world retrieval-augmented situations.
if you care about pruning LLMs, you should check out our new paper!! this was a fun project, and i'm grateful to have gotten the chance to work with this fantastic group of people. see the thread below for more👇
Do LLMs really need to be so L? That's a rejected title for a new paper w/ @Andr3yGR, @kushal_tirumala, @Hasan_Shap, @PaoloGlorioso1 on pruning open-weight LLMs: we can remove up to *half* the layers of Llama-2 70B w/ essentially no impact on performance on QA benchmarks. 1/
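For a feel of what removing layers looks like in practice, a minimal sketch on a Llama-style Hugging Face checkpoint. The block of layers dropped below is arbitrary; the paper's criterion for choosing which layers to remove (and any light post-pruning finetuning) isn't shown here.

```python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", torch_dtype=torch.bfloat16
)

drop_start, drop_end = 20, 28   # hypothetical contiguous block of decoder layers to remove
layers = model.model.layers
kept = torch.nn.ModuleList(
    [layer for i, layer in enumerate(layers) if not (drop_start <= i < drop_end)]
)
model.model.layers = kept
model.config.num_hidden_layers = len(kept)

# Keep KV-cache layer indexing consistent after removal (attribute exists in recent transformers).
for i, layer in enumerate(kept):
    if hasattr(layer.self_attn, "layer_idx"):
        layer.self_attn.layer_idx = i

print(f"kept {len(kept)} of {len(layers)} decoder layers")
```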
Got a pipeline with **multiple prompts**, like a DSPy program? What's the right way to jointly optimize these prompts? Introducing MIPRO, a Multi-prompt Instruction Proposal Optimizer. We integrated MIPRO into DSPy. It can deliver +11% gains over existing DSPy optimizers! 🧵👇
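A hedged usage sketch of jointly optimizing a two-prompt DSPy program with MIPRO, assuming the MIPROv2 interface shipped in recent DSPy releases (class and argument names may differ by version); the model name, metric, and examples below are placeholders.

```python
import dspy
from dspy.teleprompt import MIPROv2

dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))  # any supported LM

class TwoPromptQA(dspy.Module):
    """A pipeline with two prompts: summarize the context, then answer."""
    def __init__(self):
        super().__init__()
        self.summarize = dspy.ChainOfThought("context -> summary")
        self.answer = dspy.ChainOfThought("summary, question -> answer")

    def forward(self, context, question):
        summary = self.summarize(context=context).summary
        return self.answer(summary=summary, question=question)

def exact_match(example, pred, trace=None):
    return example.answer.strip().lower() == pred.answer.strip().lower()

trainset = [  # toy placeholder examples
    dspy.Example(context="...", question="...", answer="...").with_inputs("context", "question"),
]

# MIPRO proposes candidate instructions (and few-shot demos) for *both* prompts,
# then searches over joint combinations instead of tuning each prompt in isolation.
optimizer = MIPROv2(metric=exact_match, auto="light")
optimized = optimizer.compile(TwoPromptQA(), trainset=trainset)
```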
What types of attributions do modern LLM applications require? Check out our contributed talk [Friday, 10:30am] by @TeddiWorledge at the ATTRIB23 workshop [Rm 271-273] on "Unifying Corroborative and Contributive Attributions in Large Language Models" arxiv.org/abs/2311.12233