Lisa Dunlap

@lisabdunlap

messin around with model evals @berkeley_ai | prev @lmarena_ai

Joined October 2021

275 Following

1K Followers

Lisa Dunlap Retweeted

RunLLM@RunLLM · Jul 18

🎬 AI Hot Take 🔥 "What makes a bad AI product is a product that's too focused on AI." - UC Berkeley Professor @profjoeyg 👀 Watch the full mini-documentary here! youtu.be/MyjT2nBpbE8

Lisa Dunlap@lisabdunlap · Jul 19

Of all the coding assistants I use, Gemini is by far the cheekiest. I said it was querying the wrong key and it replied: “You’re absolutely right. The devil’s in the details—and in this case, the detail is the capitalization.”

Lisa Dunlap Retweeted

Yutong Bai@YutongBAI1002 · Jun 27

What would a World Model look like if we start from a real embodied agent acting in the real world? It has to have: 1) A real, physically grounded and complex action space—not just abstract control signals. 2) Diverse, real-life scenarios and activities. Or in short: It has to…

Lisa Dunlap@lisabdunlap · Jun 19

Congrats Elon for snagging such a good dev

Tianle (Tim) Li@LiTianleli · Jun 18

Late, but big life update: I deferred my PhD and I started @xai a month ago as Member of Technical Staff, working on Post-training and Reasoning for @grok! I'm grateful for the chance to work alongside brilliant minds to solve the toughest problems standing between us and 🤖…

Lisa Dunlap@lisabdunlap · Jun 19

looking through some arena battles and came across this gem

Lisa Dunlap Retweeted

Mandi Zhao@ZhaoMandi · Jun 16

Our lab at Stanford usually do research in AI & robotics, but very occasionally we indulge in being functional alcoholics -- Recently we hosted a lab cocktail night, and created drinks with research-related puns like 'reviewer#2' and 'make 6 figures', sharing the full recipes…

Lisa Dunlap Retweeted

Aleksander Holynski@holynski_ · Jun 14

MegaSaM got an award! Big congrats to the team!!!!! 🥳🥳🎉🎉 @zhengqi_li, Richard, @forrestercole2, @jin_linyi, @QianqianWang5, Vickie @akanazawa, @Jimantha

Lisa Dunlap@lisabdunlap · Jun 13

Incredibly late to the game but if you are @CVPR come check out our poster on the vision arena (353) right now!

lmarena.ai@lmarena_ai · Jun 19, 2024

Exciting news - Chatbot Arena now supports image uploads📸 Challenge GPT-4o, Gemini, Claude, and LLaVA with your toughest questions. Plot to code, VQA, story telling, you name it. Let's get creative and have fun! Leaderboard coming soon. Credits to builders @chrischou03…

Lisa Dunlap Retweeted

Yutong Bai@YutongBAI1002 · Jun 12

I’m going to give a talk at AI4CC workshop in #CVPR2025 at 2 pm CDT, regarding the recent world model project we did, at the Karl F. Dean Ballroom (or Grand Ballroom) on the 4th floor, room A1. @CVPR ai4cc.net Come say hi if you want to discuss :)

Lisa Dunlap@lisabdunlap · Jun 11

At @CVPR ? Come see my talk on building evals which embrace the fuzziness of generative models at the EVAL-FoMo workshop today! This talk had everything - from Chatbot Arena to model vibes to designing UI's :P Details: June 11th, 4:30pm, room 210

Lisa Dunlap Retweeted

Grace Luo@graceluo_ · Jun 6

✨New preprint: Dual-Process Image Generation! We distill *feedback from a VLM* into *feed-forward image generation*, at inference time. The result is flexible control: parameterize tasks as multimodal inputs, visually inspect the images with the VLM, and update the generator.🧵

Lisa Dunlap Retweeted

Mir Miroyan@mirmiroyan · Jun 6

We release Search Arena 🌐 — the first large-scale (24k+) dataset of in-the-wild user interactions with search-augmented LLMs. We also share a comprehensive report on user preferences and model performance in the search-enabled setting. Paper, dataset, and code in 🧵
