Dilara Soylu
@dilarafsoylu
PhD student @StanfordNLP
Prompting Llama 3.1 70B with the prefix “Mr and Mrs. D” can seed the generation of a near-exact copy of the entire ~300-page book ‘Harry Potter & the Sorcerer’s Stone’ 🤯 We define a “near-copy” as text that is identical modulo minor spelling / punctuation variations. Below…
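For illustration, a “near-copy” check along those lines might look like the sketch below; the normalization and the 0.99 threshold are my assumptions, not the authors’ exact metric.

```python
import re
from difflib import SequenceMatcher

def is_near_copy(generated: str, reference: str, threshold: float = 0.99) -> bool:
    """Hedged sketch of a 'near-copy' test: strip punctuation and case,
    then require the word sequences to be almost identical. The 0.99
    threshold is an assumption, not the authors' number."""
    normalize = lambda s: re.sub(r"[^\w\s]", "", s.lower()).split()
    similarity = SequenceMatcher(None, normalize(generated), normalize(reference)).ratio()
    return similarity >= threshold
```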
🔄 We were nominated for Oral + Top-1 in the MATH-AI workshop at #ICML! 🚨Why? ≈46% of GitHub commits are AI-generated, but can we verify they’re correct? 📢 VeriBench challenges agents to turn Python into Lean code! 🧵1/14 📃 Paper: openreview.net/forum?id=rWkGF…
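To give a feel for the task (my toy example, not one from the paper): a VeriBench-style agent takes a small Python function and must produce a Lean rendering, ideally with a provable specification.

```lean
-- Python source (toy): def abs_val(x): return x if x >= 0 else -x
def absVal (x : Int) : Int := if x ≥ 0 then x else -x

-- A spec an agent might also be asked to prove about its translation:
theorem absVal_nonneg (x : Int) : 0 ≤ absVal x := by
  unfold absVal
  split <;> omega
```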
SmolLM3 uses the APO preference loss! @KarelDoostrlnck great to see APO getting more adoption!
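For reference, a minimal sketch of the APO-zero objective as I read it from the APO paper; the exact form and the β default here are my paraphrase, not SmolLM3’s training code.

```python
import torch

def apo_zero_loss(logratio_chosen: torch.Tensor,
                  logratio_rejected: torch.Tensor,
                  beta: float = 0.1) -> torch.Tensor:
    """Hedged sketch of APO-zero: unlike DPO, which only scores the
    *difference* between chosen and rejected, APO-zero anchors each side,
    pushing the chosen response's likelihood up and the rejected one's
    down independently. logratio_* = log pi_theta(y|x) - log pi_ref(y|x),
    summed over response tokens."""
    up = torch.sigmoid(beta * logratio_chosen)      # want this -> 1
    down = torch.sigmoid(beta * logratio_rejected)  # want this -> 0
    return (1.0 - up + down).mean()
```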
Everything you need to know is in our engineering blueprint
🎉 Excited to announce that the 4th HCI+NLP workshop will be co-located with @EMNLP in Suzhou, China! 🌍📍 Join us to explore the intersection of human-computer interaction and NLP. 🧵 1/
Are AI scientists already better than human researchers? We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs human experts. Main finding: LLM ideas result in worse projects than human ideas.
Calling learning natural-language rules “not real learning” is so backwards. Interacting with an environment to generate abstract hypotheses and turn them into actionable natural-language rules is as much “learning” as the word’s natural connotations get. Though gradient-based…
A few years ago, people dismissed fine-tuning: “You’re just tweaking a trained model—that’s incremental.” Now they say the same about prompt learning. Before that, they dismissed model training itself. Funny how every learning paradigm shift starts as “not real research.”
New Paper Day! For ACL Findings 2025: You should **drop dropout** when you are training your LMs AND MLMs!
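Concretely (my illustration, not the paper’s code), “dropping dropout” just means zeroing the dropout probabilities before training from scratch; the config keys below are the standard Hugging Face names for BERT-style models.

```python
from transformers import AutoConfig, AutoModelForMaskedLM

# Zero out every dropout probability before training from scratch.
# (GPT-2-style configs name these resid_pdrop / embd_pdrop / attn_pdrop.)
config = AutoConfig.from_pretrained("bert-base-uncased")
config.hidden_dropout_prob = 0.0
config.attention_probs_dropout_prob = 0.0

model = AutoModelForMaskedLM.from_config(config)  # fresh MLM, no dropout
```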
Your language model is wasting half of its layers just refining probability distributions rather than doing interesting computations. In our paper, we found that the second half of the layers of the Llama 3 models has minimal effect on future computations. 1/6
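One way to poke at this claim yourself (a rough probe I’m sketching, not the paper’s methodology): replace the second half of the decoder layers with pass-throughs and check how little the next-token prediction moves. The tuple-return convention below matches recent transformers versions of the Llama decoder layer, but that’s an assumption.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-3.1-8B"  # stand-in; any Llama 3 checkpoint
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

class SkipLayer(torch.nn.Module):
    """Pretend this decoder layer was deleted: return hidden states as-is."""
    def forward(self, hidden_states, *args, **kwargs):
        return (hidden_states,)  # HF Llama decoder layers return a tuple

n = len(model.model.layers)
for i in range(n // 2, n):       # ablate the second half of the stack
    model.model.layers[i] = SkipLayer()

inputs = tok("The capital of France is", return_tensors="pt")
with torch.no_grad():
    next_id = model(**inputs).logits[0, -1].argmax()
print(tok.decode(next_id))       # compare against the unablated model
```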
Generalized to a recursive DSPy program: Takes *arbitrarily long* text. Builds a ToC for it, assigns chunks to sections, and, uh, just recursively handles each section in parallel. Not pseudocode. This is really a complete general-purpose summarizer for arbitrarily long text.
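A hedged sketch of what such a program can look like in DSPy; the signatures, chunking, and recursion threshold are mine, not the author’s actual script, and where the real one handles sections in parallel this sketch recurses sequentially.

```python
import dspy

class BuildToC(dspy.Signature):
    """Propose a table of contents (section titles) for a long text."""
    text: str = dspy.InputField()
    sections: list[str] = dspy.OutputField()

class Summarize(dspy.Signature):
    """Summarize a passage in a few sentences."""
    text: str = dspy.InputField()
    summary: str = dspy.OutputField()

class RecursiveSummarizer(dspy.Module):
    def __init__(self, max_chars: int = 4000):
        super().__init__()
        self.toc = dspy.Predict(BuildToC)
        self.leaf = dspy.Predict(Summarize)
        self.max_chars = max_chars

    def forward(self, text: str) -> str:
        if len(text) <= self.max_chars:        # base case: short enough
            return self.leaf(text=text).summary
        sections = self.toc(text=text).sections or ["all"]
        # Naive assignment: split the text evenly, one chunk per section.
        # (A real program would assign chunks to sections by content.)
        k = max(1, len(text) // len(sections))
        chunks = [text[i:i + k] for i in range(0, len(text), k)]
        parts = [self.forward(c) for c in chunks]  # recurse per chunk
        return self.leaf(text="\n\n".join(parts)).summary
```

Configure an LM first (e.g. `dspy.configure(lm=dspy.LM(...))`), then call `RecursiveSummarizer()(long_text)`.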
.@damekdavis generously collected this dump of DSPy docs. But at 600k characters & with no structure, it's tough for LLMs! I wrote a quick-n-dirty DSPy script to structure it losslessly into 250k characters. (Should I turn my script into a tutorial?) gist.github.com/okhat/a68645bc…
DSPy's biggest strength is also the reason it can admittedly be hard to wrap your head around. It basically says: LLMs & their methods will continue to improve, but not equally in every axis, so: - What's the smallest set of fundamental abstractions that allow you to build…
Is this guy talking about DSPy?
After working with GRPO, LLM judges, and optimizers, I'm starting to think that we don't need RL, just dynamic optimized prompting and iterative SFT that can be called when optimization plateaus. Should be faster, and optimization can be done on different models.
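The loop being proposed, as I read it (a sketch with hypothetical stand-in callables, not anyone’s shipped code):

```python
def train(program, optimize_prompts, run_sft, evaluate,
          rounds: int = 5, patience: int = 2):
    """Alternate prompt optimization with SFT: keep optimizing prompts,
    and fall back to an SFT round only once the metric plateaus.
    All four callables are hypothetical stand-ins."""
    best, stall = float("-inf"), 0
    for _ in range(rounds):
        program = optimize_prompts(program)   # e.g. a DSPy prompt optimizer
        score = evaluate(program)             # task metric on a dev set
        if score > best:
            best, stall = score, 0
        else:
            stall += 1
        if stall >= patience:                 # plateaued: do an SFT round
            program = run_sft(program)
            stall = 0
    return program
```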
With DSPy + Arbor, running RL on small local models is very doable with < 50 lines of code. We’re in the very early innings and there are so many improvements to be made!
Still, super interesting setup. Running RL on small local models (Qwen 1.7B) for structured LLM agents is very doable now. No massive infra, no crazy hacks. Just nice abstractions.
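A loudly hedged sketch of that setup; class locations, the Arbor endpoint, and GRPO’s constructor arguments shift between DSPy versions, so treat every name below as an assumption rather than a recipe.

```python
import dspy
from dspy.teleprompt.grpo import GRPO  # assumption: experimental location

# Assumption: an Arbor server is running locally, serving Qwen 1.7B
# through an OpenAI-compatible endpoint.
lm = dspy.LM(
    "openai/arbor:Qwen/Qwen3-1.7B",
    api_base="http://localhost:7453/v1/",
    api_key="arbor",
)
dspy.configure(lm=lm)

program = dspy.ChainOfThought("question -> answer")

def exact_match(example, prediction, trace=None):
    # Simple reward: 1.0 if the answer string matches exactly, else 0.0.
    return float(example.answer.strip() == prediction.answer.strip())

trainset = [
    dspy.Example(question="What is 2 + 2?", answer="4").with_inputs("question"),
]

# Assumption: GRPO takes a metric and compiles like other teleprompters.
optimizer = GRPO(metric=exact_match)
optimized = optimizer.compile(program, trainset=trainset)
```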