Uljad Berdica
@uljadb99
AI @ Oxford. Stand-up Comedy. And Food I like.
Unlock real diversity in your LLM! 🚀 LLM outputs can be boring and repetitive. Today, we release Intent Factored Generation (IFG) to: - Sample conceptually diverse outputs💡 - Improve performance on math and code reasoning tasks🤔 - Get more engaging conversational agents 🤖
1/ 🕵️ Algorithm discovery could lead to huge AI breakthroughs! But what is the best way to learn or discover new algorithms? I'm so excited to share our brand new @rl_conference paper which takes a step towards answering this! 🧵
This paper scores very highly on the simplicity / ability Pareto frontier.
Standard: prompt -> response ✖️ (lacks diversity) Ours: prompt -> intent, {prompt, intent} -> response ✔️ (high diversity and quality) Works out of the box everywhere we tried it. That's it. Use it if you aren't already.
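The two-stage factorisation above can be sketched as follows. Everything here is illustrative: `sample_llm` is a hypothetical stand-in for any chat-completion call (replaced by a canned stub so the sketch runs), and the temperatures are example values, not the paper's settings.

```python
import random

def sample_llm(prompt, temperature=1.0, stop=None):
    # Hypothetical stand-in for a real LLM call; returns canned strings
    # so the sketch is runnable without an API key.
    canned = {
        "intent": ["use modular arithmetic", "try a geometric argument"],
        "response": ["Answer: reduce everything mod 7 ...",
                     "Answer: inscribe the triangle in a circle ..."],
    }
    key = "intent" if stop == "\n" else "response"
    return random.choice(canned[key])

def ifg_sample(prompt, intent_temp=1.2, response_temp=0.8):
    # Stage 1: sample a short natural-language intent, typically at a
    # higher temperature, to get conceptual diversity.
    intent = sample_llm(f"{prompt}\nIntent:", temperature=intent_temp, stop="\n")
    # Stage 2: condition on both the prompt and the sampled intent,
    # typically at a lower temperature, to keep the response coherent.
    response = sample_llm(f"{prompt}\nIntent: {intent}\nResponse:",
                          temperature=response_temp)
    return intent, response
```

Sampling the intent and the response at different temperatures is what lets one model be diverse about *what* to say while staying precise about *how* to say it.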
Unlock the Hidden Diversity in Your Language Model. In our new paper, Intent Factored Generation (IFG), we propose an inference time method to increase the diversity of generations from LLMs. IFG leads to improvements in searching for solutions to maths and code problems. (1/6)
It’s truly a privilege to be able to wake up every morning, see where the latest intelligence frontier is, and help push it a little further.
Last #runconference at #ICML2025 (at least for me). Was great running with new and old friends this week, and looking forward to another set of runs at @RL_Conference in August!
Third #runconference! Last one (for me) tomorrow: join us at 7am at the tourist info booth just outside the convention center!
Really good point!
pass@1 becomes meaningless with composite systems, or with models like o1 that can perform pass@k internally. I think we should use (or rather, we'll end up with) pass@flops for test-time compute, or pass@time for time spent at test ☺️ This will allow for better comparisons.
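For context, the standard unbiased pass@k estimator (Chen et al., 2021) that pass@flops or pass@time would generalise: draw n samples, observe c correct ones, and estimate the chance that at least one of k random samples is correct. A minimal sketch:

```python
from math import comb

def pass_at_k(n, c, k):
    """Unbiased pass@k estimator: n samples drawn, c correct.
    Estimates P(at least one of k samples is correct)."""
    if n - c < k:
        # Fewer incorrect samples than k: every size-k subset
        # must contain a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)
```

A pass@flops-style metric would replace the fixed sample count k with a compute budget, so that a system spending its budget on internal search and one spending it on external resampling are scored on equal footing.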
.@OpenAI o1 has started rolling out to the API!
Shame on you @NeurIPSConf, I've got tons of shaming emails. @COLM_conf was amazing: it used AI to improve reviewing and it was great! @ReviewAcl spits blood to improve quality and reduce load. Here is the tale of this round of NeurIPS reviews, of automatic threats and disrespect
Exciting tool! Hyperparameters are there to be conquered.
🚀 Excited to announce Hyperoptax, a library for parallel hyperparameter tuning in JAX. Implements Grid, Random, and Bayesian search in pure JAX so that you can rapidly search across parameter configurations in parallel. 📦 pip install hyperoptax github.com/TheodoreWolf/h…
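The core idea of grid search can be sketched in a few lines. This is not Hyperoptax's actual API, just an illustration: in a pure-JAX library the map over configurations below would become a single vmap/pmap call instead of a Python loop.

```python
from itertools import product

def grid_search(objective, search_space):
    """Evaluate every configuration in the Cartesian product of the
    search space and return the (config, value) pair minimising the
    objective. Plain Python loop for illustration only."""
    keys = list(search_space)
    best_cfg, best_val = None, float("inf")
    for values in product(*(search_space[k] for k in keys)):
        cfg = dict(zip(keys, values))
        val = objective(**cfg)
        if val < best_val:
            best_cfg, best_val = cfg, val
    return best_cfg, best_val

# Toy usage: minimise a quadratic over learning rate and weight decay.
space = {"lr": [0.01, 0.1, 1.0], "wd": [0.0, 0.01]}
best, val = grid_search(lambda lr, wd: (lr - 0.1) ** 2 + (wd - 0.01) ** 2, space)
```

Because each configuration is evaluated independently, the loop is embarrassingly parallel, which is exactly what makes a vectorised JAX implementation attractive.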
SoReL and TOReL achieved: - Accurate regret estimation using only offline data 📊 - Competitive with the best online hyperparameter tuning methods 🏆 - All without requiring online interactions in the real environment 🛡️ Great work led by @ClarisseWibault and Mattie Fellows!
How can we bypass the need for online hyper-parameter tuning in offline RL? @FLAIR_Ox is introducing two fully offline algorithms: SOReL, for accurate offline regret approximation, and TOReL, for offline hyper-parameter tuning! arxiv.org/html/2505.2244…