rasdani
@rasdani_
Founding AI Engineer @ellamindAI, open source AI @DiscoResearchAI. abundance is upon us 🚀📈✨ ramen enjoyoor 🍜
🧑🏭 🌐 From the depths of Discord servers, where anon AI enthusiasts would mingle, to the forefront of European AI research @OpenEuroLLM. 🧑🔬🇪🇺 🙏 It is truly fulfilling to give back and build a strong foundation in Europe. 🦾🇪🇺 Come join us on this journey at @ellamindAI 🚀
1/4 🇪🇺 Big news! We're joining the @OpenEuroLLM project - Europe's largest open-source AI collaboration yet! Alongside 19 leading institutions, we'll develop next-gen multilingual LLMs that combine performance with European values.
Dropping tomorrow on MLST - the serious problems with Chatbot Arena. We will talk about the recent investment and the explosive paper from Cohere researchers which identified several significant problems with the benchmark.
Recipe to post-train Qwen3 1.7B into a DeepResearch model. What does it mean for something small to think deeply? Meet Lucy, a Qwen3‑1.7B post‑trained into a DeepResearch model, built on @willccbb's verifiers. Primary rule-based rewards: - Answer correctness: we check whether the…
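A rule-based answer-correctness reward like the one named above can be sketched as a simple verifier function. This is an illustrative sketch, not Lucy's actual code: `extract_answer` and `reward_correctness` are hypothetical names, and the `\boxed{…}` convention is just one common way final answers are marked.

```python
import re

def extract_answer(completion: str):
    """Pull the final answer out of a \\boxed{...} span, if present.
    (The \\boxed convention is an assumption for this sketch.)"""
    m = re.search(r"\\boxed\{([^}]*)\}", completion)
    return m.group(1).strip() if m else None

def reward_correctness(completion: str, gold: str) -> float:
    """Rule-based reward: 1.0 if the extracted answer matches the
    reference (case-insensitive exact match), else 0.0."""
    pred = extract_answer(completion)
    if pred is None:
        return 0.0
    return 1.0 if pred.lower() == gold.strip().lower() else 0.0
```

Exact match is the simplest verifier; real recipes usually add normalization (stripping units, parsing numbers) before comparing.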
I asked @OpenAI 's o3 and @GoogleDeepMind 's Gemini 2.5 Pro to evaluate OpenAI's and GDM's IMO 2025 solutions. Result? Narrow victory for OpenAI 🏆 cc @scaling01 @rfurmaniak @GregHBurnham @alexwei_ @tszzl some details below 👇
I have to say in many respects I've had more quality conversations in Berlin than in SF. Never has it happened in SF that people actually pull out pen and paper to talk precisely about hard concepts where surface level talk is just not enough. The overton window is wider and…
I realized at our Berlin event that there are a lot of talented and ambitious young ppl in Europe. Just (almost) no inspiring companies building the future, nor VCs that have the balls to give them a chance. No wonder everybody wants to come to SF.
Berlin I’m in you
📢 First release: 38 monolingual reference LLMs (2.15B params) via @hplt_eu + #OpenEuroLLM ⚙️ Trained on 100B tokens from the HPLT v2 dataset 🌍 Cover EU langs + others ⚙️ Based on LLaMA, trained on #LUMI 📈 Useful for evaluation Downloads + more info at openeurollm.eu/blog/hplt-oell…
The race for LLM "cognitive core" - a few billion param model that maximally sacrifices encyclopedic knowledge for capability. It lives always-on and by default on every computer as the kernel of LLM personal computing. Its features are slowly crystalizing: - Natively multimodal…
I’m so excited to announce Gemma 3n is here! 🎉 🔊Multimodal (text/audio/image/video) understanding 🤯Runs with as little as 2GB of RAM 🏆First model under 10B with @lmarena_ai score of 1300+ Available now on @huggingface, @kaggle, llama.cpp, ai.dev, and more
recorded my first @LeRobotHF dataset today 🦾🤖 I finally got my hands on it @ErikKaum 😊
v1.1 of RLHF book just pushed with the promised expansion on RLVR / reasoning models. * summary of the major reasoning model reports so far, * common practices/tricks and who used them, * related reasoning work before o1, * async RL, * other improvements. rlhfbook dot com
for those that are too lazy to read papers: Magistral is an RL-only model (think R1-Zero), no distillation from open-source traces of larger reasoning models. the Magistral paper reports results when SFT is done before RL. finally, this is a preview :)
Replay buffers are coming for your LM reinforcement learning recipes. Async is the way. Some early work on it! Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay Sun et al.
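The rollout-replay idea above can be sketched as a small buffer that stores recent rollouts and resamples them, so each expensive generation can contribute to more than one gradient update. This is a minimal illustrative sketch, not the paper's implementation; the class and method names are hypothetical.

```python
import random
from collections import deque

class RolloutReplayBuffer:
    """Store recent (prompt, completion, reward) rollouts and resample
    them, so async training can reuse rollouts across updates."""

    def __init__(self, capacity: int = 1024, seed: int = 0):
        self.buffer = deque(maxlen=capacity)  # oldest rollouts evicted first
        self.rng = random.Random(seed)

    def add(self, prompt: str, completion: str, reward: float) -> None:
        self.buffer.append((prompt, completion, reward))

    def sample(self, batch_size: int):
        """Uniformly resample up to batch_size stored rollouts."""
        k = min(batch_size, len(self.buffer))
        return self.rng.sample(list(self.buffer), k)
```

A real async recipe would also track rollout staleness (which policy version generated each sample) and weight or filter by difficulty, as the paper's title suggests.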
Our lunch breaks? More like AI power sessions: We skip the fluff & share real-world AI tips + highlights. Our founder just shared practical tips on mastering Claude Code, our employee of the month. Hearing practical use cases - like using Claude Code's "rewind" to perfect prompts…
Major reasoning models so far with technical reports (focused on those w/ RL):
2025-01-22 — DeepSeek R1 — arxiv.org/abs/2501.12948
2025-01-22 — Kimi 1.5 — arxiv.org/abs/2501.12599
2025-03-31 — Open-Reasoner-Zero — arxiv.org/abs/2503.24290
2025-04-10 — Seed 1.5-Thinking —…
Another generative / inference-time scaling reward modeling paper. It's the direction things are going.
Super excited to share 💪🧠Reasoning Gym! 🧵 We provide over 100 data generators and verifiers spanning several domains (algebra, arithmetic, code, geometry, logic, games) for training the next generation of reasoning models. In essence, we can generate an infinite amount of…
Introducing Reasoning Gym: Over 100 procedurally generated reasoning environments for evaluation and RLVR of language models. Generate virtually infinite training or evaluation data with fine-grained difficulty control and automatic verifiers. 🧵 1/
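The generator/verifier pattern described in the two posts above can be sketched as a pair of functions: a procedural generator with a difficulty knob, and an automatic verifier. This is an illustrative sketch in the spirit of Reasoning Gym, not the library's actual API; the names and the `n_digits` difficulty control are hypothetical.

```python
import random

def generate_addition_task(n_digits: int, seed: int) -> dict:
    """Procedurally generate an addition problem; n_digits controls
    difficulty, and a fixed seed makes generation reproducible."""
    rng = random.Random(seed)
    lo, hi = 10 ** (n_digits - 1), 10 ** n_digits - 1
    a, b = rng.randint(lo, hi), rng.randint(lo, hi)
    return {"question": f"What is {a} + {b}?", "answer": str(a + b)}

def verify(task: dict, model_output: str) -> bool:
    """Automatic verifier: check the final token of the model output
    against the generated reference answer."""
    tokens = model_output.strip().split()
    return bool(tokens) and tokens[-1] == task["answer"]
```

Because tasks are generated rather than scraped, training data is effectively infinite and cannot be memorized from a fixed test set, which is the core appeal for RLVR.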
We've been thinking about what the "ideal" architecture should look like in the era where inference is driving AI progress. GTA & GLA are steps in this direction: attention variants tailored for inference: high arithmetic intensity (make GPUs go brr even during decoding), easy to…
"Pre-training was hard, inference easy; now everything is hard." – Jensen Huang. Inference drives AI progress b/c of test-time compute. Introducing inference-aware attn: parallel-friendly, high arithmetic intensity – Grouped-Tied Attn & Grouped Latent Attn
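The "grouped" part of these variants builds on the idea of sharing KV heads across groups of query heads (as in grouped-query attention), which shrinks the KV cache and raises arithmetic intensity during decoding. Below is a minimal NumPy sketch of that shared-KV grouping only; it is not the paper's GTA/GLA implementation, and the function name is illustrative.

```python
import numpy as np

def grouped_attention(q, k, v, n_q_heads: int, n_kv_heads: int):
    """q: (n_q_heads, T, d); k, v: (n_kv_heads, T, d).
    Each group of n_q_heads // n_kv_heads query heads reads the same
    KV head, so the KV cache shrinks by that factor."""
    group = n_q_heads // n_kv_heads
    d = q.shape[-1]
    outs = []
    for h in range(n_q_heads):
        kh, vh = k[h // group], v[h // group]  # shared KV head for this group
        scores = q[h] @ kh.T / np.sqrt(d)
        # numerically stable softmax over keys
        w = np.exp(scores - scores.max(axis=-1, keepdims=True))
        w /= w.sum(axis=-1, keepdims=True)
        outs.append(w @ vh)
    return np.stack(outs)  # (n_q_heads, T, d)
```

During decoding the bottleneck is reading the KV cache from memory, not FLOPs, so reusing each loaded KV head across several query heads does more math per byte loaded ("make GPUs go brr even during decoding").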
Introducing LisanBench LisanBench is a simple, scalable, and precise benchmark designed to evaluate large language models on knowledge, forward-planning, constraint adherence, memory and attention, and long context reasoning and "stamina". "I see possible futures, all at once.…