LLM360

@llm360

LLM360 is an open research lab enabling community-owned AGI through open-source large model research and development.

Joined November 2023

72Following

2KFollowers

Pinned

LLM360@llm360 · Oct 7

📢📢 We are releasing TxT360: a globally deduplicated dataset for LLM pretraining 🌐 99 Common Crawls 📘 14 Curated Sources 👨‍🍳 recipe to easily adjust data weighting and train the most performant models Dataset: huggingface.co/datasets/LLM36… Blog: huggingface.co/spaces/LLM360/…

llm360's tweet image. 📢📢
We are releasing TxT360: a globally deduplicated dataset for LLM pretraining
🌐 99 Common Crawls
📘 14 Curated Sources
👨‍🍳 recipe to easily adjust data weighting and train the most performant models

Dataset:
huggingface.co/datasets/LLM36…

Blog:
huggingface.co/spaces/LLM360/…

242

120

50.0K

LLM360@llm360 · Jun 14

Our team is lucky to have "early access" of this work from the IFM talk given by @ssahoo_

SSubham Sahoo@ssahoo_ · Jun 13

🚨 “The Diffusion Duality” is out! @ICML2025 ⚡️ Few-step generation in discrete diffusion language models by exploiting the underlying Gaussian diffusion. 🦾Beats AR on 3/7 zero-shot likelihood benchmarks. 📄 Paper: arxiv.org/abs/2506.10892 💻 Code: github.com/s-sahoo/duo 🧠…

2.0K

LLM360@llm360 · Jun 4

KV-caching is great, but will it work for Diffusion Language Models. @zhihanyang_ and team showed how to make it work with 65x speedup 🚀! Checkout the new preprint: arxiv.org/abs/2506.01928 The LLM360 team is very interested to explore new architectures.

ZZhihan Yang@zhihanyang_ · Jun 3

📢Thrilled to share our new paper: Esoteric Language Models (Eso-LMs) > 🔀Fuses autoregressive (AR) and masked diffusion (MDM) paradigms > 🚀First to unlock KV caching for MDMs (65x speedup!) > 🥇Sets new SOTA on generation speed-vs-quality Pareto frontier How? Dive in👇…

3.0K

LLM360@llm360 · Apr 24

The MBZUAI IFM and the LLM360 team's first day at @iclr_conf, come to visit our new Institute of Foundation Models! Booth D04 in Hall 2! We’re looking forward to meeting researchers and engineers to introduce them to @mbzuai .

llm360's tweet image. The MBZUAI IFM and the LLM360 team's first day at @iclr_conf, come to visit our new Institute of Foundation Models! Booth D04 in Hall 2!

We’re looking forward to meeting researchers and engineers to introduce them to @mbzuai .

4.0K

LLM360 Retweeted

EleutherAI@AiEleuther · Apr 22

Looking for EleutherAI at #ICLR2025? Come say hi at any of our five posters or the Open Science for Foundation Models workshop where @BlancheMinerva is giving the opening keynote. 🧵

3.0K

LLM360 Retweeted

Shangshang Wang@UpupWang · Apr 23

😋 Want strong LLM reasoning without breaking the bank? We explored just how cost-effectively RL can enhance reasoning using LoRA! [1/9] Introducing Tina: A family of tiny reasoning models with strong performance at low cost, providing an accessible testbed for RL reasoning. 🧵

352

307

31.0K

LLM360 Retweeted

Qian Liu@sivil_taram · Jan 9

🎉 Announcing the first Open Science for Foundation Models (SCI-FM) Workshop at #ICLR2025! Join us in advancing transparency and reproducibility in AI through open foundation models. 🤝 Looking to contribute? Join our Program Committee: bit.ly/4acBBjF 🔍 Learn more at:…

175

40.0K