Ruibin Yuan
@abc43992899
ML Student @HKUST, Music Tech MS @CarnegieMellon. Co-Founder of Multimodal Art Projection Community (MAP)
🔥 YuE Song Generation AI now works even on 8GB VRAM! Thanks to Morpheus, who sent a PR adding an 8GB VRAM option to the pinokiofactory repo, 8GB VRAM machines can now generate songs locally.
🚀 Thrilled to announce our new work: FR3E (First Return, Entropy-Eliciting Explore)! LLM reasoning with Reinforcement Learning often struggles with unstable and inefficient exploration. We propose FR3E, a structured framework to make it more robust & efficient.
An impressive song generation project from Tencent. It delivers high audio fidelity while maintaining fast generation speed. 📄 Paper: arxiv.org/abs/2506.07520 💻 Code: github.com/tencent-ailab/… 🧪 Try it here: huggingface.co/spaces/tencent…
Our team from Microsoft Research Asia, UCLA, the Chinese Academy of Sciences, and Tsinghua University released a paper, “TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression,” proposing an innovative training method that effectively compresses the reasoning process.
Hi everyone! The field of LLM-based reasoning has seen tremendous progress and rapid development over the past few months. We’ve updated our survey with the latest advances, now covering over 500 papers! “From System 1 to System 2: A Survey of Reasoning Large Language Models”
🙋‍♂️ Can RL training address model weaknesses without external distillation? 🚀 Please check out our latest work on RL for LLM reasoning! 💯 TL;DR: We propose augmenting RL training with synthetic problems targeting the model’s reasoning weaknesses. 📊 Qwen2.5-32B: 42.9 → SwS-32B: 68.4
Announcing 🎙️ Kimi-Audio! Our new open-source audio foundation model advances capabilities in audio understanding, generation, and conversation. Key Features & Achievements: ✅ Universal audio foundation model handles diverse tasks like speech recognition, audio understanding,…
🔥Kimi-Audio, a universal audio foundation model pre-trained on 13+ million hours of audio data and achieving SOTA performance on 10+ audio benchmarks. Tech Report: arxiv.org/abs/2504.18425 Model & Code & Evalkit: github.com/MoonshotAI/Kim… Congrats to the excellent team!
🚀 General-Reasoner: Generalizing LLM Reasoning Across All Domains (Beyond Math) Most recent RL/R1 works focus on math reasoning, but math-only tuning doesn't generalize to general reasoning (e.g., performance drops on MMLU-Pro and SuperGPQA). Why are we limited to math reasoning? 1. Existing…
We are excited to introduce FlexWorld, a framework capable of generating 3D scenes from a single image that supports flexible viewpoint navigation, including 360° rotation and zooming. Code and model weights are open-source—try it out! Project Page:ml-gsai.github.io/FlexWorld/
YuE just dropped as an open-source AI that turns lyrics into fully-formed songs - vocals, instruments, structure and all. Generates pretty coherent, full-length tracks across multiple languages. Paper: arxiv.org/pdf/2503.08638 Demos are here (including a metal song called 'step…
Now the YuE paper is finally out, check it out! arxiv: arxiv.org/abs/2503.08638 demo: map-yue.github.io code: github.com/multimodal-art… @huggingface @_akhaliq
IT KEEPS GETTING BETTER: YuE (乐), an open-source full-song music generation model that rivals Suno AI! It’s Hugging Face & LLaMA-compatible for easy fine-tuning.
Thanks for sharing our work!
YuE: Scaling Open Foundation Models for Long-Form Music Generation "We tackle the task of long-form music generation—particularly the challenging lyrics-to-song problem—by introducing YuE (乐), a family of open foundation models based on the LLaMA2 architecture. Specifically,…
We're excited to introduce our TTS model Spark-TTS: ✅ Qwen2.5 architecture – single-stage, single-stream ✅Natural voice cloning & cross-lingual synthesis ✅ Voice Creation 📄 Paper: arxiv.org/pdf/2503.01710 🖥 Code: github.com/SparkAudio/Spa… 🎧 Demo: sparkaudio.github.io/spark-tts/
Check out Spark-TTS on Hugging Face: 🤗huggingface.co/SparkAudio/Spa… You can also give it a try directly here: 🤗huggingface.co/spaces/Mobvoi/….
❗️Open-source MoE kernels alert❗️ Introducing COMET, a computation/communication library for MoE models from ByteDance. Battle-tested in our 10k+ GPU clusters, COMET shows promising efficiency gains and significant GPU-hour savings (millions 💰💰💰). Integration of DualPipe &…
I interviewed for LLM/ML research scientist/engineering positions last Fall. Over 200 applications, 100 interviews, many rejections & some offers later, I decided to write the process down, along with the resources I used. Links to the process & resources in the following tweets