LLaMA Factory
@llamafactory_ai
Towards easy and efficient fine-tuning of large language models
LLaMA-Factory v0.9.3 released! Thank you for 50k GitHub stars 🌟 Fully open-source, no-code fine-tuning on a Gradio UI for 300+ models -- including Qwen3, Llama 4, Gemma 3, InternVL3, Qwen2.5-Omni, etc. • Install locally via our Docker image: hub.docker.com/r/hiyouga/llam… •…


New tech report out! 🚀 "Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training" is an expanded version of our ProRL paper, now with more training insights and experimental details. Read it here 👉 arxiv.org/abs/2507.12507
Does RL truly expand a model's reasoning 🧠 capabilities? Contrary to recent claims, the answer is yes, if you push RL training long enough! Introducing ProRL 😎, a novel training recipe that scales RL to >2k steps, empowering the world's leading 1.5B reasoning model 💥 and offering…
That would be so 🔥🔥🔥 @Alibaba_Qwen @Kimi_Moonshot
If I could have one wish today, I would wish for Kimi and Qwen to release their post-training datasets like Nous does 🫣🤗 We could all build off each other's work a lot more easily that way!
📢📢📢 Releasing OpenThinker3-1.5B, the top-performing SFT-only model at the 1B scale! 🚀 OpenThinker3-1.5B is a smaller version of our previous 7B model, trained on the same OpenThoughts3-1.2M dataset.
Introducing Easy Dataset: a no-code framework for synthesizing fine-tuning data from unstructured documents using LLMs/Ollama. Supports OCR, chunking, QA augmentation, and export to the LlamaFactory/Unsloth fine-tuning frameworks. huggingface.co/papers/2507.04…
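The chunk-then-synthesize pipeline is easy to approximate by hand. A minimal sketch, assuming a hypothetical `generate_qa` placeholder where the real LLM/Ollama call would go; the alpaca-style JSON it writes is one of the dataset formats LLaMA-Factory accepts for SFT:

```python
# Sketch of the Easy Dataset idea: chunk raw documents, draft QA pairs per
# chunk, and export alpaca-style JSON for LLaMA-Factory. `generate_qa` is a
# hypothetical stand-in for a real LLM/Ollama call.
import json

def chunk_text(text: str, size: int = 1000, overlap: int = 100) -> list[str]:
    """Split text into fixed-size character chunks with overlap."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]

def generate_qa(chunk: str) -> list[dict]:
    """Crude extractive stand-in; swap in an LLM call for real QA synthesis."""
    return [{"question": "Summarize the following passage.",
             "answer": chunk.split(".")[0].strip() + "."}]

def export_alpaca(documents: list[str], path: str = "easy_dataset.json") -> None:
    records = [
        {"instruction": qa["question"], "input": chunk, "output": qa["answer"]}
        for doc in documents
        for chunk in chunk_text(doc)
        for qa in generate_qa(chunk)
    ]
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)

export_alpaca(["LLaMA-Factory enables no-code fine-tuning of open models. " * 40])
```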
LLaMA-Factory supported multimodal fine-tuning of the open-source GLM-4.1V-Thinking model on Day 0 🔥
ZHIPU SHIPS GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning. GLM-4.1V-9B-Thinking introduces explicit intermediate reasoning through reinforcement learning with curriculum sampling, improving performance on tasks requiring…
PPO and GRPO: a workflow breakdown of the most popular reinforcement learning algorithms ➡️ Proximal Policy Optimization (PPO): The Stable Learner. It's used everywhere from dialogue agents to instruction tuning because it balances learning fast with staying safe. ▪️ How PPO…
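The thread is cut off before the PPO details, so for reference: the heart of PPO is the clipped surrogate objective. A minimal PyTorch sketch with illustrative variable names, not taken from any particular library; GRPO's main departure is replacing the learned value baseline with group-normalized rewards when computing the advantages:

```python
import torch

def ppo_clip_loss(logp_new: torch.Tensor,
                  logp_old: torch.Tensor,
                  advantages: torch.Tensor,
                  clip_eps: float = 0.2) -> torch.Tensor:
    """Clipped surrogate objective from PPO.

    Taking the elementwise min of the raw and clipped terms is what keeps
    the learner 'safe': once the policy ratio leaves [1-eps, 1+eps], the
    update gets no further credit for moving away from the old policy.
    """
    ratio = torch.exp(logp_new - logp_old)          # pi_new(a|s) / pi_old(a|s)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    return -torch.min(unclipped, clipped).mean()    # negate: optimizers minimize
```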
LLaMA Factory on ROCm 🔥
Fine-tune Llama-3.1 8B with Llama-Factory on AMD GPUs with this step-by-step guide: bit.ly/4k14ORL Discover more fine-tuning tutorials on the ROCm AI Developer Hub: bit.ly/4kLQiOQ
LLaMA-Factory now supports fine-tuning the Falcon H1 family of models using full fine-tuning or LoRA, kudos @DhiaRhayem
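Under the hood, LoRA fine-tuning means freezing the base weights and training small low-rank adapters on selected projection layers. A minimal sketch with Hugging Face PEFT, assuming an illustrative small checkpoint and Llama/Qwen-style module names (Falcon H1's projection names may differ, so adjust `target_modules` for your model); in LLaMA-Factory itself the same choice is expressed declaratively via `finetuning_type: lora` in a training config rather than written by hand:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Illustrative small checkpoint; substitute the Falcon H1 variant you want.
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")

lora_cfg = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=16,                        # scaling applied to the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections; names vary by architecture
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)   # freezes base weights, adds adapters
model.print_trainable_parameters()        # typically <1% of parameters train
```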

Insane milestone for Llama Factory!
DeepSeek 671B and Qwen3 236B support with the Megatron backend is now available as a preview in verl v0.4.0 🔥🔥🔥 We will continue optimizing MoE model performance down the road. DeepSeek 671B: verl.readthedocs.io/en/latest/perf… verl v0.4: github.com/volcengine/ver…
Open weights, open data, open code -- a SOTA reasoning model with only 7B parameters. Excited to see LlamaFactory powering its training 🥳
Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-Distill-Qwen-7B by 33% on average over code, science, and math evals. We also release our dataset, OpenThoughts3-1.2M, which is the best open reasoning dataset across all data…
Paper: arxiv.org/abs/2506.04178 Model: huggingface.co/open-thoughts/… Dataset: huggingface.co/datasets/open-… Code: github.com/open-thoughts/… Blog: openthoughts.ai/blog/ot3 (10/N)
tbf @llamafactory_ai is the next LMArena: open-source, built with Gradio, and hugely impactful in the world of LLMs and MLLMs! 🙌
Fine-tune 100+ LLMs directly from a UI! LLaMA-Factory lets you train and fine-tune open-source LLMs and VLMs without writing any code. Supports 100+ models, multimodal fine-tuning, PPO, DPO, experiment tracking, and much more! 100% open-source with 50k stars!