mrfakename

@realmrfakename

LLMs, TTS, & Open Source https://huggingface.co/mrfakename

Bay Area

Joined December 2021

297Following

2KFollowers

Pinned

mrfakename@realmrfakename · Jun 15

Introducing LLMCost: stop overpaying for LLMs LLMCost is a centralized dashboard that shows prices for 100+ LLMs across different providers, including @LambdaAPI, @FireworksAI_HQ, @inference_net, and more. Find the cheapest provider for any LLM. 100+ LLMs, updated nightly.

realmrfakename's tweet image. Introducing LLMCost: stop overpaying for LLMs

LLMCost is a centralized dashboard that shows prices for 100+ LLMs across different providers, including @LambdaAPI, @FireworksAI_HQ, @inference_net, and more.

Find the cheapest provider for any LLM.

100+ LLMs, updated nightly.

3.0K

mrfakename@realmrfakename · 7 h

Can we get any assurance about copyright, etc?

cclem 🤗@ClementDelangue · 8 h

Love to see this from @WhiteHouse!

207

mrfakename Retweeted

Mistral AI@MistralAI · Jul 22

In our continued commitment to open-science, we are releasing the Voxtral Technical Report: arxiv.org/abs/2507.13264 The report covers details on pre-training, post-training, alignment and evaluations. We also present analysis on selecting the optimal model architecture, which…

169

1.0K

298

46.0K

mrfakename Retweeted

Lisan al Gaib@scaling01 · Jul 22

Qwen about to release a 480B MoE for coding with 1 million context! "Qwen3-Coder-480B-A35B-Instruct is a powerful coding-specialized language model excelling in code generation, tool use, and agentic tasks."

787

148

126.0K

mrfakename@realmrfakename · Jul 22

Qwen3-Coder

CCasper Hansen@casper_hansen_ · Jul 22

if you loved kimi k2, you will love what a certain chinese team is about to release which is highly competitive with 1M context length

374

mrfakename@realmrfakename · Jul 22

Looks like Hugging Face is running out of GPUs 😂 Half the time when I try to start a Space I get a scheduling error

236

mrfakename@realmrfakename · Jul 22

Gradio demo for DMOSpeech 2 now live! 2x faster F5-TTS with improved WER and stability through RL training, from the authors of F5-TTS: huggingface.co/spaces/mrfaken…

realmrfakename's tweet image. Gradio demo for DMOSpeech 2 now live!

2x faster F5-TTS with improved WER and stability through RL training, from the authors of F5-TTS:

huggingface.co/spaces/mrfaken…

236

mrfakename Retweeted

Junyang Lin@JustinLin610 · Jul 21

no hybrid thinking mode tonight

381

45.0K

mrfakename@realmrfakename · Jul 20

I highly suspect that Baseten is serving K2 in FP4 See this HF repo (might be removed soon lol)

XXeophon@xeophon_ · Jul 20

445

mrfakename Retweeted

Xeophon@xeophon_ · Jul 20

747

69.0K