Caleb

@calebfahlgren

Software + Product @huggingface🤗

Joined January 2018

965Following

3KFollowers

Caleb@calebfahlgren · 11 h

now you can view json prettified such as the tools list from a string column on huggingface thanks to @calebfahlgren from @huggingface datasets team for implementing this feature in addition to conversation json view 🙌

iinterstellarninja@intrstllrninja · Jul 22

today i'm releasing 50k rows of tool-use reasoning dataset compilation on huggingface includes following BFCL scenarios: - single turn tool-use - multiturn tool-use - multistep tool-use - relevance reasoning huggingface.co/datasets/inter…

525

Caleb@calebfahlgren · 12 h

There's now support for viewing JSON in string / dict columns in @huggingface datasets!!! 🔍 Great for all the tool calling datasets like the brand new hermes tool use dataset by @intrstllrninja

CCaleb@calebfahlgren · Jul 11

NEW 🔥!! There's can now view JSON for List cells on @huggingface datasets. Now there's no excuse for looking at your data! 🫣

1.0K

Caleb Retweeted

Aravind Srinivas@AravSrinivas · Jul 23

Incredible results! Open source is winning.

1.0K

280

97.0K

Caleb Retweeted

Qwen@Alibaba_Qwen · Jul 22

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

253

1.0K

8.0K

4.0K

1.5M

Caleb Retweeted

Quentin Lhoest 🤗@lhoestq · Jul 22

Parquet Content Defined Chunking is available in PyArrow 21 :) Ideal for deduplication

442

Caleb Retweeted

interstellarninja@intrstllrninja · Jul 22

308

191

52.0K

Caleb Retweeted

Tesla@Tesla · Jul 21

Tesla Diner & Supercharger in Hollywood, LA Open 24/7, starting now

2.0K

5.0K

31.0K

4.0K

23.9M

Caleb Retweeted

Tanishq Abraham is at ICML@iScienceLuvr · Jul 21

Kimi K2 paper dropped! describes: - MuonClip optimizer - large-scale agentic data synthesis pipeline that systematically generates tool-use demonstrations via simulated and real-world environments - an RL framework that combines RLVR with a self- critique rubric reward mechanism…

172

977

598

56.0K

Caleb Retweeted

Qwen@Alibaba_Qwen · Jul 21

Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…

216

578

4.0K

849

877.0K

Caleb Retweeted

Quentin Lhoest 🤗@lhoestq · Jul 21

A new Pandas feature landed 3 days ago and no one noticed. Upload ONLY THE NEW DATA to dedupe-based storage like @huggingface (Xet). Data that already exist in other files don't need to be uploaded. Possible thanks to the recent addition of Content Defined Chunking for Parquet.

16.0K

Caleb Retweeted

Cline@cline · Jul 18

🤗🤗🤗 🤗❤️🤗 @huggingface & Cline = your LLM playground 🤗🤗🤗 You can access Kimi K2 & 6,140 (!) other open source models in Cline.

502

270

156.0K

Caleb@calebfahlgren · Jul 18

The @huggingface Inference Providers is getting even easier to use! Now with a unified OpenAI client route. Just use the model id and it works. You can also set your preferred provider with `:groq` for example. Here's how easy it is to use @GroqInc and Kimi K2

calebfahlgren's tweet image. The @huggingface Inference Providers is getting even easier to use! Now with a unified OpenAI client route.

Just use the model id and it works. You can also set your preferred provider with `:groq` for example.

Here's how easy it is to use @GroqInc and Kimi K2

12.0K

Caleb Retweeted

Nathan Lambert@natolambert · Jul 17

Is it "bad" that everyone is distilling from / training on Chinese models? While not directly bad, there is a large soft power component. Many completions that soapbox about Chinese socialist ideals / PRC values that filter into future AI models / spread all over the internet.

100

9.0K

Caleb Retweeted

Together AI@togethercompute · Jul 17

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528 We’ve upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs—and the results speak for themselves: 📈 Highest known serverless throughput: 334 tokens/sec 🏃‍Fastest time to first answer token:…

102

36.0K

Caleb@calebfahlgren · Jul 15

We open-sourced 99% of US caselaw on @huggingface. Both AI and legal tech companies are selling this data for a high premium. You can simply just build a wrapper around it and freely compete with them now. That is why we love open-source. huggingface.co/datasets/commo…

EEleanor Berger@intellectronica · Jul 14

Why are legal services such a lucrative opportunity for AI? ・ ⚖️ Legal work is programming with words, a natural fit for AI. ・ 🚀 The market pull is unprecedented, with sales cycles cut from months to weeks. ・ 💻 AI engineers have an unfair advantage, able to build the…

408

3.0K

520.0K

Caleb Retweeted

jeff@jeffreyhuber · Jul 16

i think about this a lot

132

2.0K

1.0K

211.0K