Together AI (@togethercompute)

Pinned

Together AI@togethercompute · Jul 17

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528 We’ve upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs—and the results speak for themselves: 📈 Highest known serverless throughput: 334 tokens/sec 🏃‍Fastest time to first answer token:…

Together AI@togethercompute · 7 h

Qwen 3 is now the best instruct model (non-reasoning) amongst both closed and open source LLMs.

Artificial Analysis@ArtificialAnlys · 7 h

Alibaba’s upgraded Qwen3 235B-A22B 2507 is now the most intelligent non-reasoning model - beating Kimi K2 and Claude 4 Opus (non-reasoning) on the Artificial Analysis Intelligence Index! Qwen3 235B 2507 is a non-reasoning model (it is not trained to ‘think’ before it answers).…

Together AI@togethercompute · 7 h

Another incredible OSS model release this summer: the new Qwen 3 update is now live on the @togethercompute API.

Together AI@togethercompute · 9 h

🧠 Qwen3 just leveled up on Together AI 🚀 Qwen3-235B-A22B-Instruct-2507-FP8 isn't just another model update - it's a leap forward 📈

Together AI@togethercompute · 19 h

We built an open source voice note taking app using our fast Whisper implementation! Check it out -> usewhisper.io

Together AI@togethercompute · Jul 21

We made it easier for LLMs & code editors to use Together AI! You can now get our llms.txt: docs.together.ai/llms.txt This lets LLMs/code editors know the structure of our docs when working with our APIs.
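
For readers curious how a tool might consume a file like this, here is a minimal sketch. It assumes the common llms.txt convention of `## ` section headings followed by `- [title](url): note` link lines. The sample text, the URLs, and the `parse_llms_txt` helper are all hypothetical and are not taken from docs.together.ai/llms.txt.

```python
import re

# Hypothetical sample in the llms.txt style; NOT the real file contents.
SAMPLE = """# Example Docs

> Short summary of the docs site.

## Guides

- [Quickstart](https://example.com/quickstart.md): First API call
- [Models](https://example.com/models.md)
"""

# Matches "- [title](url)" with an optional ": note" suffix.
LINK = re.compile(
    r"^- \[(?P<title>[^\]]+)\]\((?P<url>[^)]+)\)(?::\s*(?P<note>.*))?$"
)


def parse_llms_txt(text):
    """Group link entries under the '## ' heading they appear beneath."""
    sections = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("## "):
            current = line[3:].strip()
            sections[current] = []
        elif current is not None:
            match = LINK.match(line)
            if match:
                entry = (match["title"], match["url"], match["note"] or "")
                sections[current].append(entry)
    return sections


if __name__ == "__main__":
    for section, links in parse_llms_txt(SAMPLE).items():
        print(section, links)
```

An editor or agent could use a mapping like this to decide which documentation page to fetch before answering an API question, rather than crawling the whole site.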

Together AI Retweeted

Hassan@nutlope · Jul 12

Building an app to help folks take notes with their voice & transform them with AI! Will be free, open source, and powered by the new ultrafast Whisper model from @togethercompute. Launching in a few days!

Together AI@togethercompute · Jul 17

We rolled out a new inference engine built for NVIDIA Blackwell! DeepSeek R1 offers the best peak performance (386 TPS) across any service or silicon in production today for the full R1 model with impressive latency and throughput at higher batch sizes. You can try the…

Together AI@togethercompute · Jul 17

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528 We’ve upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs—and the results speak for themselves: 📈 Highest known serverless throughput: 334 tokens/sec 🏃‍Fastest time to first answer token:…

Together AI@togethercompute · Jul 17

🎉 Congratulations to Together AI for raising the bar with record-fast inference on the DeepSeek-R1-0528 model, accelerated by our #NVIDIABlackwell platform—built for next-level compute, memory, and bandwidth to uplift the entire AI ecosystem. #AcceleratedComputing Learn more…

Together AI@togethercompute · Jul 17

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528 We’ve upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs—and the results speak for themselves: 📈 Highest known serverless throughput: 334 tokens/sec 🏃‍Fastest time to first answer token:…

Together AI Retweeted

NVIDIA Data Center@NVIDIADC · Jul 17

👏👏👏

Together AI@togethercompute · Jul 17

We now have the fastest speeds for DeepSeek R1 – up to 330 tokens/sec running on B200s! Here it is in action – video is not sped up!

Together AI@togethercompute · Jul 17

Together AI Sets a New Bar: Fastest Inference for DeepSeek-R1-0528 We’ve upgraded the Together Inference Engine to run on @NVIDIA Blackwell GPUs—and the results speak for themselves: 📈 Highest known serverless throughput: 334 tokens/sec 🏃‍Fastest time to first answer token:…

Together AI Retweeted

Clémentine Fourrier 🍊@clefourrier · Jul 17

Can LLMs predict the future? In FutureBench, friends from @togethercompute create new questions from evolving news & markets: As time passes, we'll see which agents are the best at predicting events that have yet to happen! 🔮 Also cool: by design, dynamic & uncontaminated eval

Together AI@togethercompute · Jul 17

Read more: together.ai/blog/futureben… See the leaderboard: huggingface.co/spaces/togethe…

Together AI@togethercompute · Jul 17

Most AI benchmarks test the past. But real intelligence is about predicting the future. Introducing FutureBench — a new benchmark for evaluating agents on real forecasting tasks that we developed with @huggingface 🔍 Reasoning > memorization 📊 Real-world events 🧠 Dynamic,…
