Robert Nishihara (@robertnishihara)

Pinned

R

Robert Nishihara@robertnishihara · Jun 15

Beyond pre-training, here's how I imagine most learning will work. 1. AI models / systems will maintain large collections of retrievable knowledge. This will include facts like "the capital of California is Sacramento" and tactics like "when playing Monopoly, buy a bunch of…

RRobert Nishihara@robertnishihara · Jun 1

We're missing techniques for "training-time reasoning." Right now there's a lot of progress on inference-time reasoning, which is incredibly cool (I use o3 all the time). If I think about how I learn stuff, e.g., when reading a technical paper, it's very compute intensive. Most…

1

5

45

40

10.0K

Pinned

Robert Nishihara Retweeted

L

Linda Vivah (Haviv)@lindavivah · Jul 22

📣 Excited to share that I’ve joined @anyscalecompute as a Staff Developer Advocate! This is the brilliant team behind ✨Ray✨ @raydistributed - the open-source compute engine powering AI workloads at OpenAI, Spotify, Netflix, Uber, Amazon, Pinterest, and many more! Can’t…

11

8

114

11

6.0K

R

Robert Nishihara@robertnishihara · Jul 25

Models just want to generalize. For the past years, we’ve been pushing the frontier of controllability in video, releasing new models and techniques for inpainting, outpainting, segmentation, stylization, keyframing, motion and camera control. Aleph is a single in-context model…

RRunway@runwayml · Jul 25

Introducing Runway Aleph, a new way to edit, transform and generate video. Aleph is a state-of-the-art in-context video model, setting a new frontier for multi-task visual generation, with the ability to perform a wide range of edits on an input video such as adding, removing…

12

31

182

48

26.0K

R

Robert Nishihara@robertnishihara · Jul 25

I always love when companies use Ray for a large variety of different workloads.

rray@raydistributed · Jul 25

How @klaviyo uses Ray for data processing, training, hyperparameter tuning, and model serving! klaviyo.tech/ray-data-train…

0

3

18

1

2.0K

Robert Nishihara Retweeted

W

Weights & Biases@weights_biases · Jul 22

🚀 AI workloads are exploding. @robertnishihara of @anyscalecompute shows how Kubernetes, Ray, PyTorch and vLLM snap together into one open-source compute stack. Think auto GPU pools, microsecond serving, real-world patterns. See the full talk below!

1

2

7

4

1.0K

R

Robert Nishihara@robertnishihara · Jul 22

If you're building with @vllm_project, speak at the dedicated vLLM track at Ray Summit in November.

rray@raydistributed · Jul 22

Last year, the creators of @vllm_project at UC Berkeley hosted a massive two-day vLLM event featuring presentations from Roblox, Uber, Apple, Intel, Alibaba, Neural Magic, IBM, Handshake, Databricks, Anyscale, and others on how they are using and optimizing vLLM. This covered…

0

3

13

3

3.0K

R

Robert Nishihara@robertnishihara · Jul 20

Everyone talks about how voice mode (once polished) will be a major UX unlock for AI, which is correct. An equally important frontier, which no one has touched yet, is AI group chats. Lots of hard product challenges to solve there, but it'll be hard to imagine AI without it once…

2

1

11

7

2.0K

R

Robert Nishihara@robertnishihara · Jul 20

I started reading this thread and then got distracted trying to solve the math problem. It's a great problem and very enjoyable to think about. I highly encourage you to get out a sheet of paper, draw some triangles, and take a crack at it.

AAlexander Wei@alexwei_ · Jul 19

2/N We evaluated our models on the 2025 IMO problems under the same rules as human contestants: two 4.5 hour exam sessions, no tools or internet, reading the official problem statements, and writing natural language proofs.

0

8

0

1.0K

R

Robert Nishihara@robertnishihara · Jul 17

Reinforcement learning is a big investment area for us at @anyscalecompute, and we're hiring actively for RL! If you're interested in building systems & algorithms for RL, message me.

RRobert Nishihara@robertnishihara · Jul 16

Congratulations to my brilliant co-founder Philipp Moritz (@pcmoritz) and the legendary John Schulman, Sergey Levine, Pieter Abbeel, and Michael Jordan on their Test-of-Time Honorable Mention at ICML 2025 today! For creating TRPO. This was done during the previous wave of…

0

9

1

1.0K

R

Robert Nishihara@robertnishihara · Jul 17

Huge congrats to @pcmoritz, co-founder of @anyscalecompute for the Test-of-Time Honorable Mention at #ICML2025

RRobert Nishihara@robertnishihara · Jul 16

Congratulations to my brilliant co-founder Philipp Moritz (@pcmoritz) and the legendary John Schulman, Sergey Levine, Pieter Abbeel, and Michael Jordan on their Test-of-Time Honorable Mention at ICML 2025 today! For creating TRPO. This was done during the previous wave of…

0

1

7

0

1.0K

R

Robert Nishihara@robertnishihara · Jul 17

well-deserved!

RRobert Nishihara@robertnishihara · Jul 16

Congratulations to my brilliant co-founder Philipp Moritz (@pcmoritz) and the legendary John Schulman, Sergey Levine, Pieter Abbeel, and Michael Jordan on their Test-of-Time Honorable Mention at ICML 2025 today! For creating TRPO. This was done during the previous wave of…

0

1

6

0

1.0K

R

Robert Nishihara@robertnishihara · Jul 16

Extremely deserved honor for a foundational paper.

RRobert Nishihara@robertnishihara · Jul 16

Congratulations to my brilliant co-founder Philipp Moritz (@pcmoritz) and the legendary John Schulman, Sergey Levine, Pieter Abbeel, and Michael Jordan on their Test-of-Time Honorable Mention at ICML 2025 today! For creating TRPO. This was done during the previous wave of…

0

1

10

5

3.0K

R

Robert Nishihara@robertnishihara · Jul 16

In large part due to Philipp's work on TRPO, reinforcement learning was one of the original motivating use cases that led us to build @raydistributed. You can see how we framed it in our early Ray paper (on page 1). arxiv.org/abs/1712.05889

RRobert Nishihara@robertnishihara · Jul 16

Congratulations to my brilliant co-founder Philipp Moritz (@pcmoritz) and the legendary John Schulman, Sergey Levine, Pieter Abbeel, and Michael Jordan on their Test-of-Time Honorable Mention at ICML 2025 today! For creating TRPO. This was done during the previous wave of…

0

6

18

5

5.0K

R

Robert Nishihara@robertnishihara · Jul 16

Congratulations @pcmoritz!

RRobert Nishihara@robertnishihara · Jul 16

Congratulations to my brilliant co-founder Philipp Moritz (@pcmoritz) and the legendary John Schulman, Sergey Levine, Pieter Abbeel, and Michael Jordan on their Test-of-Time Honorable Mention at ICML 2025 today! For creating TRPO. This was done during the previous wave of…

0

1

5

0

976

Robert Nishihara Retweeted

T

Turing@turingcom · Jul 16

What a night at the Turing × @FoundationCap × @anyscalecompute happy hour! @ashugarg, @robertnishihara & @jonsidd broke down how Reinforcement Learning is evolving into enterprise-grade agents. Imagine the most complex apps you use daily—now imagine RL agents mastering…

0

6

14

2

1.0K