VR

@Viswanatha20

Passionate about CV & DL. Exploring Diffusion Models, LLMs, etc. | Amazon | @UWMadison

Seattle

Joined February 2016

142 Following

36 Followers

VR Retweeted

Lucas Beyer (bl16)@giffmana · Jun 17

Gemini 2.5 paper TL;DR. Technical part in thread.
Contributors: ~1k
2.5 Pro timed out counting after 600s
2.5 Flash counts 1228 in 60s
o3 counts 919 "after dedup" in 4m9s
No grouping or "leads", just one long list. I guess too much infighting or poaching from this in the past?

1.0K

831

169.0K

VR Retweeted

Riley Goodside@goodside · Jun 17

POV: You have angered ChatGPT with the stupidity of your question

611

49.0K

VR@Viswanatha20 · Jun 17

98.5th percentile for 10 cents is now considered "BAD NEWS for LLMs"

Rohan Paul@rohanpaul_ai · Jun 16

This is really BAD news of LLM's coding skill. ☹️ The best Frontier LLM models achieve 0% on hard real-life Programming Contest problems, domains where expert humans still excel. LiveCodeBench Pro, a benchmark composed of problems from Codeforces, ICPC, and IOI (“International…

246

57.0K

VR Retweeted

VR@Viswanatha20 · Jun 6

OpenReview is now open for camera-ready submissions. Please try submitting your paper again.

201

VR@Viswanatha20 · May 30

I just watched a great compilation of various people's views about what is coming: x.com/Scr0nkf1nkle/s…

Scr0nkf1nkle@Scr0nkf1nkle · May 29

The Great AI Job Displacement Is Closer Than You Think

150

366

2.0K

324.0K

VR Retweeted

Andrej Karpathy@karpathy · Apr 7

x.com/i/article/1909…

211

822

6.0K

5.0K

996.0K

VR Retweeted

Andrej Karpathy@karpathy · Mar 27

The reality of building web apps in 2025 is that it's a bit like assembling IKEA furniture. There's no "full-stack" product with batteries included, you have to piece together and configure many individual services: - frontend / backend (e.g. React, Next.js, APIs) - hosting…

1.0K

2.0K

19.0K

10.0K

1.7M

VR Retweeted

Aran Komatsuzaki@arankomatsuzaki · Mar 25

Nvidia presents: FFN Fusion: Rethinking Sequential Computation in Large Language Models
1.71x speedup in inference latency and 35x lower per-token cost while maintaining strong performance across benchmarks

288

152

29.0K

VR Retweeted

Inception@InceptionAILabs · Feb 26

We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.

225

987

5.0K

2.0K

1.9M

VR Retweeted

Alex Albert@alexalbert__ · Jan 23

We've rolled out Citations in the Anthropic API. Citations allows Claude to ground its answers in user-provided information and provide precise references to the sentences and passages used in its responses. Here's how it works:

135

2.0K

838

181.0K

VR Retweeted

Rowan Cheung@rowancheung · Oct 22

Anthropic just announced Computer Use
It allows Claude to control your computer screen based on a prompt and take actions on your behalf
The use cases in agentic coding with automated debugging, customer support, and education are going to be INSANE

196

1.0K

9.0K

5.0K

1.0M

VR Retweeted

Andrew Ng@AndrewYNg · Jun 11, 2024

I think AI agentic machine translation has huge potential for improving over traditional neural machine translation, and am releasing as open-source a demonstration I'd been playing with as a fun weekend project. Using an agentic workflow, this demonstration (i) Prompts an LLM…

351

2.0K

1.0K

543.0K