VR
@Viswanatha20
Passionate about CV & DL. Exploring Diffusion Models, LLMs, etc. | Amazon | @UWMadison
Gemini 2.5 paper TL;DR. Technical part in thread. Contributors: ~1k 2.5 Pro timed out counting after 600s 2.5 Flash counts 1228 in 60s o3 counts 919 "after dedup" in 4m9s No grouping or "leads", just one long list. I guess too much infighting or poaching from this in the past?
POV: You have angered ChatGPT with the stupidity of your question
98.5th percentile for 10 cents is now considered "BAD NEWS for LLMs"
This is really BAD news of LLM's coding skill. ☹️ The best Frontier LLM models achieve 0% on hard real-life Programming Contest problems, domains where expert humans still excel. LiveCodeBench Pro, a benchmark composed of problems from Codeforces, ICPC, and IOI (“International…
OpenReview is now open for camera-ready submissions. Please try submitting your paper again.
I just watched a great compilation of various people's views about what is coming: x.com/Scr0nkf1nkle/s…
The Great AI Job Displacement Is Closer Than You Think
The reality of building web apps in 2025 is that it's a bit like assembling IKEA furniture. There's no "full-stack" product with batteries included, you have to piece together and configure many individual services: - frontend / backend (e.g. React, Next.js, APIs) - hosting…
Nvidia presents: FFN Fusion: Rethinking Sequential Computation in Large Language Models 1.71x speedup in inference latency and 35x lower per-token cost while maintaining strong performance across benchmarks
We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.
We've rolled out Citations in the Anthropic API. Citations allows Claude to ground its answers in user-provided information and provide precise references to the sentences and passages used in its responses. Here's how it works:
Anthropic just announced Computer Use It allows Claude to control your computer screen based on a prompt and take actions on your behalf The use cases in agentic coding with automated debugging, customer support, and education are going to be INSANE
I think AI agentic machine translation has huge potential for improving over traditional neural machine translation, and am releasing as open-source a demonstration I'd been playing with as a fun weekend project. Using an agentic workflow, this demonstration (i) Prompts an LLM…