Greg Kamradt

@GregKamradt

President @arcprize, Founder https://www.leverage.to, builder/engineer

San Francisco, CA

Joined January 2011

914 Following

42K Followers

Pinned

Greg Kamradt@GregKamradt · Jul 18

Intelligence is interactive. Life does not happen in a single turn, yet frontier AI is measured with static benchmarks. Today we're previewing ARC-AGI-3, an Interactive Reasoning Benchmark. You can play (and build agents) on it today.

ARC Prize@arcprize · Jul 18

Today, we're announcing a preview of ARC-AGI-3, the Interactive Reasoning Benchmark with the widest gap between easy for humans and hard for AI.

We’re releasing:
* 3 games (environments)
* $10K agent contest
* AI agents API

Starting scores - Frontier AI: 0%, Humans: 100%

Greg Kamradt@GregKamradt · 14 h

Anyone have a connection at @Alibaba_Qwen? We're trying to reproduce their results on @arcprize and getting different metrics. Want to get a hold of them and find out how they tested.

Greg Kamradt@GregKamradt · Jul 21

.@arcprize listed on the @Alibaba_Qwen model card. 2nd model card for us in 2 weeks. Excited for ARC-AGI to be seen as a supported way to measure model performance. x.com/Alibaba_Qwen/s…

Greg Kamradt@GregKamradt · 19 h

"utm_source=openai" Has a URL parameter ever added more market value to a company than this?

Greg Kamradt@GregKamradt · 20 h

AGI is a threshold of capability. There will be as many variations as there are sorting algorithms.

Mustafa Suleyman@mustafasuleyman · 20 h

So strange how most people refer to "AGI" generically as one monolith, like they'll all be the same. There isn't going to be one AGI, or one type of AGI. There's going to be an infinite variety of flavors.

Greg Kamradt@GregKamradt · Jul 21

.@arcprize listed on the @Alibaba_Qwen model card. 2nd model card for us in 2 weeks. Excited for ARC-AGI to be seen as a supported way to measure model performance. x.com/Alibaba_Qwen/s…

Qwen@Alibaba_Qwen · Jul 21

Performance

Greg Kamradt@GregKamradt · Jul 21

We've had 3 leaders in the past 7 days for @arcprize. The top score prize ($50K pool) is heating up.

ARC Prize@arcprize · Jul 21

New ARC Prize 2025 High Score: 17.6% by Giotto.ai (@podesta_aldo)

Greg Kamradt@GregKamradt · Jul 20

This thread has a great intro on building agents for ARC-AGI-3. Competition open for 27 more days.

Alex Reibman 🖇️@AlexReibman · Jul 20

TL;DR: - We built an agent consistently capable of passing level 1 and partially completing level 2. - To do so, we had to assist the agent with some pre-computed values provided in its context (e.g. the door, the key color, the rules of the game, etc.) - The agent was…

Greg Kamradt@GregKamradt · Jul 20

Before the preview, we worked with the team to beta test ARC-AGI-3 using our own agents. The games are devilishly hard, and it took a lot of tricks to get them to work. Here are our key learnings (🧵):

ARC Prize@arcprize · Jul 18

Greg Kamradt@GregKamradt · Jul 20

My bar for robotics AGI (do anything a human can) is: get under my house and fix a pipe in the crawl space, then come up and make me sign an invoice.

roon@tszzl · Jul 20

my bar for agi is an ai that can learn to run a gas station for a year without a team of scientists collecting the Gas Station Dataset