smly
@smly
AI Fellow & Senior Manager at Rist. ex-PFN / 4x Kaggle Grandmaster https://kaggle.com/confirm / @GoogleDevExpert (Kaggle) / Mahjong AI http://mjai.app
🚀 Kaggle Benchmarks is here! Get competition-grade rigor for AI model evaluation. Let Kaggle handle infrastructure while you focus on AI breakthroughs. View model performance on 70+ leaderboards, including @AIatMeta's MultiLoKo. Dive in: kaggle.com/blog/announcin…
Today we’re releasing our first public preview of ARC-AGI-3: the first three games. Version 3 is a big upgrade over v1 and v2 which are designed to challenge pure deep learning and static reasoning. In contrast, v3 challenges interactive reasoning (eg. agents). The full version…
Humans and AI compete at “AtCoder World Tour Finals 2025” The “AtCoder World Tour Finals 2025” #AWTF2025, hosted by AtCoder (@atcoder), was held on July 16-17. With OpenAI as a sponsor, this year’s event marked a first for the AWTF, with AI participating in the competition. An…
we're competing in the @atcoder World Finals programming contest. real nailbiter — OpenAI has been #1 for most of the contest. looked like it might be over when @FakePsyho pulled ahead, but we've just retaken the lead. 1 hour and 20 minutes to go!
【AWTF2025】 The finals begin tomorrow! Both days will be streamed live! Twelve top finalists will compete in each of the Heuristic and Algorithm divisions. Don't miss the intense battles live from Shibuya! ▼ July 16 – Heuristic Division 📺 youtube.com/live/TG3ChQH61…
Fantastic to see OpenAI agents competing against humans in real-time in the AWTF competition! Love to see self-proclaimed Kaggle GM-level agents step up to similar, truly "contamination-free" challenges and compete directly. youtube.com/watch?v=TG3ChQ…
📣 ICML 2025! Join us today and contribute to the ICML AI Expert Benchmark. Stop by Kaggle Booth #121 to see community-driven evaluation in action & contribute to novel GenAI tasks. Maximize your lunch break, deepen your ML expertise! #ICML2025
The Kaggle IMC retrospective meetup has started. What a great start with a surprise message from @ducha_aiki !! So grateful for his heartwarming words! 🙌 #IMC振り返り会
今日は IMC’25 振り返り会で発表します。今こそ COLMAP がアツいという話をします。turing.connpass.com/event/360266/
Me and a gaggle of Kagglers will be @ #ICML2025 next week! We have some stuff I'm seriously excited to show you if you're there. Say hi! kaggle.com/blog/kaggle-at…
Google Developer Expertに就任しました! Happy to share that I'm officially a Google Developer Expert in @kaggle! Big thanks to @chauhan_nilay16. Excited to keep boosting the Kaggle community together with @smly, @upura0, @helloiamleonie, and all the amazing folks.
Claude Code わかってきた。思考が必要な仕事を任せてしまうとズルして時間が無駄になる。頭からっぽでも可能なことは良い仕事をする。バランスに慣れが必要
Assuming steady AI progress, the design goal for ARC v2 was to endure 12-18 months. Goal for v3 is >3 years. (Note this July 17 event is a preview launch of the first v3 interactive reasoning games to get feedback. We plan to ship full v3 dataset early 2026).
ARC-AGI 3 was just announced, the AI benchmark by Francois Chollet. This came unexpected because ARC-AGI 2 is very far from saturation at this point. @arcprize
三麻用 libriichi3p がコード非公開だったので自分で実装して100万件の牌譜でvalidationした。配布バイナリと異なりaction spaceや特徴量エンコードで2m-8mを除いた別物。満足