Benjamin Feuer
@FeuerBenjamin
PhD Candidate in Computer Science, NYU, Deep Learning
If you are attending #ICML2025, check out our DataWorld workshop on Sat July 19. We have updated the website with more info on speakers & accepted papers! dataworldicml2025.github.io Also happy to chat offline about all things ✨ data ✨
New research paper for you to read over your July 4th break (if you're US-based) -- Vision is a skeleton key! 🗝️ We convert a small VLM into an "everything classifier" by transforming data into visualizations that VLMs can naturally understand and reason about. We call it…

Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-Distill-Qwen-7B by 33% on average over code, science, and math evals. We also release our dataset, OpenThoughts3-1.2M, which is the best open reasoning dataset across all data…
📣We are extending our deadline to May 31st!📣 Looking forward to seeing everyone's submissions :)
📢 Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains! 📅 Deadline: May 24, AoE 🔗 Website: dataworldicml2025.github.io We have an amazing lineup of speakers + panelists from various institutions and application areas.
Many agents (Claude Code, Codex CLI) interact with the terminal to do valuable tasks, but do they currently work well enough to deploy en masse? We’re excited to introduce Terminal-Bench: An evaluation environment and benchmark for AI agents on real-world terminal tasks. Tl;dr…
If AI isn’t truly open, it will fail us. We can’t close in a black box our greatest invention yet just so that a few can freely monetize. AI needs its Linux moment, and so we started working towards it. This can only succeed if we all work together! #oumi #opensource…
Oumi: build state-of-the-art foundation models, end-to-end. Oumi is a fully open-source platform designed to train, evaluate, and deploy foundation models end-to-end. It supports models from 10M to 405B parameters, enabling fine-tuning using LoRA, QLoRA, DPO, and other…
Oumi is a fully open-source platform to help you build state-of-the-art foundation models, end-to-end.
After careful consideration, I have decided to leave X for BlueSky. I hope to see many of you there with me very soon! @benjaminfeuer.bsky.social
🐞 Check out this zero-shot AI model dataset with 6M images of important species that is vital to farming and environmental research! Learn more: ow.ly/Kzig50TRSaC #ZeroShotLearning #AIModels #AgriculturalTech #EnvironmentalResearch @chegday @FeuerBenjamin @AII4RA
Micah's an incredible mentor and one of the most creative early-career AI/ML thinkers out there -- if you're looking for a potential PhD advisor, can't imagine a better choice!
📢I’ll be admitting multiple PhD students this winter to Columbia University 🏙️ in the most exciting city in the world! If you are interested in dissecting modern deep learning systems to probe how they work, advancing AI safety, or automating data science, apply to my group.