elvis

@omarsar0

Building with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with LLMs & AI Agents ⬇️

Joined September 2015

636 Following

256K Followers

Pinned

elvis @omarsar0 · 11 h

AI Agents Evaluation

Evaluation is key to developing reliable and scalable agentic systems. Really enjoyed this conversation with @ptkbhv on *everything* related to AI agent evaluation. One of the many deep dives we have done at the @dair_ai academy. Feel free to share with…

145

26.0K

elvis @omarsar0 · 3 h

Thoughts after coding relentlessly with tools like Replit Agent and Claude Code for the past few days: Current coding models are more impressive than we all think. Clever memory management, search, and context engineering are what’s falling short. Tool calling also needs more…

4.0K

elvis Retweeted

elvis @omarsar0 · 10 h

A Structural Planning Framework for LLM Agent System in Enterprise

Agentic systems for enterprise are a work in progress. Reliability is a real problem. It's no secret that planning works, but structural planning can further improve the reliability of AI agents. My notes:

100

133

9.0K