elvis
@omarsar0
Building with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with LLMs & AI Agents ⬇️
AI Agents Evaluation Evaluation is key to developing reliable and scalable agentic systems. Really enjoyed this conversation with @ptkbhv on *everything* related to AI agent evaluation. One of the many deep dives we have done at the @dair_ai academy. Feel free to share with…
AI Agents Evaluation Evaluation is key to developing reliable and scalable agentic systems. Really enjoyed this conversation with @ptkbhv on *everything* related to AI agent evaluation. One of the many deep dives we have done at the @dair_ai academy. Feel free to share with…
Thoughts after coding relentlessly with tools like Replit Agent and Claude Code for the past few days: Current coding models are more impressive that we all think. Clever memory management, search, and context engineering is what’s falling short. Tool calling also needs more…
A Structural Planning Framework for LLM Agent System in Enterprise Agentic systems for enterprise are a work in progress. Reliability is a real problem. No secret that planning works, but structural planning can further help improve the reliability of AI agents. My notes: