RELAI

@ReliableAI

Optimizer of AI Agents on Your Data.

Joined November 2021

0Following

161Followers

Pinned

RELAI@ReliableAI · May 1

Our legal reasoning benchmark is live! Check it out! Let us know if you want to add your model or data to our leaderboard: calendly.com/d/crx2-k7b-pcm

SSoheil Feizi@FeiziSoheil · May 1

🚨 Releasing the SCOTUS 2024 Legal Scenarios Benchmark 🚨 We’re excited to launch a new benchmark with 200+ realistic legal dilemmas from 2024 Supreme Court slip opinions—built using RELAI Data Agents. We tested top LLMs on legal reasoning: 🥇 o4-mini — 76.4% @OpenAI @sama…

215

RELAI@ReliableAI · Apr 22

Our leaderboard is now live! Check it out at: relai.ai Let us know if you want to add your model or data to our leaderboard: calendly.com/d/crx2-k7b-pcm

SSoheil Feizi@FeiziSoheil · Apr 22

🚀 RELAI Leaderboard is now live at relai.ai 🔍 Models: Currently our leaderboard shows performances for: o4-mini and GPT-4o @OpenAI @sama, Grok 2 @xai @elonmusk @ChrSzegedy, Gemini 2.0 Flash @Gemini @JeffDean, and Llama 3.3 70B @AIatMeta 📊 Benchmarks: we use…

225

RELAI@ReliableAI · Apr 22

Vote! Quantitative results on RELAI leaderboard will be released soon! ⌛

SSoheil Feizi@FeiziSoheil · Apr 22

🤖 In your experience, LLMs struggle the most with understanding and generating code for:

914

RELAI@ReliableAI · Apr 21

🚀 Meet our Data Agents! 📅 Want your own custom benchmarks? Book a demo here: calendly.com/d/crx2-k7b-pcm

SSoheil Feizi@FeiziSoheil · Apr 21

🚀 Introducing Data Agents— generate accurate, reasoning-based AI benchmarks from your own data in minutes! ⚡ With Data Agents, we’ve created 100+ benchmarks with 100K+ samples using docs from tools like React, PyTorch, Kubernetes, LangChain, and more. 📂 All benchmarks are…

248

RELAI@ReliableAI · Mar 28

We're hiring a backend developer! Apply here: forms.gle/AWVQZd59TNLtED…

SSoheil Feizi@FeiziSoheil · Mar 28

🚀We're Hiring: Backend Developer at RELAI (relai.ai)!🚀 If you're excited about building reliable AI systems and working on impactful backend projects, we'd love to hear from you! Apply here: forms.gle/AWVQZd59TNLtED…

564

RELAI@ReliableAI · Jan 15

Congratulations to RELAI's founder & CEO for receiving the #PECASE award!

UUMD Department of Computer Science@umdcs · Jan 15

🎉🎉👏 Congrats to Associate Professor Soheil Feizi (@FeiziSoheil) on being honored by Pres. Biden (@POTUS) with the Presidential Early Career Award for Scientists and Engineers (PECASE)—the nation’s highest award for early-career researchers! Read more: go.umd.edu/Feizi-PECASE

805

RELAI Retweeted

Soheil Feizi@FeiziSoheil · Dec 19

Honored to have contributed to the House Bipartisan Task Force on Artificial Intelligence's report. Learn more about it here: science.house.gov/press-releases… Access the full report: speaker.gov/wp-content/upl… I hope this effort contributes to advancing AI that is reliable, safe, and…

2.0K

RELAI@ReliableAI · Nov 21

We are hiring! Join us to make AI reliability achievable and accessible for everyone! Details below:

SSoheil Feizi@FeiziSoheil · Nov 21

Please spread the word! 🚀 Join RELAI (relai.ai) to work on cutting-edge AI projects! We have full-time and internship positions available in: - AI/ML Research & Development: lnkd.in/dEbyQz-U - Software Developers: lnkd.in/daa2jhRy - Sales…

477

RELAI@ReliableAI · Nov 15

We evaluated several hallucination detection methods on OpenAI's recently released SimpleQA benchmark. RELAI agents detected over 76% of GPT-4o's hallucinations with just a 5% false positive rate. Even more impressively, RELAI detected nearly 1/3 of GPT-4o's hallucinations with…

RRELAI@ReliableAI · Nov 15

x.com/i/article/1856…

1.0K

RELAI@ReliableAI · Nov 15

x.com/i/article/1856…

8.0K

RELAI@ReliableAI · Oct 31

👻 No tricks, just AI reliability! 🎃 See our hallucination detection agents in action this Halloween! Get your own access (no costume required): platform.relai.ai/auth/waitlist #HappyHalloween #AI

ReliableAI's tweet image. 👻 No tricks, just AI reliability! 🎃 See our hallucination detection agents in action this Halloween!

Get your own access (no costume required): platform.relai.ai/auth/waitlist

#HappyHalloween #AI

946

RELAI@ReliableAI · Oct 31

Our agents are working 24/7, and more registration codes are rolling out! Try out our hallucination detection agents for free now! Request your registration code here: platform.relai.ai/auth/waitlist

1.0K

RELAI@ReliableAI · Oct 29

Hi AI world ✋ We are committed to make AI reliability accessible and achievable for everyone! First up: RELAI agents for hallucination detection Chat with popular LLMs, Verify with RELAI. Try it for free today: relai.ai #AI #LLM #ReliableAI #Hallucination

ReliableAI's tweet card. Rely on RELAI agents for your AI reliability needs, from model evaluation and debugging to leveraging state-of-the-art system-level and user-facing safeguards.

36.0K