Julia Neagu
@JuliaANeagu
Building @QuotientAI ✨ formerly @GitHub @GitHubCopilot 🤖 reformed physicist 👩‍🔬 ~ opinions are my own ~
If you're shipping LLMs to production and still finding out about critical failures from your users, this course is for you. Real-time evals, automated detection, and the tools we use at @QuotientAI to keep AI grounded. On July 30th @jxnlco and I are laying it all out.
how do i catch hallucinations? come learn to implement monitoring systems that catch AI errors as they happen in live production environments with @JuliaANeagu and @QuotientAI. if you register, you'll be sent the recording and study notes after they're done!…
new model suite just dropped.
limbic-tool-use-0.5B → 88.6% accuracy
limbic-tool-use-3B → 94.6% accuracy
limbic-tool-use-7B → 96.2% accuracy
outperforms gpt-4.1 (74.0%) and claude-sonnet-4 (71.1%) on tool use evaluation.
today we're releasing a new small model (0.5B) for detecting problems with tool usage in agents, trained on 50M tokens from publicly available MCP server tools. it's great at picking up on tool accuracy issues and outperforms larger models.
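To make "detecting problems with tool usage" concrete, here is a minimal rule-based sketch of the kind of errors such a detector flags: an agent's proposed tool call checked against the tool's declared parameter schema. All names (`check_tool_call`, the `get_weather` tool) are illustrative assumptions, not Quotient's actual API; a learned model like the one announced above would catch subtler semantic errors beyond these structural checks.

```python
# Hypothetical baseline for tool-call error detection (names are illustrative).
# Validates an agent's proposed call against a declared parameter schema,
# the way MCP servers publish tool definitions.

def check_tool_call(call: dict, tools: dict) -> list[str]:
    """Return a list of detected problems; an empty list means the call looks ok."""
    problems = []
    name = call.get("name")
    if name not in tools:
        problems.append(f"unknown tool: {name!r}")
        return problems
    schema = tools[name]
    args = call.get("arguments", {})
    # Every required parameter must be present.
    for param in schema.get("required", []):
        if param not in args:
            problems.append(f"missing required argument: {param!r}")
    # Every supplied argument must be declared and well-typed.
    for param, value in args.items():
        spec = schema.get("properties", {}).get(param)
        if spec is None:
            problems.append(f"unexpected argument: {param!r}")
        elif spec["type"] == "string" and not isinstance(value, str):
            problems.append(f"argument {param!r} should be a string")
        elif spec["type"] == "number" and not isinstance(value, (int, float)):
            problems.append(f"argument {param!r} should be a number")
    return problems

# Example: a weather tool with one required string parameter.
TOOLS = {
    "get_weather": {
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    }
}

print(check_tool_call({"name": "get_weather", "arguments": {"city": "Berlin"}}, TOOLS))
# → []  (no problems detected)
print(check_tool_call({"name": "get_weather", "arguments": {"zip": 10115}}, TOOLS))
# → ["missing required argument: 'city'", "unexpected argument: 'zip'"]
```

In an eval or monitoring loop, a check like this runs on every tool call the agent emits, and any non-empty result gets logged or sampled for review.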
this is exactly what's possible right now: tiny, fast agents riding shotgun with your main stack. hyper-specialized to double-check tasks you absolutely can't get wrong. they catch and fix mistakes and keep your agents on track. lfg
This is the start of a neat direction for @QuotientAI. In offline evals and online sampling, you can use this to get easy insights into the health of your tool calling. I wonder if in the future something like this could even be used for quick tool corrections in the online app.
how you can catch hallucinations in production with @QuotientAI. sign up for study notes and recordings afterwards, even if you can't attend live: maven.com/p/285276/how-y…
But who's that sexy voice? I finally got around to using @elevenlabsio for our demos!
Just dropped: three new cookbooks for building AI research agents with @ExaAILabs, @LangChainAI, @OpenAI, and @AnthropicAI, now with built-in monitoring from @QuotientAI. Track search relevance. Catch hallucinations. Debug real-world agents as they run.
systematically improving rag sessions for the rest of the summer
1. rethinking rag with @Sourcegraph
2. how to catch hallucinations with @QuotientAI
3. lessons from building verticalized agents
4. billion scale vector search w/ @turbopuffer
links all below!
In light of all the attention that context engineering is getting, today I proudly introduce the second book that Albert Ziegler and I have written together: Context Engineering for LLM Applications.