Yasir
@0xyaza
Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀 📄 huggingface.co/papers/2507.18…
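For context, GSPO's key move (as I read the paper) is doing the importance weighting and clipping at the sequence level rather than per token. A minimal PyTorch sketch of the clipped objective; the function name and tensor layout are mine, not the paper's:

```python
import torch

def gspo_loss(logp_new, logp_old, advantages, seq_lens, eps=0.2):
    """Clipped surrogate with a sequence-level importance ratio.

    logp_new / logp_old: summed token log-probs of each sampled response, shape [G].
    advantages: group-normalized advantages, shape [G].
    seq_lens: response lengths |y_i| as a float tensor, for length normalization.
    """
    # s_i = (pi_theta(y_i | x) / pi_old(y_i | x)) ** (1 / |y_i|)
    ratio = torch.exp((logp_new - logp_old) / seq_lens)
    clipped = torch.clamp(ratio, 1.0 - eps, 1.0 + eps)
    # PPO-style pessimistic objective, averaged over the group of G responses
    return -torch.mean(torch.minimum(ratio * advantages, clipped * advantages))
```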
Spot on.
This week we got:
*Qwen3-235B-A22B-Instruct-2507
*Qwen3-235B-A22B-Thinking-2507
*Qwen3-Coder-480B-A35B-Instruct
All open weights and Apache 2.0 licensed. I feel this marks the point where we can now have hugely effective models running locally. Someone needs to make consumer…
I swear if @AMD just makes a desktop SoC with 256GB or 512GB of unified RAM, they'd own the personal computing market lol.
Qwen on a ROLL! Thinking model that beats Gemini 2.5 Pro, o4-mini AND DeepSeek R1 too 🔥
Testing Qwen3-coder with the new 480B-instruct model on @hyperbolic_labs and it's been 🤌🤌🤌 so far.
I love when Claude tells me something will take 6 weeks. No dude, we are doing it this afternoon.
dspy now works great in the browser. try the cool new dspy notebook with built-in, local, in-browser models. @DSPyOSS @lateinteraction you folks will like this.
Very straightforward use of @DSPyOSS. Very nice example @MaximeRivest
My tutorial on: How to build an Automatically Branching Chat with DSPy is out! With DSPy, I am able to easily rein in the chaos of LLM outputs and actually use LLM generation in my coding logic. No manual parsing or fiddling with strings required. If you follow along and run…
I think @kirodotdev likely has the late entrant advantage. They've identified all the issues we've experienced using agentic coding, and nailed the user flow. IMHO @TaskmasterAI is the OG that influenced controlling the erratic behavior of all coding tools, but I have to say…
the weather is nice and all, but have you configured your @zeddotdev ide to use @Kimi_Moonshot Kimi-K2 via @GroqInc yet?? The speed is ridiculous.
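For anyone asking how: this is roughly the shape of the settings.json, assuming Zed's OpenAI-compatible provider config pointed at Groq's endpoint (check Zed's docs for the exact schema; the model id is Groq's):

```json
{
  "language_models": {
    "openai": {
      "api_url": "https://api.groq.com/openai/v1",
      "available_models": [
        {
          "name": "moonshotai/kimi-k2-instruct",
          "display_name": "Kimi K2 (Groq)",
          "max_tokens": 131072
        }
      ]
    }
  }
}
```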
If there is one thing I've been told over and over again, but did not understand until it happened many times over, it's this: any and all opportunities are time-bound. There is a point past which the ROI on taking that opportunity is no longer worth it. Courage and speed are the defining…
When @GroqInc's inference speed meets their deployment speed... this is awesome, and I'm already using it heavily!
the colossal giant is here. @kimi_moonshot's kimi k2 with 1t parameters is now on @groqinc for instant tool calling for your coding agents. full context available for all. full speed ahead. 🫡
anyone else want kimi on groq or
📢 If you’re at #SIGIR2025 this week, make sure to be at Luca Scheerer’s paper talk: “WARP: An Efficient Engine for Multi-Vector Retrieval” (Wednesday 11am)
WARP makes PLAID, the famous ludicrously fast ColBERT engine, another 3x faster on CPUs. With the usual ColBERT quality!
This prompt injection screenshot is circulating. From an abstraction standpoint, it's another argument for Signatures. Signatures separate the fixed task spec (instructions + I/O schema) from the variable input data, and assign a semantic role for each input. That's in…
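Concretely, a minimal DSPy sketch of the idea (the Summarize signature is a made-up example): the instructions live in the signature, and untrusted text only ever enters as a typed input field.

```python
import dspy

class Summarize(dspy.Signature):
    """Summarize the document in one sentence."""  # fixed task spec (trusted)

    document: str = dspy.InputField(desc="untrusted source text")  # variable data
    summary: str = dspy.OutputField()

summarize = dspy.Predict(Summarize)
# The page contents are passed as data with a declared semantic role,
# never concatenated into the instruction string itself.
result = summarize(document="...scraped page contents here...")
```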
The race for LLM "cognitive core" - a few billion param model that maximally sacrifices encyclopedic knowledge for capability. It lives always-on and by default on every computer as the kernel of LLM personal computing. Its features are slowly crystallizing:
- Natively multimodal…
I’m so excited to announce Gemma 3n is here! 🎉
🔊 Multimodal (text/audio/image/video) understanding
🤯 Runs with as little as 2GB of RAM
🏆 First model under 10B with @lmarena_ai score of 1300+
Available now on @huggingface, @kaggle, llama.cpp, ai.dev, and more
Bonus if you use @GroqInc with any of the optimization techniques (miprov2 etc) - the speedup gain is an easy 10x
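If you want to try it, wiring a Groq-served model into DSPy is one line (the model id here is an assumption; any Groq-hosted model routes the same way through LiteLLM):

```python
import dspy

# Optimizers like MIPROv2 make many LLM calls, so fast inference compounds.
dspy.configure(lm=dspy.LM("groq/moonshotai/kimi-k2-instruct"))
```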
Not enough people know this: Every DSPy optimizer has ALWAYS natively allowed you to tune any complex program, with as many LLM calls and whatever structure you want. Multi-turn agents? Multi-module compound AI systems? Just use MIPRO or GRPO for prompt opt or RL the whole system!
how do you optimize a multi-prompt pipeline (deep-research style) with DSPy?
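One way, sketched under assumptions (my_search, report_quality, and trainset are hypothetical stand-ins): write the pipeline as one dspy.Module with a sub-module per prompt, then compile the whole thing; the optimizer tunes every module's prompt against a single end-to-end metric.

```python
import dspy

class DeepResearch(dspy.Module):
    def __init__(self):
        super().__init__()
        # One sub-module per prompt in the pipeline
        self.plan = dspy.ChainOfThought("question -> search_queries: list[str]")
        self.synthesize = dspy.ChainOfThought("question, notes -> report")

    def forward(self, question):
        queries = self.plan(question=question).search_queries
        notes = "\n".join(my_search(q) for q in queries)  # hypothetical retriever
        return self.synthesize(question=question, notes=notes)

# MIPROv2 jointly proposes instructions/demos for BOTH prompts,
# scored by one end-to-end metric over the final report.
optimizer = dspy.MIPROv2(metric=report_quality, auto="light")
optimized = optimizer.compile(DeepResearch(), trainset=trainset)
```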