common prosperity
Wake up honey, new RWKV paper just dropped 🧵⤵️
Paper: arxiv.org/abs/2404.05892
Code: github.com/BlinkDL/RWKV-LM
Models: huggingface.co/RWKV (Apache 2.0 license)
(1/6)

Tough day for the Chicago boys
No way
Did you know “Kimi K2: Open Agentic Intelligence” has the abbreviation….🤫
"running on my very limited GPU access at FAIR (Meta)" 😭😭😭
No matter how AI evolves overnight (tech, career, how it may impact me), I remain committed to using the "physics of language models" approach to predict next-gen AI. Due to my limited GPU access at Meta, Part 4.1 (+ a new 4.2) is still in progress, but the results on Canon layers are shining.
🧵On Baselines in LLM Architecture Research, a Tale of DeltaNet and RWKV-7 (1) (full essay at github.com/BlinkDL/zoology)
RWKV7-G1 "GooseOne" 🪿 2.9B release: a pure RNN (attention-free) reasoning model trained on +5.2T tokens, comparable with Qwen2.5 3B / Llama3.2 3B, and fully multilingual. Chat demo & weights on RWKV.com. 7B training in progress.
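The point of "pure RNN (attention-free)" is that decoding carries a fixed-size recurrent state instead of a growing KV cache. Below is a minimal toy sketch of that property, assuming a generic tanh recurrence rather than RWKV-7's actual update rule; all names are illustrative.

```python
# Toy sketch (NOT RWKV-7's actual update rule): why an attention-free RNN
# decodes with a fixed-size state instead of a growing KV cache.
import numpy as np

def toy_rnn_step(state, x, W_state, W_in):
    """One recurrent step: the new state depends only on the previous state
    and the current token embedding, so memory stays constant per token."""
    return np.tanh(state @ W_state + x @ W_in)

d = 8
rng = np.random.default_rng(0)
W_state = rng.normal(size=(d, d)) * 0.1
W_in = rng.normal(size=(d, d)) * 0.1

state = np.zeros((1, d))
for token_embedding in rng.normal(size=(16, 1, d)):  # a 16-token sequence
    state = toy_rnn_step(state, token_embedding, W_state, W_in)
print(state.shape)  # (1, 8): one fixed-size state carried forward, no per-token cache
```

However long the sequence grows, the only thing carried between tokens is that single d-dimensional state vector.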
So real, and so sad. Reasonable prevention is the best individual action while we work to remove this burden.
Yes. Additionally, you're considered creepy if you find this endless carnage objectionable. Normie women will get the ick, e/accs will say your job is to burn out faster like good fuel, safetyists will call you egoistic, theists will peddle cope. Onto the conveyor belt, meat.
AGI will be built on the concept of equivalence classes. You will not build an AGI until you figure out how to approximate the partition functions of energy-based models. A partition function is fundamentally a sum over equivalence classes.
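A minimal numeric illustration of that last claim, using a toy 4-spin energy-based model (the energy function here is invented for the example): the partition function computed state by state equals the same sum regrouped over equivalence classes of states that share an energy level.

```python
# Minimal sketch: the partition function of a toy energy-based model,
# computed (a) as a sum over all states and (b) as a sum over equivalence
# classes of states that share the same energy, weighted by class size.
import itertools
import math
from collections import Counter

def energy(state):
    # Toy energy: number of "up" spins in a 4-spin configuration.
    return sum(state)

states = list(itertools.product([0, 1], repeat=4))

# (a) brute-force sum over all 16 states
Z_states = sum(math.exp(-energy(s)) for s in states)

# (b) group states into equivalence classes by energy level:
# each class contributes (class size) * exp(-E)
class_sizes = Counter(energy(s) for s in states)
Z_classes = sum(n * math.exp(-e) for e, n in class_sizes.items())

print(Z_states, Z_classes)  # identical up to float rounding
```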
Ok real talk. Say you have 2 weeks to live. Normal people would spend time with family, but you have no life, so you decide to burn GPUs instead. King Jensen gave you all the IB cluster you need. The Angels of Data give you a magical S3 bucket that has any data you can imagine.…
Unfortunately, bitnet-b1.58-2B-4T is not looking good either 🙃 Please test your model before release: huggingface.co/spaces/Jellyfi…
The "Uncheatable Eval" is good at detecting model quality. For example, our community noticed a 1.58-bit 500MB-param model "Bonsai" on HF with decent evals; it turned out to be evalmaxxing 🙃 Please test your model before release.
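For anyone who wants the spirit of such a check, here is a hedged sketch (not the actual Uncheatable Eval code) of scoring a model by bits per byte on freshly published text it could not have trained on; `gpt2` and the text string are placeholders.

```python
# Hedged sketch of a compression-style sanity check (not the actual
# Uncheatable Eval code): score a model by bits per byte on fresh text
# it cannot have seen during training. Model name and text are placeholders.
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; swap in the model you want to check
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

fresh_text = "Paste recently published text here (new arXiv abstract, news, etc.)."
enc = tok(fresh_text, return_tensors="pt")

with torch.no_grad():
    out = model(**enc, labels=enc["input_ids"])

# out.loss is the mean cross-entropy (in nats) over the predicted positions;
# convert the total to bits and normalize by the UTF-8 byte count.
n_predicted = enc["input_ids"].numel() - 1
n_bytes = len(fresh_text.encode("utf-8"))
bits_per_byte = out.loss.item() * n_predicted / math.log(2) / n_bytes
print(f"{bits_per_byte:.3f} bits/byte (lower is better)")
```

Fresh data is the point: recently published text cannot be in the training set, so the score is hard to evalmaxx.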
A Dostoyevsky line has never hit me so hard
Don't be rude. Every imposition devoid of grace is doomed to collapse.
Cost profit margin 545%
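Assuming the usual profit-over-cost definition (the tweet doesn't spell it out), the arithmetic behind that headline number:

```latex
\text{cost profit margin} \;=\; \frac{\text{revenue} - \text{cost}}{\text{cost}} \;=\; 545\%
\quad\Longrightarrow\quad \text{revenue} \;\approx\; 6.45 \times \text{cost}
```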
🚀 Day 6 of #OpenSourceWeek: One More Thing – DeepSeek-V3/R1 Inference System Overview
Optimized throughput and latency via:
🔧 Cross-node EP-powered batch scaling
🔄 Computation-communication overlap
⚖️ Load balancing
Statistics of DeepSeek's Online Service:
⚡ 73.7k/14.8k…
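Of those three ingredients, computation-communication overlap is the easiest to illustrate in isolation. Here is a generic sketch (not DeepSeek's system; both `fake_*` functions are stand-ins) of prefetching the next batch's communication while computing on the current one, so wall-clock time approaches max(compute, comm) instead of their sum.

```python
# Generic sketch of computation-communication overlap (not DeepSeek's code):
# launch the "communication" for batch i+1 while computing on batch i.
import concurrent.futures
import time

def fake_comm(batch_id):
    time.sleep(0.05)          # stands in for a cross-node all-to-all
    return f"activations_{batch_id}"

def fake_compute(data):
    time.sleep(0.05)          # stands in for expert FFN compute
    return f"output_from_{data}"

start = time.time()
with concurrent.futures.ThreadPoolExecutor(max_workers=1) as pool:
    next_data = pool.submit(fake_comm, 0)              # prefetch batch 0
    for i in range(8):
        data = next_data.result()                      # wait for comm of batch i
        if i + 1 < 8:
            next_data = pool.submit(fake_comm, i + 1)  # overlap comm of batch i+1
        fake_compute(data)                             # ... with compute of batch i
print(f"elapsed ≈ {time.time() - start:.2f}s (vs ~0.80s fully serial)")
```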