secemp
@secemp9
🧠 researcher • 💻 swe • 💾 data scraper • 🌌 universe's jester • Author of TraceBack
interesting behavior during their failure modes though: at peak hours Claude becomes lazy, while Gemini despairs
you know claude is currently being "optimized" for peak hours when you see this being OK (here replacing the main goal with a simple placeholder)
could probably make a simple eval that runs every N hours, every day. the one pattern I'm seeing: it does this the most when it encounters any level of difficulty, within the first 2-3 turns at most
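The eval idea above could be sketched roughly like this: call the model on a fixed prompt on a schedule and flag responses where it swapped the real task for a stub. Everything here is an assumption for illustration; `ask_model` is a hypothetical callable, and the placeholder patterns are just a plausible starting list, not anything from the thread.

```python
import re
import time

# Assumed heuristic patterns for "replaced the main goal with a placeholder".
# These are illustrative guesses, not a validated detector.
PLACEHOLDER_PATTERNS = [
    r"#\s*(TODO|FIXME|placeholder)",
    r"raise\s+NotImplementedError",
    r"\.\.\.\s*$",  # bare ellipsis body
]

def looks_like_placeholder(response: str) -> bool:
    """Return True if the model's response appears to stub out the task."""
    return any(re.search(p, response, re.IGNORECASE | re.MULTILINE)
               for p in PLACEHOLDER_PATTERNS)

def run_periodic_eval(ask_model, prompt: str, every_hours: float, n_runs: int):
    """Query the (hypothetical) model every `every_hours` hours, recording
    (unix timestamp, placeholder?) pairs so laziness can be plotted
    against time of day."""
    results = []
    for _ in range(n_runs):
        results.append((time.time(), looks_like_placeholder(ask_model(prompt))))
        time.sleep(every_hours * 3600)
    return results
```

Plotting the boolean against wall-clock time over several days would show whether the "lazy at peak hours" pattern is real or anecdotal.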
I get it, providers; honestly I would quantize the KV cache too during peak hours (or whatever other performance-hitting tricks they're using), but the model still needs to be usable. you could just RL it so that it still works great under those kinds of optimizations, right?

yeah, this has been known for a while. many VLMs support image prompting (directly or otherwise)
what are the odds that this was made 8 minutes ago... when just last night my friends and I were talking about KV caching on official model providers...

I said similar things to what's mentioned in that paper a while ago, nice
🚨 The era of infinite internet data is ending, So we ask: 👉 What’s the right generative modelling objective when data—not compute—is the bottleneck? TL;DR: ▶️Compute-constrained? Train Autoregressive models ▶️Data-constrained? Train Diffusion models Get ready for 🤿 1/n