1,000 stars on TabbyAPI. Holy crap. Huge thanks to @turboderp_ and everyone who contributed!
TabbyAPI now supports ExLlamaV3 with automatic backend detection! 🎉 Please note that exl3 is still under active development, so mileage may vary compared to exl2. Thanks to @turboderp_ and all contributors for making this a reality.
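For the curious, one way automatic detection could work is by reading the quantization metadata in the model folder. This is a toy illustration only, not TabbyAPI's actual logic, and the quant_method field is an assumption about how exl2/exl3 quants tag themselves:

```python
# Toy illustration only: not TabbyAPI's real detection code.
# Assumes quants record a "quant_method" under quantization_config in the
# model's config.json, which may not hold for every model.
import json
from pathlib import Path

def detect_backend(model_dir: str) -> str:
    cfg = json.loads((Path(model_dir) / "config.json").read_text())
    method = cfg.get("quantization_config", {}).get("quant_method", "")
    if method == "exl3":
        return "exllamav3"
    return "exllamav2"  # fall back to exl2 for everything else

print(detect_backend("/models/Llama-3-8B-exl3"))  # placeholder path
```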
I have decided to tweet today. So here is a visualization of how the paged cache works with continuous batching in ExLlamaV3. I think it's neat. #🐈
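For anyone who can't see the animation, the rough idea in toy Python: the cache is split into fixed-size pages, each running sequence holds a list of pages, and new requests join the batch whenever free pages exist. Purely conceptual, not ExLlamaV3's actual data structures:

```python
# Conceptual sketch of a paged KV cache with continuous batching;
# names and page size are made up, not ExLlamaV3's real implementation.
PAGE_SIZE = 256                      # tokens per cache page

class PagedCache:
    def __init__(self, num_pages: int):
        self.free_pages = list(range(num_pages))
        self.page_table = {}         # seq_id -> list of page indices

    def can_admit(self) -> bool:     # continuous batching: admit new work when a page is free
        return len(self.free_pages) > 0

    def add_sequence(self, seq_id: int):
        self.page_table[seq_id] = [self.free_pages.pop()]

    def append_token(self, seq_id: int, pos: int):
        if pos > 0 and pos % PAGE_SIZE == 0:      # current page is full, grab another
            self.page_table[seq_id].append(self.free_pages.pop())

    def finish(self, seq_id: int):   # finished sequences return their pages immediately
        self.free_pages += self.page_table.pop(seq_id)

cache = PagedCache(num_pages=8)
cache.add_sequence(0)                # a new request can join mid-flight if pages are free
```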
Seems to still be true that larger models are less sensitive to quantization. Here is Mistral-Large 123B at 1.4 bits per weight, running on one 24 GB GPU. #AI or something
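The arithmetic is what makes this possible at all: 123B parameters at about 1.4 bits per weight is roughly 21.5 GB of weights, which squeezes under 24 GB with a little room left for the cache. Back-of-the-envelope only:

```python
# Back-of-the-envelope memory estimate for 123B parameters at 1.4 bits/weight.
params = 123e9
bits_per_weight = 1.4
weight_gb = params * bits_per_weight / 8 / 1e9
print(f"{weight_gb:.1f} GB of weights")   # ~21.5 GB, leaving a bit of headroom on a 24 GB card
```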
Supply chain alert! Don't use the ComfyUI Impact Pack right now! Its dependency ultralytics has been compromised on PyPI. Thanks to Shinon for letting me know on Discord. github.com/ultralytics/ul…
Fun with grounding in Qwen2-VL. Finding the things. #wherearethethings #exllamav2 #cat
TabbyAPI now supports vision. Thanks to @turboderp_ for exllamav2's updates and DocShotgun for the initial work. Any exl2-supported vision model works, but this release focuses on Pixtral from @MistralAI.
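Since TabbyAPI speaks the OpenAI-style chat API, sending an image is just a multimodal message. A rough sketch; the URL, key handling, model name and field names follow the OpenAI convention and are assumptions on my part, so check the TabbyAPI docs for specifics:

```python
# Rough sketch of an OpenAI-style multimodal request to a local TabbyAPI
# instance; URL, API key and model name are placeholders.
import requests

resp = requests.post(
    "http://localhost:5000/v1/chat/completions",
    headers={"Authorization": "Bearer your-api-key"},
    json={
        "model": "pixtral-exl2",
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": "What's in this picture?"},
                {"type": "image_url", "image_url": {"url": "https://example.com/cat.jpg"}},
            ],
        }],
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```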
1 year ago, I made TabbyAPI with @turboderp_ as a side project. Now, it's my most popular side project. I wanted to break away from the bloated nature of all-in-one local model backends and just run #exllama. Thanks to all the contributors and testers. github.com/theroyallab/ta…
I performed a successful vocabulary transplant on Qwen2-0.5B and turned it into a useful draft model for Llama-3. What a time to be alive. #hashtag huggingface.co/turboderp/Qwam…
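The gist of a vocabulary transplant, very roughly: keep the draft model's transformer, but rebuild its embedding (and output) layers around the target tokenizer, initializing each new row from the old embeddings of the pieces that spell the new token. A hand-wavy sketch, not the actual procedure used for the linked model; old_tok and new_tok are assumed to be Hugging Face style tokenizers:

```python
# Hand-wavy sketch of a vocabulary transplant: re-index a draft model's
# embeddings to the target tokenizer so draft and target share a vocabulary
# for speculative decoding. Not the actual script used for the linked model.
import torch

def transplant_embeddings(old_emb, old_tok, new_tok):
    hidden = old_emb.weight.shape[1]
    new_emb = torch.nn.Embedding(new_tok.vocab_size, hidden)
    for new_id in range(new_tok.vocab_size):
        text = new_tok.decode([new_id])
        old_ids = old_tok.encode(text, add_special_tokens=False)
        if old_ids:  # average the old embeddings of the pieces spelling this token
            new_emb.weight.data[new_id] = old_emb.weight.data[old_ids].mean(dim=0)
    return new_emb
```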
Llama-3-instruct becomes much more useful when you censor some of its catchphrases. #simplesolutions etc. 🤷
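By "censor" I mean banning the phrases at generation time and backing up when one appears, rather than filtering afterwards. A toy version of the idea; the phrase list and the rewind-and-resample loop are illustrative only, not exllamav2's actual banned-strings feature:

```python
# Toy illustration of censoring catchphrases: stream pieces of text, and if a
# banned phrase completes, rewind to where it began and resample from there.
# sample_piece() stands in for a real (stochastic) sampler.
BANNED = ["I hope this helps!", "It is important to note"]   # example phrases only

def generate(sample_piece, max_pieces=200):
    text = ""
    for _ in range(max_pieces):
        candidate = text + sample_piece(text)
        hit = next((b for b in BANNED if b in candidate), None)
        if hit:
            text = candidate[:candidate.index(hit)]   # rewind to just before the phrase
            continue                                  # and try a different continuation
        text = candidate
    return text
```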

New project: goody2.ai. GOODY-2 is an AI model that's so responsible it won't give a straight answer to anything.
🚀 TACO: a new benchmark for code generation from @BAAIBeijing with 26,443 problems.
• 🤖 English questions & Python solutions
• 🧠 Ideal for evaluating code generation from natural language
• 📊 Train: 25,443 samples, Test: 1,000 samples
• 📚 Diverse difficulty levels
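If you want to poke at it, the dataset appears to live on the Hugging Face Hub. A quick look, assuming the BAAI/TACO repo id; the split names and fields are assumptions, so check the dataset card:

```python
# Quick look at TACO via the datasets library; the "BAAI/TACO" repo id is an
# assumption (some datasets also need trust_remote_code=True to load).
from datasets import load_dataset

taco = load_dataset("BAAI/TACO", split="train")
print(len(taco))        # expected to be around 25,443 problems
print(taco[0].keys())   # inspect the question/solution fields
```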
I guess I should post something once in a while. So here's a whole chatbot in 26 lines of Python running Mixtral 8x7B real fast on one 3090. Idk, I think it's neat. 🐈
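Not the exact 26-line script, but the shape of it looks roughly like this with the exllamav2 Python API of that era; the model path, prompt format and sampler settings are placeholders:

```python
# Minimal chatbot sketch with exllamav2; paths and settings are placeholders,
# and this is a simplified approximation of the script in the tweet.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "/models/Mixtral-8x7B-instruct-exl2"   # placeholder path
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)
model.load_autosplit(cache)
tokenizer = ExLlamaV2Tokenizer(config)

generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)
settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.8

history = ""
while True:
    user = input("You: ")
    history += f"[INST] {user} [/INST]"           # Mixtral-instruct style prompt
    full = generator.generate_simple(history, settings, 400)
    reply = full[len(history):]                   # generate_simple returns prompt + completion
    history += reply
    print("Bot:" + reply)
```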