interstellarninja

@intrstllrninja

growing artificial societies | by the open-source AGI, for the people | building @MarketAgentsAI | github: https://github.com/marketagents-ai/MarketAgents

Tesseract

Joined December 2010

672 Following

2K Followers

Pinned

interstellarninja@intrstllrninja · Jul 19, 2024

this interstellarninja is on covert missions right now involving power struggles with closed source AI labs and regulatory bodies plotting against open source AI 🥷

Nippon.com@nippon_en · Jul 16, 2024

Japan’s ninja are famed for their covert activities over centuries of power struggles in the country, and were highly prized by Tokugawa Ieyasu. nippon.com/en/japan-topic…

interstellarninja@intrstllrninja · Jul 23

There's now support for viewing JSON in string / dict columns in @huggingface datasets!!! 🔍 Great for all the tool calling datasets like the brand new hermes tool use dataset by @intrstllrninja

Caleb@calebfahlgren · Jul 11

NEW 🔥!! You can now view JSON for List cells on @huggingface datasets. Now there's no excuse for not looking at your data! 🫣

interstellarninja@intrstllrninja · Jul 23

now you can view prettified json, such as the tools list from a string column, on huggingface. thanks to @calebfahlgren from the @huggingface datasets team for implementing this feature in addition to the conversation json view 🙌

interstellarninja@intrstllrninja · Jul 22

today i'm releasing a 50k-row tool-use reasoning dataset compilation on huggingface. it includes the following BFCL scenarios:
- single turn tool-use
- multiturn tool-use
- multistep tool-use
- relevance reasoning

huggingface.co/datasets/inter…

interstellarninja@intrstllrninja · Jul 23

both the hermes-3 dataset and my new hermes tool-use reasoning dataset are among the top 10 trending datasets on huggingface

clem 🤗@ClementDelangue · Jul 19

Now number one trending dataset on @huggingface, out of almost half a million! huggingface.co/datasets

interstellarninja@intrstllrninja · Jul 22

good to see some details on kimi's tool-use data synthesis, similar to the hermes function calling datagen pipeline

Teknium (e/λ)@Teknium1 · Jul 21

Kimi put out their paper :) github.com/MoonshotAI/Kim…

interstellarninja@intrstllrninja · Jul 20

Congrats to our post-training team who worked on the Hermes 3 dataset - @Teknium1, @nullvaluetensor, and outside contributor @intrstllrninja - on creating the now #1 Trending dataset on HuggingFace!

clem 🤗@ClementDelangue · Jul 19

Now number one trending dataset on @huggingface, out of almost half a million! huggingface.co/datasets

interstellarninja Retweeted

Mckay Wrigley@mckaywrigley · Jul 18

I’m not against crypto at all either! Agents + programmatic money is a match made in heaven. This one is just not me haha. I’m only here to have fun with my mac mini and build a fun experience for people :)

interstellarninja Retweeted

elvis@omarsar0 · Jul 17

Agent Leaderboard v2 is here!
> GPT-4.1 leads
> Gemini-2.5-flash excels at tool selection
> Kimi K2 is the top open-source model
> Grok 4 falls short
> Reasoning models lag behind
> No single model dominates all domains

More below:

interstellarninja@intrstllrninja · Jul 18

personalized ai w/ memory is better than vanilla sota

Yohei@yoheinakajima · Jul 18

in ai, memory is a moat. with social, relevant network size correlated with value for the user (network is a moat). with ai, every relevant memory extracted from user interactions increases the product value for the user. true or false?

interstellarninja Retweeted

Chamath Palihapitiya@chamath · Jul 16

This chart is even more interesting when you reflect the capex that it has taken to generate these results.

interstellarninja@intrstllrninja · Jul 15

In case the post was too vague, yes - this is the Hermes 3 dataset
- 1 Million Samples
- Created SOTA without the censorship at its time on Llama-3 series (8, 70, and 405B)
- Has a ton of data for teaching system prompt adherence, roleplay, and a great mix of subjective and…

Nous Research@NousResearch · Jul 15

huggingface.co/datasets/NousR…

interstellarninja Retweeted

Kimi.ai@Kimi_Moonshot · Jul 15

We've just fixed 2 bugs in Kimi-K2-Instruct huggingface repo. Please update the following files to apply the fix:
- tokenizer_config.json: update chat-template so that it works for multi-turn tool calls.
- tokenization_kimi.py: update encode method to enable encoding special…

interstellarninja@intrstllrninja · Jul 15

kimi k2 uses chatml-like tool calling tokens instead of xml tags; it uses separate tokens for the tool call section, the tool call, and the arguments

vLLM@vllm_project · Jul 11

@Kimi_Moonshot just released a trillion-parameter model with great agentic capability, and it is already supported in vLLM! Have a try with a simple command, and check the doc for more advanced deployment🚀

interstellarninja Retweeted

Sebastian Raschka@rasbt · Jul 12

Kimi K2 is basically DeepSeek V3 but with fewer heads and more experts:

interstellarninja Retweeted

Pietro Schirano@skirano · Jul 13

Kimi K2 is so good at tool calling and agentic loops, can call multiple tools in parallel and reliably, and knows "when to stop", which is another important property. It's the first model I feel comfortable using in production since Claude 3.5 Sonnet.

interstellarninja Retweeted

Andrej Karpathy@karpathy · Jul 13

Scaling up RL is all the rage right now, I had a chat with a friend about it yesterday. I'm fairly certain RL will continue to yield more intermediate gains, but I also don't expect it to be the full story. RL is basically "hey this happened to go well (/poorly), let me slightly…
