👋 Jan

@jandotai

Jan is an open source ChatGPT-alternative that runs 100% offline. Built by @menloresearch. Community: https://discord.gg/TE5wMUa7b6

your device

Joined October 2023

950 Following

7K Followers

Pinned

👋 Jan@jandotai · Jul 3

hey Jan, do something for me 🪄

6.0K

👋 Jan@jandotai · 15 h

Qwen releases Qwen3-Coder, an open agentic coder that rivals Claude Sonnet 4.
- 480B total params, 35B active (MoE)
- 256K native context, extendable to 1M with YaRN
- Excels at agentic coding and browser-based tool use

Qwen@Alibaba_Qwen · 22 h

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

1.0K

👋 Jan Retweeted

Menlo Research@menloresearch · Jul 22

Introducing Lucy: a 1.7B model that Googles for you. It's an agentic‑search model that can even run on your phone.
- Agentic search on tap: Lucy calls tools (<think></think>‑aware)
- Fits in your pocket: runs on CPU or mobile
Under the hood:
- Built on @Alibaba_Qwen's Qwen3‑1.7B…

340

302

17.0K

👋 Jan@jandotai · Jul 22

This is really interesting. Someone built an AI Airport Simulation to test how LLMs handle real-time, high-stakes decision making like fuel emergencies, collision avoidance, and traffic congestion. Open-source & stress-tested. github.com/jjasghar/ai-ai…

jandotai's tweet card. Running LLMs against a sandbox airport to see if they can make the correct decisions in real time - jjasghar/ai-airport-simulation

726

👋 Jan@jandotai · Jul 22

ByteDance releases Seed-X, a 7B multilingual translation model. It's trained with instructions, RL, and reward tuning. Reportedly outperforms Gemini 2.5, Claude 3.5, and GPT-4 on translation across 28 languages, based on human evals & automatic metrics. huggingface.co/ByteDance-Seed…

jandotai's tweet card. ByteDance-Seed/Seed-X-Instruct-7B · Hugging Face

3.0K

👋 Jan@jandotai · Jul 22

OmniSVG-3B weights are now available. A multimodal model for generating scalable vector graphics, icons, logos, even anime characters from text or images. Also includes a 2M-sample dataset (MMSVG-2M) and supports both local and online demos. huggingface.co/OmniSVG/OmniSVG

jandotai's tweet card. OmniSVG/OmniSVG · Hugging Face

774

👋 Jan@jandotai · Jul 21

Qwen3-235B's latest version leads across 5 major reasoning and knowledge benchmarks. It's beating Claude Opus, GPT-4o, and DeepSeek V3 on GPQA, AIME25, LiveCodeBench, Arena-Hard, and BFCL.

1.0K

👋 Jan@jandotai · Jul 21

Qwen announced Qwen3-235B-A22B-Instruct-2507, dominating math reasoning with 70.3 on AIME25 (vs GPT-4o's 26.7) and achieving 95.0 on ZebraLogic. It outperforms Opus 4 and GPT-4o across reasoning & coding while using 22B active parameters. huggingface.co/Qwen/Qwen3-235…

jandotai's tweet card. Qwen/Qwen3-235B-A22B-Instruct-2507 · Hugging Face

193

5.0K

👋 Jan@jandotai · Jul 21

Audio search without transcription: ColQwen-Omni embeds 30 minutes of audio in 10 seconds and handles documents, audio, and video in a single 3B model. huggingface.co/vidore/colqwen…

jandotai's tweet card. vidore/colqwen-omni-v0.1 · Hugging Face

269

188

9.0K

👋 Jan Retweeted

Paul Velonis@Velona · Jul 20

After spending 12 hours on a plane yesterday, I was really missing the support of LLMs for research. Currently exploring jan.ai which seems to be a very nice and easy way to run local open source models on my oldish Mac m1 air. Have a feeling this endeavour might…

536

👋 Jan@jandotai · Jul 21

A new Russian-aligned 32B model just beat Qwen3-32B and DeepSeek R1 on ruMMLU, ruAIME, and LiveCodeBench. T-pro-it-2.0 builds on Qwen3 with continual pretraining, 500K instructions, and a toggleable reasoning mode. huggingface.co/t-tech/T-pro-i…

jandotai's tweet card. t-tech/T-pro-it-2.0 · Hugging Face

2.0K

👋 Jan@jandotai · Jul 21

Pusa is a new video diffusion model that matches SOTA with 200x less training cost & 2500x less data. It outperforms Wan-I2V on VBench-I2V, runs 5x faster, and supports I2V, T2V, start-end frames, video extension, and video completion. huggingface.co/RaphaelLiu/Pus…

jandotai's tweet card. RaphaelLiu/PusaV1 · Hugging Face

346

167

17.0K

👋 Jan@jandotai · Jul 17

Jan v0.6.5 is out: SmolLM3-3B now runs locally

Highlights 💫
- Support for @huggingface's SmolLM3-3B
- Fully responsive design across all screen sizes
- New layout for Model Providers

Update your Jan or download the latest.

7.0K

👋 Jan@jandotai · Jul 18

250 tokens/sec with Kimi K2 in Jan via @GroqInc.

To set it up:
- Go to Settings > Model Providers > Groq
- Paste your Groq API key
- Add model: moonshotai/kimi-k2-instruct

1.0K

👋 Jan Retweeted

Menlo Research@menloresearch · Jul 14

Menlo Research is hiring, come build cool stuff with us.

Some open roles:
- Sr Backend Engineer (Tauri)
- Sr Software Engineer
- Research Engineer (PyTorch)
- Lead Electromechanical Systems Engineer
- Mechanical Engineer
- Management Associate

menlo.bamboohr.com/careers/

2.0K

👋 Jan@jandotai · Jul 16

Microsoft releases a new dataset that improves Qwen2.5-7B from 17.4% to 57.3% on LiveCodeBench. It's called rStar-Coder: 418K tasks designed to push competitive code reasoning. A 7B model trained on it outperforms QwQ-32B on the USA Computing Olympiad. huggingface.co/datasets/micro…

jandotai's tweet card. microsoft/rStar-Coder · Datasets at Hugging Face

117

761

430

36.0K