argmax (@argmaxinc)

Pinned

argmax @argmaxinc · Mar 7

Introducing SpeakerKit

State-of-the-art on-device speaker diarization:
- 10 minutes of audio processed in 3 seconds
- 10 megabytes in total
- 6-year-old devices supported

Details and links to the demo app are in the thread.

argmax @argmaxinc · Jul 22

Argmax Pro SDK 1.6.1 Changelog:
- Truncated words from Parakeet are fixed
- Empty transcripts for very short audio are fixed
- `VoiceActivityDetector.modelVAD` is now compatible with Parakeet models

Thanks for all the reproducible reports from developers as well as end-users!…

argmax @argmaxinc · Jul 16

Major update to Argmax Pro SDK dropped today!
- Real-time transcription API now supports multiple concurrent sessions
- Diarized transcriptions have 40% lower error rates
- New high-level APIs to streamline developer experience

These improvements come at no additional latency or…

argmax @argmaxinc · Jul 21

@superwhisperapp's local speech-to-text models have ~150 ms latency with the @argmaxinc backend but cloud LLM models for post-processing were adding huge latency on top. Thanks for reducing it @rauchg!

Guillermo Rauch @rauchg · Jul 20

It's been fun collaborating with @superwhisperapp, a blazing fast AI speech-to-text app, with support for local and cloud models. By putting @vercel CDN in front of their model API, they're seeing up to 350ms gains in some geos 🤯 To be clear, this is 350ms+ faster by *just*…

argmax Retweeted

Guillermo Rauch @rauchg · Jul 20

It's been fun collaborating with @superwhisperapp, a blazing fast AI speech-to-text app, with support for local and cloud models. By putting @vercel CDN in front of their model API, they're seeing up to 350ms gains in some geos 🤯 To be clear, this is 350ms+ faster by *just*…

argmax @argmaxinc · Jul 15

It is raining speech models today!! @NVIDIAAI also quietly reclaimed the lead on the OpenASR leaderboard:

Mistral AI @MistralAI · Jul 15

Introducing the world's best (and open) speech recognition models!

argmax Retweeted

Atila @atiorh · Jul 14

Going to @icmlconf and want to learn about the frontiers of on-device AI? Catch my talk on July 18 after @DAlistarh and before @songhan_mit! I will discuss the secret sauce and benchmarks behind how @argmaxinc built WhisperKit to outmatch several top cloud providers for…
