Jaydev Tonde
@JaydevTonde
Data Scientist II @Wolters_Kluwer , Master's in computer science from Pune University, Visit My Blog : https://substack.com/@jaydevtonde
Just finished experimentation with the ModernBERT model released by @answerdotai on one multi label classification task. As mentioned in the paper it has outperformed the DeBERTa model which we were using previously and it is 2.5x faster also #ArtificialIntelligence #Transformers
Granular metrics are helping to understand where to focus exactly. #kaggle #MAP

📊 New AI Agents benchmark every week: AbsenceBench, FutureBench, ShadeArena… Soon we’ll need BenchmarkBench to evaluate which benchmarks actually matter 😅 #AIAgents
cool project from @snorbyte training a text-to-speech model that can, among other things, "code switch" between Indic languages and English demo deployed on Modal! snorbyte.com/blog/train-sot…
🚀Summer Fest Day 3: Cost-Effective MoE Inference on CPU from Intel PyTorch team Deploying 671B DeepSeek R1 with zero GPUs? SGLang now supports high-performance CPU-only inference on Intel Xeon 6—enabling billion-scale MoE models like DeepSeek to run on commodity CPU servers.…
Kimi K2 and when "DeepSeek Moments" become normal One "DeepSeek Moment" wasn't enough for us to wake up, hopefully we don't need a third. interconnects.ai/p/kimi-k2-and-…
Hi everyone, I’ve decided to move on from my current company after almost 3 years and I am open to new opportunities in AI/LLM engineering/research. I have designed and deployed production-grade RAG systems, fine-tuned and served open LLMs at scale. Key highlight from my…