Diptesh Kanojia
@diptesh
Senior Lecturer in NLP for AI, Institute for @PeopleCentedAI | University of Surrey | #nlproc
📢 Test Set RELEASED! 🚀 The test set for the #WMT25 Shared Task on QE-informed Segment-level Error Correction is now LIVE! It's time to put your MT error correction / APE methods to the test. Let's see how well they can correct machine translation! #NLProc #MT #WMT2025
Prof. Pushpak Bhattacharyya, in conversation with @EconomicTimes, advocates for trinity models—smaller, cost-effective AI models tailored to India’s diverse languages, domains, and tasks. Link: economictimes.indiatimes.com/tech/artificia… #CFILT #NLP #AI #LLM
Organizers are happy to help with any questions. 🙂 Website with all details and contacts: www2.statmt.org/wmt25/mteval-s…
📐Task 3: Quality-informed segment-level error correction Automatically post-edit machine-translated text using quality annotations to generate minimal and accurate corrections. Description: www2.statmt.org/wmt25/mteval-s… Submission platform: codabench.org/competitions/8…
📐Task 2: Span-level error detection Identify and locate translation errors within each segment (start/end indices) and classify their severity. Description: www2.statmt.org/wmt25/mteval-s… Submission platform: codabench.org/competitions/9…
📐Task 1: Segment-level quality score prediction Predict a quality score for each source–target segment pair, using document-level context and either ESA or MQM annotations. Description: www2.statmt.org/wmt25/mteval-s… Submission platform: codabench.org/competitions/9…
The 2025 MT Evaluation shared task brings together the strengths of the previous Metrics and Quality Estimation tasks under a single, unified evaluation framework. The following tasks are now open (deadline July 31st but participation has never been easier 🙂)
Machine Translation is my first and final love. Every piece of work I do has some flavor of Machine Translation to it. Machine Translation is the best test bed for any sequence-to-sequence neural architecture. So it's best you read the book on NMT by the OG MT teacher Prof Philipp…
📢 Presenting IndicSeamless: A Speech Translation Model for Indian Languages 🎙️🌍 IndicSeamless is a speech translation model fine-tuned from SeamlessM4Tv2-large on 13 Indian languages. Trained on a curated subset of BhasaAnuvaad, the largest open-source Speech Translation…
📣 Exciting #Hiring opportunity! Lecturer in Natural Language Processing #NLP. Apply 👉 tinyurl.com/4u2b8xyc. Closes 12 Mar. Our School of Computer Science & Electronic Engineering seeks a Lecturer in Natural Language Processing to grow AI research 💡