Announcing the Multilingual Conversational Speech Language Model (MLC-SLM) Challenge

2025-03-20 IDOPRESS

MONROVIA,Calif.,March 19,2025 -- Nexdata,a leading global provider of AI data services today announces the start of The Multilingual Conversational Speech LLM (MLC-SLM) Challenge,an officially approved satellite event of Interspeech 2025.

This challenge,hosted by Meta,Google,Samsung,Naver,China Mobile,Northwestern Polytechnical University and Nexdata,aims to advance multilingual conversational speech AI by providing a real-world dataset and encouraging innovation in speech language models.

The challenge consists of two tasks,both of which require participants to explore the development of speech language models (SLMs):

Task I: Multilingual Conversational Speech Recognition

Objective: Develop a multilingual LLM-based ASR model. Participants will be provided with oracle segmentation and speaker labels for each conversation.

Task II: Multilingual Conversational Speech Diarization and Recognition

Objective: Develop a system for both speaker diarization (identifying who is speaking when),and recognition (transcribing speech to text). No prior or oracle information will be provided during evaluation (e.g.,no pre-segmented utterances or speaker labels). Both pipeline-based and end-to-end systems are encouraged,providing flexibility in system design and implementation.

The training set (Train) comprises approximately 11 languages: English (en),French (fr),German (de),Italian (it),Portuguese (pt),Spanish (es),Japanese (jp),Korean (ko),Russian (ru),Thai (th),Vietnamese (vi). It's designed to provide a rich resource for training and evaluating multilingual conversational speech language models (MLC-SLM),addressing the challenges of linguistic diversity,speaker variability,and contextual understanding.

Important Dates (AOT Time)

March 10,2025: Registration opens


March 15,2025: Training data release


March 20,2025: Development set and baseline system release


May 15,2025: Evaluation set release and Leaderboard open


May 30,2025: Leaderboard freeze and paper submission portal opens (CMT system)


June 15,2025: Paper submission deadline


July 1,2025: Notification of acceptance


August 18,2025: Workshop date

We have set a prize pool of $20,000 for the winners. Based on performance,the top three teams in each track will be awarded:


1st Prize: $5,000


2nd Prize: $3,000


3rd Prize: $2,000

For more details,please check out the challenge website: https://www.nexdata.ai/competition/mlc-slm

Participate here: https://docs.google.com/forms/d/e/1FAIpQLSftZCRQQWvO5NZd-bPo1VT2Xsaieu_ZYCklw6MhW6LqjWnuYQ/viewform?usp=send_form

For inquiries:mlc-slmw@nexdata.ai

Join us in shaping the future of multilingual conversational AI and be part of this groundbreaking challenge!

About Nexdata

Nexdata provides top-notch training data solutions and serves as your reliable partner. With an extensive array of off-the-shelf datasets and flexible data collection and annotation services,our mission revolves around unleashing AI's full potential and expediting the AI industry's growth.

Disclaimer: This article is reproduced from other media. The purpose of reprinting is to convey more information. It does not mean that this website agrees with its views and is responsible for its authenticity, and does not bear any legal responsibility. All resources on this site are collected on the Internet. The purpose of sharing is for everyone's learning and reference only. If there is copyright or intellectual property infringement, please leave us a message.

©copyright 2009-2020 Singapore Info Map