This ASR actually supports 52 languages

Author(s): Gowtham Boyina

Originally published in Towards Artificial Intelligence.

The forced alignment model is the interesting part

Over time, I have tested dozens of speech recognition models. Most claim multilingual support, but they quietly fall apart when you feed them actual Chinese dialects, accented English, or anything other than standard broadcast audio. The ones that work well are usually proprietary APIs that scale uncomfortably.


Image source: Qwen-ASR GitHub

Alibaba's Qwen team introduced Qwen3-ASR, an open-source speech recognition system that supports 52 languages and dialects. The key models are Qwen3-ASR-1.7B, which boasts state-of-the-art performance on multilingual tasks, and Qwen3-ForcedAligner-0.6B, a non-autoregressive model for accurate alignment between speech and text. Together, these models offer better support for Chinese dialects and multilingual user-generated content, along with improved timestamp accuracy for applications that require precise audio-text synchronization.
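
To make the audio-text synchronization use case concrete, here is a minimal sketch of what an application might do with a forced aligner's output: turn word-level timestamps into SRT subtitle cues. It does not call Qwen3-ForcedAligner. The `AlignedWord` structure, the helper names, and the sample timings are hypothetical stand-ins for whatever format the aligner actually returns.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AlignedWord:
    """Hypothetical container for one aligned word (not the Qwen output format)."""
    text: str
    start: float  # start time in seconds
    end: float    # end time in seconds


def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: List[AlignedWord], max_words: int = 4) -> str:
    """Group aligned words into fixed-size cues and render them as SRT text."""
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        start = format_timestamp(chunk[0].start)   # cue starts with its first word
        end = format_timestamp(chunk[-1].end)      # cue ends with its last word
        text = " ".join(w.text for w in chunk)
        cues.append(f"{len(cues) + 1}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)


# Made-up timings, for illustration only.
sample_words = [
    AlignedWord("forced", 0.00, 0.42),
    AlignedWord("alignment", 0.42, 1.05),
    AlignedWord("maps", 1.05, 1.30),
    AlignedWord("text", 1.30, 1.62),
    AlignedWord("to", 1.62, 1.75),
    AlignedWord("audio", 1.75, 2.20),
]

if __name__ == "__main__":
    print(words_to_srt(sample_words))
```

Grouping by a fixed word count keeps the sketch short. A real subtitle pipeline would more likely split cues on pauses or punctuation, which is where accurate timestamps matter most.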

