All Topics
All Topics
Technology
Technology
AI
AI
Business
Business
Entertainment
Entertainment
News
News
Programming
Programming
Security
Security
Science
Science
Design
Design
Environment
Environment
Finance
Finance
Crypto
Crypto
Politics
Politics
Sports
Sports
Education
Education
Gaming
Gaming
Art
Art
Music
Music
Health
Health
Books
Books
Food
Food
Travel
Travel
Personal
Personal
Bluesky
Twitter

CraneAI Labs Releases v1.2 of Streaming ASR Model for Luganda, Shona, and Swahili

15h ago· 4 min readen

Summary

CraneAI Labs released version 1.2 of their crane-nemo-asr model, a streaming automatic speech recognition (ASR) system fine-tuned from NVIDIA's Nemotron-3.5 ASR model. It supports Luganda, Shona, and Swahili (with English retained) using a FastConformer Cache-Aware RNN-Transducer architecture with ~600M parameters. The key improvement in v1.2 is the recovery of long training clips (over 20 seconds) that were previously dropped, enabling transcription of longer conversational monologues. The model is designed for real-time, cache-aware streaming transcription conditioned on a language-ID prompt.

Source

Twitter / XCraneAI Labs Releases v1.2 of Streaming ASR Model for Luganda, Shona, and Swahilihuggingface.co

Key quotes

· 5 pulled
A streaming automatic speech recognition model for Luganda, Shona, and Swahili (with English retained), fine-tuned from nvidia/nemotron-3.5-asr-streaming-0.6b
The model transcribes conversational and read speech in real time (cache-aware streaming) and is conditioned on a language-ID prompt.
What's new in 1.2 — more training data, from the data we already had.
Earlier versions dropped every training clip longer than 20 s (long conversational monologues), because the trainer can't fit them. 1.2 recovers them.
We're on a journey to advance and democratize artificial intelligence through open source and open science.
Snippet from the RSS feed
We’re on a journey to advance and democratize artificial intelligence through open source and open science.

You might also wanna read

Comments

Sign in to join the conversation.

No comments yet. Be the first.