November 28th, 2025
Batch Container
Real-Time Container
GPU Transcription Inference Container
GPU Translation Inference Container
Version 14.8.0 is now available for Batch Container, Real-Time Container, GPU Transcription Inference Container and GPU Translation Inference Container.
New Tagalog language is now available. Supports code-switching between Filipino and English for bilingual speech
Speaker identification – Introduces the ability to label speakers using unique speaker identifiers. Refer to documentation here for details
Channel diarization – Enables perfect speaker separation when there is one speaker per channel. Refer to documentation here for details
Channel and Speaker diarization – Enables separation of multiple speakers per channel. Refer to documentation here for details
Force end of utterance – Enables the client to force finalise transcription at the end of speech for faster finals (200ms), ideal when using external VAD or turn detection models for voice agents. Refer to documentation here for details
Speaker Diarization – Improved speaker change detection accuracy for long audio streams (1+ hours)
Enhanced Operating Point
New models for English with improved accuracy in transcribing initialisms
Medical domain-specific model for Dutch, English, Finnish, French, German, Spanish giving the highest accuracy for healthcare use cases. Refer to the documentation for more details.
Relative accuracy improvements (medical domain): Dutch (nl) - 70%, English (en) - 14%, Finnish (fi) - 40%, French (fr) - 51%, German (de) - 36%, Spanish (es) - 63%
New models for Bilingual Malay English, Dutch, Finnish, French, Spanish with improved transcription accuracy
Relative accuracy improvements (general domain): Bilingual Malay English (en_ms)- 7.8%, Dutch (nl) - 16%, Finnish (fi) - 20%, French (fr) - 3%, Spanish (es) - 4.5%
Standard Operating Point
Enables faster transcription and higher throughput, refer to the documentation for more details
New models released with notable accuracy uplifts for the below languages:
Relative accuracy improvements: Belarusian (be) - 5%, Bulgarian (bg) - 7%, Catalan (ca) - 29%, Danish (da) - 22%, Finnish (fi) - 17%, Galician (gl) - 8%, Hungarian (hu) - 15%, Indonesian (id) - 14%, Korean (ko) - 8%, Latvian (lv) - 9%, Lithuanian (lt) - 11%, Marathi (mr) - 16%, Norwegian (no) - 12%, Romanian (ro) - 6%, Slovenian (sl) - 9%, Swedish (sv) - 6%, Thai (th) - 14%, Turkish (tr) - 32%, Ukrainian (uk) - 6%, Urdu (ur) - 15%, Vietnamese (vi) - 66%, Welsh (cy) - 17%
Batch and Real-time transcription
New Dutch models (Enhanced Operating Point) give up to 20% relative accuracy improvement
Real-time transcription
Improved Speaker Diarization accuracy for Enhanced Operating Point
A Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal
Patched libgstreamer to address CVE-2025-3887. However, scanners are likely to still report it as vulnerable due to the unchanged base version string