14.8.0 - Containers

Version 14.8.0 is now available for the Batch Container, Real-Time Container, GPU Transcription Inference Container and GPU Translation Inference Container.

New

CPU & GPU

Batch and Real-time transcription

Tagalog is now available as a new language. It supports code-switching between Filipino and English for bilingual speech.

GPU

Batch and Real-time transcription

Speaker identification – Introduces the ability to label speakers using unique speaker identifiers. Refer to the documentation for details.

Real-time transcription

Channel diarization – Enables perfect speaker separation when there is one speaker per channel. Refer to the documentation for details.
Channel and Speaker diarization – Enables separation of multiple speakers per channel. Refer to the documentation for details.
Force end of utterance – Lets the client force the transcript to be finalised at the end of speech for faster finals (200 ms). This is ideal when using an external VAD or turn detection model for voice agents. Refer to the documentation for details.

Improvements

GPU & CPU

Real-time transcription

Speaker Diarization – Improved speaker change detection accuracy for long audio streams (1+ hours)

GPU

Batch and Real-time transcription

Enhanced Operating Point
- New models for English with improved accuracy in transcribing initialisms
- Medical domain-specific model for Dutch, English, Finnish, French, German and Spanish, giving the highest accuracy for healthcare use cases. Refer to the documentation for more details.
  - Relative accuracy improvements (medical domain): Dutch (nl) - 70%, English (en) - 14%, Finnish (fi) - 40%, French (fr) - 51%, German (de) - 36%, Spanish (es) - 63%
- New models for Bilingual Malay English, Dutch, Finnish, French and Spanish with improved transcription accuracy
  - Relative accuracy improvements (general domain): Bilingual Malay English (en_ms) - 7.8%, Dutch (nl) - 16%, Finnish (fi) - 20%, French (fr) - 3%, Spanish (es) - 4.5%
Standard Operating Point
- Enables faster transcription and higher throughput. Refer to the documentation for more details
- New models released with notable accuracy uplifts for the following languages:
  - Relative accuracy improvements: Belarusian (be) - 5%, Bulgarian (bg) - 7%, Catalan (ca) - 29%, Danish (da) - 22%, Finnish (fi) - 17%, Galician (gl) - 8%, Hungarian (hu) - 15%, Indonesian (id) - 14%, Korean (ko) - 8%, Latvian (lv) - 9%, Lithuanian (lt) - 11%, Marathi (mr) - 16%, Norwegian (no) - 12%, Romanian (ro) - 6%, Slovenian (sl) - 9%, Swedish (sv) - 6%, Thai (th) - 14%, Turkish (tr) - 32%, Ukrainian (uk) - 6%, Urdu (ur) - 15%, Vietnamese (vi) - 66%, Welsh (cy) - 17%
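
The notes above do not define how "relative accuracy improvement" is measured. As a rough illustration only, the sketch below assumes it means a relative reduction in word error rate (WER). That reading is an assumption, and the 10% baseline WER is a made-up figure, not a published result. The function names are hypothetical.

```python
def relative_improvement(old_error_rate: float, new_error_rate: float) -> float:
    """Relative error reduction, as a percentage of the old error rate.

    ASSUMPTION: "relative accuracy improvement" is read here as a relative
    reduction in word error rate; the release notes do not state the metric.
    """
    if old_error_rate <= 0:
        raise ValueError("old_error_rate must be positive")
    return (old_error_rate - new_error_rate) / old_error_rate * 100.0


def error_rate_after(old_error_rate: float, improvement_pct: float) -> float:
    """Error rate implied by applying a relative improvement to a baseline."""
    return old_error_rate * (1.0 - improvement_pct / 100.0)


if __name__ == "__main__":
    # Hypothetical baseline WER of 10%, combined with the 32% relative
    # improvement listed for Turkish (tr) on the Standard Operating Point.
    baseline_wer = 10.0
    new_wer = error_rate_after(baseline_wer, 32.0)
    print(f"Implied WER: {new_wer:.1f}%")
    print(f"Relative improvement: {relative_improvement(baseline_wer, new_wer):.0f}%")
```

Under this reading, a relative figure scales with the baseline: the same 32% corresponds to a larger absolute change on a weaker baseline than on a strong one.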

CPU

Batch and Real-time transcription

New Dutch models (Enhanced Operating Point) give up to 20% relative accuracy improvement

Real-time transcription

Improved Speaker Diarization accuracy for Enhanced Operating Point

Security fixes

GPU & CPU

A Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
Patched libgstreamer to address CVE-2025-3887. Scanners are still likely to report it as vulnerable because the base version string is unchanged.
