Release Notes

Follow new updates and improvements to Speechmatics.

June 4th, 2026

Batch Container

Realtime Container

Version 15.9.0 is now available for Batch Container, Real-Time Container, GPU Transcription Inference Container and GPU Translation Inference Container.

New


GPU & CPU

HTTP Batch Transcription

  • Supports URL-based audio input. Users can provide an audio file URL, enabling the service to fetch the file for transcription. See the documentation for more details.

  • The /ready endpoint now reports engines_used, improving service visibility. See the documentation for more details.
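As a rough illustration of how a deployment script might read the new field, the sketch below polls /ready and pulls out engines_used. The only names taken from these notes are the /ready path and the engines_used key. The base URL, the assumption that the response body is a JSON object, and both helper function names are hypothetical, so check the documentation for the actual response shape.

```python
import json
from urllib.request import urlopen


def engines_used_from_ready(body: str):
    """Return the engines_used value from a /ready response body.

    Assumes (not confirmed by the release notes) that the body is a JSON
    object. Returns None when the key is missing or the body is not an object.
    """
    payload = json.loads(body)
    if not isinstance(payload, dict):
        return None
    return payload.get("engines_used")


def fetch_engines_used(base_url: str, timeout: float = 5.0):
    """Query <base_url>/ready and return the reported engines_used value.

    base_url is a placeholder for wherever the container is exposed.
    """
    with urlopen(f"{base_url}/ready", timeout=timeout) as resp:
        return engines_used_from_ready(resp.read().decode("utf-8"))
```

Keeping the parsing separate from the HTTP call makes it possible to test the helper against a saved response without a running container.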

Improvements


CPU

  • New Arabic models (Enhanced Operating Point) deliver a relative accuracy improvement of up to 2.7%.

Fixes


GPU & CPU

  • Fixed transcription job failures when speaker diarization was enabled for WAV files with missing or incorrect duration metadata.

  • Fixed failure to generate transcripts for a small number of stereo files when one channel contains leading silence.

Security fixes


Vulnerability Management

  • Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.

  • libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged.

  • tritonserver 2.61.0 (NGC 25.09): This is a component in sm-gpu-inference-server and sm-translation-server images that is not listed in the corresponding SBOMs due to lack of installation metadata in the base images published by Nvidia.

    The identified CVEs have been reviewed and are not considered exploitable, provided the Triton API is only accessible to transcriber containers within the same trust domain. This can be achieved by deploying STT in a private Kubernetes cluster or by using a service mesh to restrict access to the appropriate pods.

    A future release will rebuild against Nvidia NGC 26.03 or later with the relevant fixes.

Identified CVEs | Severity
--- | ---
CVE-2026-24207 | CRITICAL: Mar 2026 bulletin (a_id 5790), fixed in NGC 26.03
CVE-2026-24206, CVE-2026-24208, CVE-2026-24209, CVE-2026-24210, CVE-2026-24213, CVE-2026-24214 | HIGH: Mar 2026 bulletin (a_id 5790), fixed in NGC 26.03
CVE-2026-24215 | HIGH: May 2026 bulletin (a_id 5828), fixed in NGC 26.03
CVE-2026-24146, CVE-2026-24147, CVE-2026-24173, CVE-2026-24174, CVE-2026-24175 | HIGH: Apr 2026 bulletin (a_id 5816), fixed in NGC 26.02
CVE-2025-33201, CVE-2025-33211 | HIGH: Dec 2025 bulletin (a_id 5734), fixed post-25.09
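The tritonserver note above asks that the Triton API be reachable only from transcriber containers in the same trust domain, and names a private cluster or a service mesh as ways to do that. A Kubernetes NetworkPolicy is a further option that these notes do not mention. The sketch below builds such a manifest as a Python dictionary. The namespace, policy name, pod labels, and ports are all assumptions: 8000 and 8001 are Triton's upstream default HTTP and gRPC ports, and the Speechmatics images may be configured differently.

```python
import json


def triton_network_policy(namespace, triton_labels, client_labels, ports=(8000, 8001)):
    """Build a NetworkPolicy that only admits ingress from selected client pods.

    All arguments are hypothetical placeholders; substitute the labels and
    ports that your own deployment actually uses.
    """
    return {
        "apiVersion": "networking.k8s.io/v1",
        "kind": "NetworkPolicy",
        "metadata": {"name": "restrict-triton-ingress", "namespace": namespace},
        "spec": {
            # Pods the policy protects (the inference server pods).
            "podSelector": {"matchLabels": dict(triton_labels)},
            "policyTypes": ["Ingress"],
            "ingress": [
                {
                    # Only pods carrying the client labels may connect.
                    "from": [{"podSelector": {"matchLabels": dict(client_labels)}}],
                    "ports": [{"protocol": "TCP", "port": p} for p in ports],
                }
            ],
        },
    }


if __name__ == "__main__":
    policy = triton_network_policy(
        namespace="stt",                      # assumed namespace
        triton_labels={"app": "inference"},   # assumed label on Triton pods
        client_labels={"app": "transcriber"}, # assumed label on transcriber pods
    )
    print(json.dumps(policy, indent=2))
```

The printed JSON can be passed to kubectl apply -f. A NetworkPolicy only takes effect when the cluster's network plugin enforces it, so treat this as one layer alongside the measures named above, not a replacement for them.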

May 14th, 2026

Realtime Kubernetes

Updates


  • Updates Redis dependency to use 8.6.3-alpine image.

    • Redis is deployed using the community image from a dependent Helm chart ("sm-redis") instead of the Bitnami image and Helm chart.

  • Updates Realtime container version to 15.7.0. For more information, see the 15.7.0 container release notes.

  • Updates sessiongroups CustomResourceDefinitions and controller. Refer to the sm-realtime Helm chart RELEASE_NOTES.md for upgrade details.

Security fixes


Vulnerability Management

  • Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.

  • libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged.

Non-Applicable CVEs

The following vulnerabilities were reviewed and determined to have no security impact on this release due to specific configurations or the use of closed, trusted environments:

Component | Identified CVEs
--- | ---
python-multipart | CVE-2026-42561
urllib3 | CVE-2026-44431, CVE-2026-44432

May 12th, 2026

Batch SaaS

Fixes

Fixed transcription job failures when speaker diarization was enabled for WAV files with missing or incorrect duration metadata.

  • Bug introduced in release version: 2026.04.23

Updated Orchestrator Version: 2026.05.08+3477f55380+15.9.0
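The Orchestrator version strings in these notes appear to have three fields separated by "+": a date-like build version, a short build identifier, and a trailing number that lines up with the container versions listed elsewhere on this page. That reading is inferred from the examples only and is not a documented format. Under that assumption, a minimal parser might look like this (the field names are hypothetical):

```python
from typing import NamedTuple


class OrchestratorVersion(NamedTuple):
    build_date: str         # e.g. "2026.05.08" (assumed meaning)
    build_id: str           # e.g. "3477f55380" (assumed meaning)
    container_version: str  # e.g. "15.9.0" (assumed meaning)


def parse_orchestrator_version(value: str) -> OrchestratorVersion:
    """Split an Orchestrator version string into its three '+'-separated parts.

    Raises ValueError when the string does not have exactly three
    non-empty fields.
    """
    parts = value.strip().split("+")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"unexpected Orchestrator version format: {value!r}")
    return OrchestratorVersion(*parts)


if __name__ == "__main__":
    version = parse_orchestrator_version("2026.05.08+3477f55380+15.9.0")
    print(version.container_version)
```

Splitting out the last field makes it easier to match a SaaS release against the container release notes that share the same version number.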

May 7th, 2026

Batch SaaS

Fixes

Fixed failure to generate transcripts for a small number of stereo files when one channel contains leading silence.

  • Resolves the following tickets: 32372

  • Bug introduced in release version: 2026.04.23

Updated Orchestrator Version: 2026.05.05+c019d07f67+15.8.0

May 1st, 2026

Batch Container

Realtime Container

GPU Transcription Inference Container

GPU Translation Inference Container

Version 15.7.0 is now available for Batch Container, Real-Time Container, GPU Transcription Inference Container and GPU Translation Inference Container.

New


GPU & CPU

  • HTTP Batch Transcription – Processes multiple jobs using persistent workers, reducing turnaround time and improving CPU/GPU utilization. See the documentation for more details.

Improvements


GPU

  • New model (Enhanced Operating Point) for English (en) improves accuracy across:

    • Numbers, spellouts, and other alphanumerics

    • Medical measurements and terminology

    • Mixed spoken alphanumeric sequences of numbers and characters

    • Character sequences, for example spellouts of names or abbreviations

    • Formatting consistency for letter sequences, now returned as upper case letters

    • Email addresses and web URLs

    • Large and compound monetary amounts

Category | Relative Improvement (WER)
--- | ---
Numbers | 69%
Spellouts | 89%
Mixed alphanumerics | 42%
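The figures above are relative improvements in word error rate (WER), not absolute ones. Assuming the conventional definition, where the relative improvement is (old WER - new WER) / old WER, the short sketch below shows what a given percentage implies for a baseline. The baseline value in the example is made up for illustration and is not a measured Speechmatics result.

```python
def wer_after_relative_improvement(baseline_wer: float, relative_improvement: float) -> float:
    """Return the WER implied by a relative improvement over a baseline.

    Both arguments are fractions, so 0.69 means 69%.
    """
    if baseline_wer < 0.0:
        raise ValueError("baseline_wer must be non-negative")
    if not 0.0 <= relative_improvement <= 1.0:
        raise ValueError("relative_improvement must be between 0 and 1")
    return baseline_wer * (1.0 - relative_improvement)


if __name__ == "__main__":
    # Hypothetical baseline: 10% WER on a number-heavy test set.
    print(wer_after_relative_improvement(0.10, 0.69))
```

With a hypothetical 10% baseline, a 69% relative improvement corresponds to a WER of roughly 3.1%, not a drop of 69 percentage points.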

Updates


GPU

  • Updated Japanese (ja) model for Enhanced Operating Point

Security fixes


Vulnerability Management

  • Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.

  • libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged.

April 24th, 2026

Realtime SaaS

Improvements

  • New model (Enhanced Operating Point) for English (en) improves accuracy across:

    • Medical measurements and terminology

    • Mixed spoken alphanumeric sequences of numbers and characters

    • Character sequences, for example spellouts of names or abbreviations

    • Formatting consistency for letter sequences, now returned as upper case letters

    • Email addresses and web URLs

    • Large and compound monetary amounts

  • Improved accuracy at low latency when using ForceEndOfUtterance

Updates

  • Updated Japanese (ja) model for Enhanced Operating Point

Updated Orchestrator Version: 2026.04.21+fd908134bc+15.7.0

April 16th, 2026

Batch SaaS

Improvements

New model (Enhanced Operating Point) for English (en) improves accuracy across:

  • Medical measurements and terminology

  • Mixed spoken alphanumeric sequences of numbers and characters

  • Character sequences, for example spellouts of names or abbreviations

  • Formatting consistency for letter sequences, now returned as upper case letters

  • Email addresses and web URLs

  • Large and compound monetary amounts

Updates

Updated Japanese (ja) model for Enhanced Operating Point

Updated Orchestrator Version: 2026.04.10+192b655fa8+15.5.0

March 12th, 2026

Realtime SaaS

Improvements

New model (Enhanced Operating Point) for English (en) improves accuracy across numbers, spellouts, and other alphanumerics.

Category | Relative Improvement (WER)
--- | ---
Numbers | 69%
Spellouts | 89%
Mixed alphanumerics | 42%

Updated Orchestrator Version: 2026.02.27+2ce3ed4fc8+15.2.0

March 9th, 2026

Batch SaaS

Improvements

  • Faster processing for audio files up to 12 minutes when using a custom dictionary.

    • Supported languages: Arabic, Catalan, Dutch, English, French, German, Greek, Hebrew, Hindi, Japanese, Norwegian, Persian, Portuguese, Russian, Spanish, and Swedish.

  • New model (Enhanced Operating Point) for English (en) improves accuracy across numbers, spellouts, and other alphanumerics.

Category | Relative Improvement (WER)
--- | ---
Numbers | 69%
Spellouts | 89%
Mixed alphanumerics | 42%

Updated Orchestrator Version: 2026.02.27+2ce3ed4fc8+15.2.0

March 5th, 2026

Realtime Kubernetes

Updates


  • Enables repopulation to automatically restore cluster state in the event of Redis data loss.

  • Supports language and operating-point based model cost generation for the custom inference server recipe. Refer to the sm-realtime Helm chart README.md for configuration details.

  • Updates Realtime container version to 15.0.0. For more information, see the 15.0.0 container release notes.

Security fixes


Vulnerability Management

  • Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.

  • libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged.

  • Redis versioning: By default, this release uses Redis version 8.2.1, which has publicly disclosed security vulnerabilities (CVEs) that have been assessed as not impacting this product. Users who require additional security controls or remediation for these CVEs can deploy a different Redis version that remediates the issues of concern.

Non-Applicable CVEs

The following vulnerabilities (including those reported for the third-party package Redis version) were reviewed and determined to have no security impact on this release due to specific configurations or the use of closed, trusted environments:

Component | Identified CVEs
--- | ---
redis | CVE-2025-49844, CVE-2025-46817, CVE-2025-46818, CVE-2025-46819, CVE-2025-62507
stdlib | CVE-2025-58183, CVE-2025-61726, CVE-2025-61728, CVE-2025-61729, CVE-2025-68121
protobuf | CVE-2026-0994
cryptography | CVE-2026-26007
General Libs | libc (CVE-2025-4802, CVE-2026-0861), libpam (CVE-2025-6020), libssl3 (CVE-2025-15467, CVE-2025-69419, CVE-2025-69421), zlib (CVE-2023-45853), gpgv (CVE-2025-68973, CVE-2026-24882)
Others | perl (CVE-2023-31484)