March 5th, 2026
Real-Time Kubernetes
Enables repopulation to automatically restore cluster state in the event of Redis data loss.
Supports language and operating-point based model cost generation for the custom inference server recipe. Refer to the sm-realtime Helm chart README.md for configuration details.
Updates Realtime container version to 15.0.0. For more information, see the 15.0.0 container release notes.
Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged
Redis versioning: This release by default uses Redis version 8.2.1 with publicly disclosed security vulnerabilities (CVEs) that have been assessed to not impact this product. Users who require additional security controls or remediation for these CVEs can choose to deploy a different Redis version which remediates issues of concern.
The following vulnerabilities (including those reported for the third-party package Redis version) were reviewed and determined to have no security impact on this release due to specific configurations or the use of closed, trusted environments:
March 4th, 2026
Real-Time Appliance
Word Replacement - Enables words in the transcript to be modified using a search and replace pattern. Refer to documentation here for details.
Supports the prefer_current_speaker configurable parameter to reduce the likelihood of incorrectly switching between similar sounding speakers. Refer to documentation for more details.
Supports End of Utterance feature. When enabled, this helps detect the end of turn in a conversation. This can benefit use cases such as Voice Agents, dictation and translation, reducing latency. Refer to the documentation for more details.
Supports speaker_sensitivity parameter to configure the sensitivity of speaker detection. Refer to documentation for more details.
New Tagalog language is now available. Supports code-switching between Filipino and English for bilingual speech
Channel diarization – Enables perfect speaker separation when there is one speaker per channel. Refer to documentation here for details
Channel and Speaker diarization – Enables separation of multiple speakers per channel. Refer to documentation here for details
Force end of utterance – Enables the client to force finalise transcription at the end of speech for faster finals (200ms), ideal when using external VAD or turn detection models for voice agents. Refer to documentation here for details
Standard and Enhanced operating point
New bilingual transcription languages: Malay English (en_ms), Tamil English (en_ta), Mandarin English (cmn_en), Arabic English (ar_en). Refer to documentation here for details.
New Multilingual transcription language pack Mandarin Malay Tamil English (cmn_en_ms_ta) available now. Refer to the documentation for more details.
Enhanced operating point
New medical domain-specific models for Danish, Dutch, English, Finnish, French, German, Norwegian, Spanish and Swedish giving the highest accuracy for healthcare use cases. Refer to the documentation for more details.
Speaker Diarization – Improved speaker change detection accuracy for long audio streams (1+ hours)
Significant increase in session density for GPU inference.
Standard Operating Point
Enables faster transcription and higher throughput, refer to the documentation for more details
New models released with notable accuracy uplifts for the below languages:
Enhanced Operating Point
New models for English with improved accuracy in transcribing initialisms
New models released with notable accuracy uplifts for the below languages:
Standard Operating Point
New models released with notable accuracy uplifts for the below languages:
Enhanced Operating Point
Improved Speaker Diarization accuracy for Enhanced Operating Point
New models released with notable accuracy uplifts for the below languages:
Fix failure to process some audio files starting with non-speech audio when speaker diarization is enabled.
Fix for failure to generate Final transcripts for numbers when punctuation is disabled.
Fix for Japanese to address decimals being occasionally transcribed as full stops.
Fix for number formatting in Malay & English bilingual (en_ms) and Tamil & English bilingual (en_ta) language packs
Resolves the following tickets: 30055
Fix for incorrect casing in Japanese (ja) transcription output
Resolves the following ticket: 28276
Fixed a session failure when a custom dictionary’s first item is only a hyphen.
A Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged
The following vulnerabilities were reviewed and determined to have no security impact on this release due to specific configurations or the use of closed, trusted environments:
March 4th, 2026
Batch Appliance
Word Replacement - Enables words in the transcript to be modified using a search and replace pattern. Refer to documentation here for details.
Supports the prefer_current_speaker configurable parameter to reduce the likelihood of incorrectly switching between similar sounding speakers. Refer to documentation for more details.
New Tagalog language is now available. Supports code-switching between Filipino and English for bilingual speech
Standard and Enhanced operating point
New bilingual transcription languages: Malay English (en_ms), Tamil English (en_ta), Mandarin English (cmn_en), Arabic English (ar_en). Refer to documentation here for details.
New Multilingual transcription language pack Mandarin Malay Tamil English (cmn_en_ms_ta) available now. Refer to the documentation for more details.
Enhanced operating point
New medical domain-specific models for Danish, Dutch, English, Finnish, French, German, Norwegian, Spanish and Swedish giving the highest accuracy for healthcare use cases. Refer to the documentation for more details.
Significant increase in session density for GPU inference.
Standard Operating Point
Enables faster transcription and higher throughput, refer to the documentation for more details
New models released with notable accuracy uplifts for the below languages:
Enhanced Operating Point
New models for English with improved accuracy in transcribing initialisms
New models released with notable accuracy uplifts for the below languages:
Standard Operating Point
New models released with notable accuracy uplifts for the below languages:
Enhanced Operating Point
New models released with notable accuracy uplifts for the below languages:
Fix failure to process some audio files starting with non-speech audio when speaker diarization is enabled.
Fix for Japanese to address decimals being occasionally transcribed as full stops.
Fix for number formatting in Malay & English bilingual (en_ms) and Tamil & English bilingual (en_ta) language packs
Resolves the following tickets: 30055
Fix for incorrect casing in Japanese (ja) transcription output
Resolves the following ticket: 28276
Fixed a session failure when a custom dictionary’s first item is only a hyphen.
Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged
The following vulnerabilities were reviewed and determined to have no security impact on this release due to specific configurations or the use of closed, trusted environments:
February 26th, 2026
Batch Container
Real-Time Container
GPU Transcription Inference Container
GPU Translation Inference Container
Version 15.0.0 is now available for Batch Container, Real-Time Container, GPU Transcription Inference Container and GPU Translation Inference Container.
New Bilingual transcription language pack Arabic and English (ar_en) available now.
Supports Medical domain specific models (Enhanced Operating Point) delivering highest accuracy for healthcare use cases.
Supports General domain models (Standard and Enhanced Operating Point) delivering highest accuracy for general use cases.
Updated models for Swedish (sv) with improved transcription accuracy (reduction in incorrect insertions)
“x-ray” and “x-rays” are now always hyphenated in English (en) transcription output
Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged
The following vulnerabilities were reviewed and determined to have no security impact for this release.
February 25th, 2026
Real-Time SaaS
New Bilingual transcription language pack Arabic and English (ar_en) available now.
Supports Medical domain specific models (Enhanced Operating Point) delivering highest accuracy for healthcare use cases.
Supports General domain models (Standard and Enhanced Operating Point) delivering highest accuracy for general use cases.
Updated models for Swedish (sv) with improved transcription accuracy (reduction in incorrect insertions)
“x-ray” and “x-rays” are now always hyphenated in English (en) transcription output
Updated Orchestrator Version: 2026.02.06+686b83b434+15.0.0
February 19th, 2026
Batch SaaS
New Bilingual transcription language pack Arabic English (ar_en) available now.
Supports Medical domain specific models (Enhanced Operating Point) delivering highest accuracy for healthcare use cases.
Supports General domain models (Standard and Enhanced Operating Point) delivering highest accuracy for general use cases.
February 12th, 2026
Batch SaaS
Updated models for Swedish (sv) with improved transcription accuracy (reduction in incorrect insertions)
“x-ray” and “x-rays” are now always hyphenated in English (en) transcription output
Updated Orchestrator Version: 2026.02.06+686b83b434+15.0.0
February 5th, 2026
Real-Time Kubernetes
Adds support for using Realtime components without a ClusterRole, disabling cluster-wide access for the SessionGroups controller. Refer to the sm-realtime Helm chart README.md for configuration details.
This Helm chart version uses Realtime container version 14.13.0. For more information, see the 14.13.0 container release notes.
Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged
Redis versioning: This release by default uses Redis version 8.2.1 with publicly disclosed security vulnerabilities (CVEs) that have been assessed to not impact this product. Users who require additional security controls or remediation for these CVEs can choose to deploy a different Redis version which remediates issues of concern.
The following vulnerabilities (including those reported for the third-party package Redis version) were reviewed and determined to have no security impact on this release due to specific configurations or the use of closed, trusted environments:
February 3rd, 2026
Batch SaaS
Significantly faster processing for Arabic, Catalan, Dutch, French, German, Greek, Hebrew, Hindi, Japanese, Norwegian, Persian, Portuguese, Russian and Swedish audio files up to 12 minutes in duration - released to all endpoints
January 26th, 2026
Batch Container
Real-Time Container
GPU Transcription Inference Container
GPU Translation Inference Container
Version 14.13.0 is now available for Batch Container, Real-Time Container, GPU Transcription Inference Container and GPU Translation Inference Container.
New Multilingual transcription language pack Mandarin Malay Tamil English (cmn_en_ms_ta) available now. Refer to the documentation for more details.
New medical domain-specific models for Danish, Norwegian, Swedish giving the highest accuracy for healthcare use cases. Refer to the documentation for more details.
New models for Bilingual Malay English, Bilingual Tamil English, Danish, Finnish, Maltese, Norwegian, Swedish with improved transcription accuracy
New models for Bilingual Malay English, Bilingual Tamil English, Danish, Finnish, Norwegian, Swedish with improved transcription accuracy
New models for Danish, Finnish, Galician, Norwegian, Swedish with improved transcription accuracy
New models for Danish, Finnish, Galician, Maltese, Norwegian, Swedish with improved transcription accuracy
Fix for number formatting in Malay & English bilingual (en_ms) and Tamil & English bilingual (en_ta) language packs
Resolves the following tickets: 30055
Fix for incorrect casing in Japanese (ja) transcription output
Resolves the following ticket: 28276
Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged
The following vulnerabilities were reviewed and determined to have no security impact for this release.