June 4th, 2026
Batch Container
Realtime Container
Version 15.9.0 is now available for Batch Container, Real-Time Container, GPU Transcription Inference Container and GPU Translation Inference Container.
HTTP Batch Transcription
Supports URL-based audio input. Users can provide an audio file URL, enabling the service to fetch the file for transcription. See the documentation for more details.
The /ready endpoint now reports engines_used, improving service visibility. See the documentation for more details.
New Arabic models (Enhanced Operating Point) give up to 2.7% relative accuracy improvement.
Fixed transcription job failures when speaker diarization was enabled for WAV files with missing or incorrect duration metadata.
Fixed failure to generate transcripts for a small number of stereo files when one channel contains leading silence.
Software Bill of Materials (SBOM) is available for download from the corresponding release page in our Support Portal.
libgstreamer (CVE-2025-3887): This component has been manually patched to address CVE-2025-3887. Note: Security scanners may still flag this component as vulnerable because the base version string remains unchanged
tritonserver 2.61.0 (NGC 25.09): This is a component in sm-gpu-inference-server and sm-translation-server images that is not listed in the corresponding SBOMs due to lack of installation metadata in the base images published by Nvidia.
The identified CVEs have been reviewed and are not considered exploitable, provided the Triton API is only accessible to transcriber containers within the same trust domain. This can be achieved by deploying STT in a private Kubernetes cluster or by using a service mesh to restrict access to the appropriate pods.
A future release will rebuild against Nvidia NGC 26.03 or later with the relevant fixes.