6.0.0 - Batch Appliance

July 9th, 2024

Batch Appliance

GPU & CPU
- New Virtual Appliance architecture with support for our latest generation of Ursa GPU models
- New transcription language - Hebrew (he)
- New transcription language - Persian (fa)
- Automatic Usage Reporting is enabled by default
GPU
- GPU Ursa models - all 50 languages are now available on GPU
  - Major transcription accuracy gains
  - Major improvement in Speaker Diarization accuracy
  - Faster transcription
- Bilingual Spanish and English language pack - this enables Spanish and English to be transcribed accurately within the same file
- Audio Events: Detection of music, laughter and applause in media files now supported. Refer to documentation here to get started

Improved transcription accuracy for English, Norwegian, Romanian, Basque, Belarusian, Estonian, Mongolian, Thai, Vietnamese, and Welsh
Enhanced transcription of disfluencies in English. The model now more accurately captures common disfluencies like "um" and "uh". This change makes our ASR even more accurate for verbatim transcription, great for use cases such as audio editing, analytics on hesitations for call centers and legal transcription. For details on how to identify disfluencies in output, see the documentation here
More accurate transcription of short utterances of the word "I" in English
More accurate transcription of acronyms in English
Improvements to capitalization for English transcription
Improved accuracy when transcribing audio with periods of silence
Channel Diarization now supports up to 100 separate input channels

Fixes for specific transcription accuracy issues in English, German, Swedish and Norwegian
Fix for issue affecting recognition of English words ending in 'erm'
Fixed an error with custom dictionary when the content is only a "-"
Fix for transcribed words returned during non-speech audio when Custom Dictionary is used
Security fixes

Speechmatics