July 9th, 2024
Batch Appliance
GPU & CPU
New Virtual Appliance architecture with support for our latest generation of Ursa GPU models
New transcription language - Hebrew (he)
New transcription language - Persian (fa)
Automatic Usage Reporting is enabled by default
GPU
GPU Ursa models - all 50 languages are now available on GPU
Major transcription accuracy gains
Major improvement in Speaker Diarization accuracy
Faster transcription
Bilingual Spanish and English language pack - this enables Spanish and English to be transcribed accurately within the same file
Audio Events: detection of music, laughter and applause in media files is now supported. Refer to the Audio Events documentation to get started
Improved transcription accuracy for English, Norwegian, Romanian, Basque, Belarusian, Estonian, Mongolian, Thai, Vietnamese, and Welsh
Enhanced transcription of disfluencies in English. The model now captures common disfluencies such as "um" and "uh" more accurately. This improves verbatim transcription for use cases such as audio editing, hesitation analytics for call centers, and legal transcription. For details on how to identify disfluencies in the output, see the disfluencies documentation
More accurate transcription of short utterances of the word "I" in English
More accurate transcription of acronyms in English
Improvements to capitalization for English transcription
Improved accuracy when transcribing audio with periods of silence
Channel Diarization now supports up to 100 separate input channels
Fixes for specific transcription accuracy issues in English, German, Swedish and Norwegian
Fix for an issue affecting recognition of English words ending in 'erm'
Fix for a Custom Dictionary error when the content is only a "-"
Fix for transcribed words returned during non-speech audio when Custom Dictionary is used
Security fixes
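
The enhanced disfluency transcription noted above is useful for verbatim output, but some consumers may prefer a cleaned transcript. The following is a minimal sketch of how disfluencies could be filtered out after transcription. It assumes a JSON transcript in which each entry under `results` has a `type` and a list of `alternatives`, and in which disfluent words carry a `"disfluency"` entry in a `tags` list. That shape, the `strip_disfluencies` helper, and the sample data are illustrative assumptions, not taken from these release notes, so check the output format documentation before relying on them.

```python
def strip_disfluencies(transcript: dict) -> str:
    """Join transcript words, skipping any word tagged as a disfluency.

    Assumed (hypothetical) transcript shape:
    {"results": [{"type": "word" | "punctuation",
                  "alternatives": [{"content": str, "tags": [str, ...]}]}]}
    """
    words = []
    for item in transcript.get("results", []):
        alternatives = item.get("alternatives") or []
        if not alternatives:
            continue
        best = alternatives[0]  # use the first alternative only
        if "disfluency" in best.get("tags", []):
            continue  # drop words such as "um" and "uh"
        if item.get("type") == "punctuation" and words:
            words[-1] += best["content"]  # attach punctuation to previous word
        else:
            words.append(best["content"])
    return " ".join(words)


# Hand-written sample data, not real engine output.
sample = {
    "results": [
        {"type": "word", "alternatives": [{"content": "So", "tags": []}]},
        {"type": "word", "alternatives": [{"content": "um", "tags": ["disfluency"]}]},
        {"type": "word", "alternatives": [{"content": "yes"}]},
        {"type": "punctuation", "alternatives": [{"content": "."}]},
    ]
}

print(strip_disfluencies(sample))  # prints "So yes."
```

Filtering on the tag rather than on a fixed word list keeps the decision with the engine, so a word such as "like" is only removed when it was actually marked as a disfluency.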