Initial improvements from our Ursa2 accuracy uplift; note that further improvements are on the way in the next few weeks
Improved transcription accuracy and updated vocabulary for 31 languages (Enhanced Operating Point only): Bashkir (ba), Basque (eu), Belarusian (be), Bulgarian (bg), Cantonese (yue), Catalan (ca), Danish (da), Esperanto (eo), Estonian (et), Finnish (fi), French (fr), Galician (gl), Greek (el), Hindi (hi), Indonesian (id), Interlingua (ia), Japanese (ja), Korean (ko), Latvian (lv), Malay (ms), Marathi (mr), Mongolian (mn), Norwegian (no), Romanian (ro), Slovenian (sl), Spanish (es), Swedish (sv), Turkish (tr), Ukrainian (uk), Uyghur (ug), Vietnamese (vi)
Updated vocabulary for English (Enhanced Operating Point only)
Improved music detection accuracy in Audio Events
Fixes
Fix for occasional incorrect repetition of words in transcription output when Audio Events is enabled
Fix for missing volume tags in output when Audio Filtering is enabled along with Speaker Diarization