June 25th, 2024

Real-Time SaaS

New

Disfluency removal: automatically remove disfluencies from your transcript. Refer to the documentation to get started.
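As a rough illustration of enabling this feature, the sketch below builds a StartRecognition message with disfluency removal switched on. The `transcript_filtering_config` and `remove_disfluencies` field names, the audio format values, and the helper name `build_start_recognition` are assumptions that are not stated in these notes; check them against the documentation before relying on them.

```python
import json


def build_start_recognition(language: str = "en", remove_disfluencies: bool = True) -> str:
    """Build a StartRecognition message as a JSON string (hypothetical helper)."""
    message = {
        "message": "StartRecognition",
        # Assumed audio settings, chosen only to make the example complete.
        "audio_format": {"type": "raw", "encoding": "pcm_s16le", "sample_rate": 16000},
        "transcription_config": {
            "language": language,
            # Assumed field names for the disfluency removal setting.
            "transcript_filtering_config": {"remove_disfluencies": remove_disfluencies},
        },
    }
    return json.dumps(message)


if __name__ == "__main__":
    print(build_start_recognition())
```

The resulting string would be sent as the first message on the WebSocket connection, before any audio.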

Improvements

  • Initial accuracy improvements from our Ursa2 uplift; further improvements will follow in the next few weeks
  • Improved transcription accuracy and updated vocabulary for 31 languages (Enhanced Operating Point only): Bashkir (ba), Basque (eu), Belarusian (be), Bulgarian (bg), Cantonese (yue), Catalan (ca), Danish (da), Esperanto (eo), Estonian (et), Finnish (fi), French (fr), Galician (gl), Greek (el), Hindi (hi), Indonesian (id), Interlingua (ia), Japanese (ja), Korean (ko), Latvian (lv), Malay (ms), Marathi (mr), Mongolian (mn), Norwegian (no), Romanian (ro), Slovenian (sl), Spanish (es), Swedish (sv), Turkish (tr), Ukrainian (uk), Uyghur (ug), Vietnamese (vi)
  • Updated vocabulary for English (Enhanced Operating Point only)
  • Improved transcription accuracy around endpoints, especially for lower values of max_delay
  • When a Final transcript omits words that appeared in previous Partials, an AddPartialTranscript message containing the missing words is now sent immediately after the Final
  • Start and end times in AddTranscript and AddPartialTranscript messages are now always rounded to 2 decimal places
  • Improved music detection accuracy in Audio Events
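
The change to Partials after a Final affects how a client assembles its display text. A minimal sketch of one way to handle it is shown below: Finals are appended, and each Partial replaces the previous one, so a Partial that arrives immediately after a Final simply carries the missing words forward. The `TranscriptAssembler` class is hypothetical, and the assumption that each message carries its text under `metadata.transcript` should be verified against the API reference.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class TranscriptAssembler:
    """Tracks finalized text plus the most recent partial text (hypothetical helper)."""

    finals: list[str] = field(default_factory=list)
    partial: str = ""

    def handle(self, msg: dict) -> None:
        kind = msg.get("message")
        # Assumed location of the transcript text within a message.
        text = msg.get("metadata", {}).get("transcript", "")
        if kind == "AddTranscript":
            # A Final supersedes whatever Partial was being displayed.
            self.finals.append(text)
            self.partial = ""
        elif kind == "AddPartialTranscript":
            # Partials replace each other. This includes a Partial sent
            # right after a Final to carry words the Final left out.
            self.partial = text

    def display(self) -> str:
        # Join non-empty pieces with single spaces.
        parts = [t.strip() for t in self.finals + [self.partial] if t.strip()]
        return " ".join(parts)
```

With this approach the client needs no special case for the new message: clearing the partial on each Final and then accepting the follow-up Partial keeps the words visible until a later Final confirms them.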