Speaker Diarization has been completely re-designed internally and should now be significantly more accurate
Instead of gendered speaker labels (M1, F2) speaker labels will be now (S1, S2 etc.) in the json-v2 and txt output. Speaker gender identification is no longer a supported feature
If requesting an output in txt format, and requesting no diarization, there will be no Speaker:UU at the start of a transcript
Users may still request Speaker Diarization as before via the configuration object
Beta sensitivity parameters will be removed. The parameters will remain within the API but will not have any effect
This update to Speaker Diarization feature can mean the turnaround time for your transcript will in some cases take longer
Improved Swedish and Arabic language packs, both now have advanced punctuation enabled (Swedish supports . ? , ! and Arabic supports . Ψ Ψ !)
For the English language pack only, a new tag, [disfluency] has been added to a pre-set list of words that imply hesitation or interjection in the JSON-v2 output only. Examples include 'hmm' and 'umm'. Customers may use this tag to carry out their own post-processing