Improved models for English transcription (Standard and Enhanced operating points):
Enhanced transcription of disfluencies in English. The model now captures common disfluencies such as "um" and "uh" more accurately. This improves verbatim transcription for use cases such as audio editing, legal transcription, and call-center analytics on hesitations. For details on how to identify disfluencies in the output, see the documentation.
More accurate transcription of short utterances of the word "I" in English
More accurate transcription of acronyms in English
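
As a rough illustration of the hesitation-analytics and audio-editing use cases mentioned above, the sketch below counts and strips filler words from a list of transcribed words. It does not use the product's API or output format: the `FILLERS` set and both function names are hypothetical, and matching on word text is an assumption made for the example. The documentation describes how disfluencies are actually marked in the output.

```python
from collections import Counter

# Assumed filler words for this example only; extend as needed.
FILLERS = {"um", "uh"}


def _normalize(word):
    """Lowercase a word and drop surrounding punctuation for comparison."""
    return word.strip(".,!?").lower()


def count_disfluencies(words):
    """Return a Counter of filler words found in a list of transcribed words."""
    normalized = (_normalize(w) for w in words)
    return Counter(w for w in normalized if w in FILLERS)


def strip_disfluencies(words):
    """Return the words with fillers removed, e.g. for a cleaned-up transcript."""
    return [w for w in words if _normalize(w) not in FILLERS]


if __name__ == "__main__":
    transcript = "Um I think uh we should um go".split()
    print(count_disfluencies(transcript))
    print(" ".join(strip_disfluencies(transcript)))
```

Counting supports analytics on hesitations, while stripping gives a cleaner transcript when a verbatim record is not required.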