Miguel Jetté’s Post

Head of AI @ Circle Medical | Healthcare, ASR, NLU, & genAI

The Rev team has recently released another great open source paper and dataset. Proud of the work we did over all those years and the work they continue to do! Congrats to you all! Very proud and love watching these releases come to life! Corey Miller, Miguel del Rio Fernandez, Nishchal Bhandari, Martin Ratajczak, Danny Chen, and Quinn McNamara!

Paper: https://lnkd.in/gTqrnikD
GitHub: https://lnkd.in/gdFuegYt

"Word error rate (WER) as a metric has a variety of limitations that have plagued the field of speech recognition. Evaluation datasets suffer from varying style, formality, and inherent ambiguity of the transcription task. In this work, we attempt to mitigate some of these differences by performing style-agnostic evaluation of ASR systems using multiple references transcribed under opposing style parameters. As a result, we find that existing WER reports are likely significantly over-estimating the number of contentful errors made by state-of-the-art ASR systems. In addition, we have found our multireference method to be a useful mechanism for comparing the quality of ASR models that differ in the stylistic makeup of their training data and target task."

#asr #speechrecognition #speech #wer #speechrec #rev
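To give a feel for why a single reference can overstate errors, here is a minimal Python sketch. It is not the paper's method: it simply scores a hypothesis against two hypothetical references (one verbatim, one cleaned up) and keeps the lowest WER. The function names and the example transcripts are assumptions made up for illustration.

```python
def edit_distance(ref_words, hyp_words):
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp_words) + 1))
    for i, r in enumerate(ref_words, start=1):
        curr = [i]
        for j, h in enumerate(hyp_words, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Standard WER: word edit distance divided by reference length."""
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hyp_words) / len(ref_words)


def min_wer_over_references(references, hypothesis):
    """Simplified stand-in for multi-reference scoring: best WER over all references."""
    return min(wer(ref, hypothesis) for ref in references)


# Hypothetical transcripts of the same audio in two opposing styles.
verbatim = "um i think we are going to go"
clean = "i think we're going to go"
hypothesis = "i think we're going to go"

single_ref_wer = wer(verbatim, hypothesis)
multi_ref_wer = min_wer_over_references([verbatim, clean], hypothesis)
print(single_ref_wer, multi_ref_wer)
```

Against the verbatim reference alone, the hypothesis is charged for dropping the filler word and for using a contraction, even though nothing contentful is wrong. Scoring against both styles removes that penalty in this toy case.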
