Word error rate as a function of ρ (Speed Perturbation)

Source publication

Figure 1: Word error rate as a function of ρ (White noise)

Figure 2: Word error rate as a function of ρ (Speed Perturbation)

Figure 3: Word error rate as a function of l (Dropping chunks)

Figure 4: Word error rate as a function of k (Dropping chunks)

Figure 5: Word error rate as a function of ρ (White noise)

Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition

Preprint

Full-text available

Aug 2022

Goutham Rajendran
Wei Zou

We investigate robustness properties of pre-trained neural models for automatic speech recognition. Real life data in machine learning is usually very noisy and almost never clean, which can be attributed to various factors depending on the domain, e.g. outliers, random noise and adversarial noise. Therefore, the models we develop for various tasks...

Context 1

... perturbation speeds up or slows down the speech. For a given speech signal x and speed 100/ρ, f (x) is computed by resampling the audio signal without changing the sampling rate, using the technique in [21]. See Fig. 2 for the results. The plot conforms with our intuition that speech that is sped up or slowed ...

View in full-text

Word error rate as a function of ρ (Speed Perturbation)

Context in source publication