Samples of our denoising system

Samples at average SNR

Dataset               Noisy input SNR (dB)  Noisy input human MOS  Human MOS  pMOS
WHAMVox_easy (0008)   4.703704              2.900                  4.575      4.371693
WHAMVox_hard (1885)   1.961483              3.175                  4.525      4.506654
Valentini (p257_291)  -1.307026             2.425                  3.975      3.343298

Sample                Original (unprocessed)  Processed                  Label
WHAMVox_easy (0008)   0008.wav                nTqGP_0008_processed.wav   0008_label.wav
WHAMVox_hard (1885)   1885.wav                nTqGP_1885_processed.wav   1885_label.wav
Valentini (p257_291)  p257_291.wav            p257_291_processed.wav     p257_291_label.wav
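
The page does not state how the "Noisy input SNR" column was computed. As a rough sketch only, one common definition treats the label (clean) recording as the signal and the difference between the noisy input and the label as the noise. The function name `snr_db` and the synthetic tones below are illustrative assumptions, not part of the original system, and the sketch assumes the two recordings are time-aligned and that the noise is not all zeros.

```python
import math

def snr_db(clean, noisy):
    """SNR in dB of `noisy` against a time-aligned clean reference.

    Assumed definition: 10 * log10(sum(clean^2) / sum((noisy - clean)^2)).
    """
    n = min(len(clean), len(noisy))  # tolerate small length mismatches
    signal_power = sum(s * s for s in clean[:n])
    noise_power = sum((x - s) ** 2 for s, x in zip(clean[:n], noisy[:n]))
    return 10.0 * math.log10(signal_power / noise_power)

# Synthetic check (hypothetical data): a 220 Hz tone plus a 50 Hz tone
# at one tenth of its amplitude, one second at 16 kHz.
sr = 16000
clean = [math.sin(2 * math.pi * 220 * t / sr) for t in range(sr)]
noise = [0.1 * math.sin(2 * math.pi * 50 * t / sr) for t in range(sr)]
noisy = [s + v for s, v in zip(clean, noise)]

print(round(snr_db(clean, noisy), 3))
```

An amplitude ratio of 10 corresponds to a power ratio of 100, so the synthetic pair sits at about 20 dB, comparable to the high-SNR samples. Applying the same function to a real pair such as `0008.wav` and `0008_label.wav` would first require decoding both files to sample arrays at the same rate.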

Samples at high SNR (~20 dB)

Dataset               Noisy input SNR (dB)  Noisy input human MOS  Human MOS  pMOS
WHAMVox_hard (0755)   20.137263             3.400                  4.725      4.656920
WHAMVox_easy (0996)   19.023238             4.425                  4.850      4.604721
WHAMVox_easy (1597)   18.667135             3.725                  4.875      4.496887

Sample                Original (unprocessed)  Processed            Label
WHAMVox_hard (0755)   0755.wav                0755_processed.wav   0755_label.wav
WHAMVox_easy (0996)   0996.wav                0996_processed.wav   0996_label.wav