Move resample to functional and add librosa comparison by carolineechen · Pull Request #1402 · pytorch/audio · GitHub

Conversation

@carolineechen
Contributor

@carolineechen carolineechen commented Mar 19, 2021

  • add resample to torchaudio.functional, copied over from the kaldi compliance resample_waveform
  • add batching to the kaldi compliance interface resample_waveform, which now wraps functional.resample
  • add a test comparing torchaudio's resample against librosa.resample (a short usage sketch follows this list)
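
As a rough usage sketch of the new functional API (not the PR's exact implementation; the positional (waveform, orig_freq, new_freq) call is taken from the test snippets below, and support for extra leading dimensions is assumed from the transform's (..., time) docstring):

import torch
import torchaudio.functional as F

waveform = torch.randn(1, 16000)               # (channel, time) at 16 kHz
resampled = F.resample(waveform, 16000, 8000)  # downsample to 8 kHz -> roughly (1, 8000)

batched = torch.randn(3, 2, 16000)             # arbitrary leading dims, shape (..., time)
resampled_batched = F.resample(batched, 16000, 8000)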

lr_upsampled = librosa.resample(waveform.squeeze(0).numpy(), sample_rate, upsample_rate)
lr_upsampled = torch.from_numpy(lr_upsampled).unsqueeze(0)

self.assertEqual(ta_upsampled, lr_upsampled, atol=1e-4, rtol=1e-5)
Contributor Author

@vincentqb @cpuhrsch what's an acceptable margin of error for this comparison? Using these values gives an assertion error: Tensors failed to compare as equal! With rtol=1e-05 and atol=0.0001, found 28142 element(s) (out of 128000) whose difference(s) exceeded the margin of error (including 0 nan comparisons). The greatest difference was 0.006743520498275757 (0.23187145590782166 vs. 0.2386149764060974), which occurred at index (0, 127992).

Contributor

@vincentqb vincentqb Mar 19, 2021

You can look at the kaldi compliance tests for guidance. As you can see, the tolerances are fairly high there too.

Since we are not changing the algorithm as part of this PR, these new tests are informational, and we take the threshold that makes them pass (after we make sure that we match the correct parameters of course). The threshold therefore becomes new signal/information that we learned from this exercise.

Based on the numbers you gave, assuming we use the correct corresponding parameters for each resampling function, I'd say atol=1e-2, which is also used here.
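
As a side illustration of how such a threshold can be chosen (not part of the PR), one can measure the observed differences directly and pick atol/rtol comfortably above them; ta_upsampled and lr_upsampled here refer to the tensors compared in the test above:

diff = (ta_upsampled - lr_upsampled).abs()
max_abs = diff.max()                                          # observed worst absolute error
max_rel = (diff / lr_upsampled.abs().clamp(min=1e-12)).max()  # observed worst relative error
print(max_abs.item(), max_rel.item())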

Contributor

btw @carolineechen, as discussed offline with @cpuhrsch, what other parameters have you tried in librosa's resample? Let's follow up by opening a pull request that calls them explicitly :)

Contributor Author

I had only tried the librosa default values for this test, but yup, I can look into the other parameters
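
For reference, spelling the defaults out explicitly might look roughly like this (parameter names and values are assumed from librosa 0.8's resample signature, not taken from this PR):

lr_upsampled = librosa.resample(
    waveform.squeeze(0).numpy(),
    sample_rate,
    upsample_rate,
    res_type='kaiser_best',  # librosa's default resampling backend
    fix=True,
    scale=False,
)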

@carolineechen carolineechen marked this pull request as ready for review March 22, 2021 14:09
Contributor

@vincentqb vincentqb left a comment

Great, thanks! I'm glad to see this happening :)

self.assertEqual(ta_upsampled, lr_upsampled, atol=1e-2, rtol=1e-5)

ta_downsampled = F.resample(waveform, sample_rate, downsample_rate)
lr_downsampled = librosa.resample(waveform.squeeze(0).numpy(), sample_rate, downsample_rate)
Contributor

question for future work: why is squeeze needed before passing to librosa?

Contributor

Have you looked into this, @carolineechen?

Contributor Author

waveform has dimensions (1, n), but librosa.resample requires it to be of shape (n,) or (2, n)
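
As an illustrative sketch of bridging the two shape conventions (the helper name is hypothetical, not code from this PR):

import librosa
import torch

def resample_with_librosa(waveform, orig_freq, new_freq):
    # waveform: (1, time) in torchaudio's convention; librosa expects (time,) or (2, time)
    resampled = librosa.resample(waveform.squeeze(0).numpy(), orig_freq, new_freq)
    return torch.from_numpy(resampled).unsqueeze(0)  # restore the (1, time) layout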

        Returns:
            Tensor: Output signal of dimension (..., time).
        """
        if self.resampling_method == 'sinc_interpolation':
Contributor

note: we'll leave the discussion around migrating the resampling_method parameter to a follow-up PR

Contributor

@vincentqb vincentqb left a comment

Alrighty, LGTM overall :)

@carolineechen carolineechen merged commit 14dd917 into pytorch:master Mar 22, 2021
@carolineechen carolineechen deleted the resample branch March 22, 2021 20:08
mthrok pushed a commit to mthrok/audio that referenced this pull request Dec 13, 2022
The Building a Convolution/Batch Norm fuser in FX tutorial (https://pytorch.org/tutorials/intermediate/fx_conv_bn_fuser.html) was added for 1.8. This is a minor update to index.rst to have the tutorial show up in the left hand nav under the "Code Transforms with FX" category.