OpInfo: norm #59259
Conversation
💊 CI failures summary: as of commit 80de388, ci.pytorch.org reported 1 failure (more details on the Dr. CI page).
```python
cases_negdim.append((shape, tuple(new_args), name.replace("_dim", "_neg_dim")))

def generator():
    if sample_types == 'default':
```
Why not separate these sample input sets into different sample input functions instead of using this if/elif statement to choose between them?
Since they all belong to the same operator, I thought it was OK to have them in one function, and the cases tuples are fairly separate. Let me know if we should split them into multiple functions.
I think it's more readable if they're separate. torch.norm is really a "clearing house" for multiple functions behind the scenes.
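One way to realize the suggested split, sketched here in plain Python. The case data and function bodies are illustrative stand-ins, not the PR's real code, which builds `SampleInput` objects via `make_tensor`:

```python
# Sketch: one sample-input function per norm variant, replacing a single
# generator that branches on a sample_types argument. The shapes, args,
# and names below are placeholder data for illustration.

def sample_inputs_norm(shapes=((2, 3), (4,))):
    # default variant: ordinary vector/matrix p-norms
    for shape in shapes:
        yield {"shape": shape, "args": (2,), "name": "2-norm"}

def sample_inputs_norm_nuc(shapes=((3, 3),)):
    # nuclear-norm variant: 2-D inputs only
    for shape in shapes:
        yield {"shape": shape, "args": ("nuc",), "name": "nuclear"}

def sample_inputs_norm_fro(shapes=((2, 2),)):
    # frobenius variant: the default when no p is given
    for shape in shapes:
        yield {"shape": shape, "args": ("fro",), "name": "frobenius"}
```

Each variant `OpInfo` entry would then point at its own function through `sample_inputs_func`, removing the `if/elif` chain entirely.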
```python
elif sample_types == 'nuc':
    for shape, args, name in cases_nuc:  # type: ignore[assignment]
        yield SampleInput(make_arg(shape), args=args, name=name)
elif sample_types == 'jit':
```
a "jit" category is a little odd
This is mostly ad hoc for the cases where JIT failed. 😛 (Though renaming "jit" to "fro" does make sense.)
```python
    )
),
OpInfo('norm',
       variant_test_name='jit',
```
As mentioned above this variant seems a little weird.
Ideally variants would correspond to different code paths with different properties. So if nuclear norm and frobenius norm are actually different functions with different properties then they can have different OpInfos.
Maybe this can become the frobenius variant?
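Under that proposal, the database entries might look like the following sketch. `OpInfoSketch` is a minimal stand-in for torch's real `OpInfo` class, with fields abbreviated for illustration:

```python
from dataclasses import dataclass

@dataclass
class OpInfoSketch:
    # Minimal stand-in for torch's OpInfo: just enough fields to show
    # how per-variant entries would replace the single 'jit' variant.
    name: str
    variant_test_name: str = ""

op_db_sketch = [
    OpInfoSketch('norm'),                           # default p-norms
    OpInfoSketch('norm', variant_test_name='nuc'),  # nuclear norm
    OpInfoSketch('norm', variant_test_name='fro'),  # frobenius, replacing 'jit'
]
```

Each entry would also carry its own `sample_inputs_func`, so the variants map one-to-one onto the distinct code paths.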
(all of its inputs use the frobenius norm, since the default for norm is frobenius)
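For reference, the Frobenius norm of a matrix is just the entrywise 2-norm of its elements, which is why the default inputs all exercise the same path. A quick check in plain Python:

```python
import math

def frobenius_norm(matrix):
    # Square root of the sum of squared entries, i.e. the 2-norm of
    # the flattened matrix.
    return math.sqrt(sum(x * x for row in matrix for x in row))

m = [[3.0, 0.0], [0.0, 4.0]]
print(frobenius_norm(m))  # 5.0
```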
They have different code-paths.
pytorch/aten/src/ATen/native/cpu/ReduceOpsKernel.cpp, lines 199 to 277 at commit 44c20ce:
```cpp
static void norm_kernel_tensor_iterator_impl(
    TensorIterator& iter,
    const Scalar& p) {
  // NOLINTNEXTLINE(cppcoreguidelines-init-variables)
  float val;
  if (p.isIntegral(false)) {
    val = p.to<int64_t>();
  } else if (p.isFloatingPoint()) {
    // NOLINTNEXTLINE(cppcoreguidelines-narrowing-conversions,bugprone-narrowing-conversions)
    val = p.to<double>();
  } else {
    AT_ERROR("norm_kernel_tensor_iterator_impl expects norm to be integer or float");
  }
  // In the dispatch code blocks below, reduction kernels accumulate results as
  // the type `acc_t`. When `scalar_t` is complex, `acc_t` is the downgraded
  // real number type. Otherwise, `acc_t` and `scalar_t` are the same type.
  if (val == 0) {
    AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES_AND2(kHalf, kBFloat16, iter.input_dtype(), "norm_cpu", [&] {
      using acc_t = typename scalar_value_type<scalar_t>::type;
      binary_kernel_reduce(
          iter,
          NormZeroOps<scalar_t, acc_t>(),
          acc_t(0));
    });
  } else if (val == 1) {
    AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES_AND2(kHalf, kBFloat16, iter.input_dtype(), "norm_cpu", [&] {
      using acc_t = typename scalar_value_type<scalar_t>::type;
      binary_kernel_reduce(
          iter,
          NormOneOps<scalar_t, acc_t>(),
          acc_t(0));
    });
  } else if (val == 2) {
    AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES_AND2(kHalf, kBFloat16, iter.input_dtype(), "norm_cpu", [&] {
      using acc_t = typename scalar_value_type<scalar_t>::type;
      binary_kernel_reduce(
          iter,
          NormTwoOps<scalar_t, acc_t>(),
          acc_t(0));
    });
  } else if (val == INFINITY) {
    AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES_AND2(kHalf, kBFloat16, iter.input_dtype(), "norm_cpu", [&] {
      using acc_t = typename scalar_value_type<scalar_t>::type;
      binary_kernel_reduce(
          iter,
          AbsMaxOps<scalar_t, acc_t>(),
          acc_t(0));
    });
  } else if (val == -INFINITY) {
    AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES_AND2(kHalf, kBFloat16, iter.input_dtype(), "norm_cpu", [&] {
      using acc_t = typename scalar_value_type<scalar_t>::type;
      binary_kernel_reduce(
          iter,
          AbsMinOps<scalar_t, acc_t>(),
          std::numeric_limits<acc_t>::max());
    });
  } else {
    AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES_AND2(kHalf, kBFloat16, iter.input_dtype(), "norm_cpu", [&] {
      using acc_t = typename scalar_value_type<scalar_t>::type;
      binary_kernel_reduce(
          iter,
          NormOps<scalar_t, acc_t> { acc_t(val) },
          acc_t(0));
    });
  }
  // For complex outputs, the above kernels do not touch the imaginary values,
  // so we must zero them out
  if (isComplexType(iter.output().scalar_type())) {
    at::imag(iter.output()).zero_();
  }
}
```
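The branch structure of that kernel can be mirrored in a few lines of plain Python. This is a reference sketch for real-valued 1-D input only, not the actual ATen implementation, which reduces over TensorIterator with per-p functor types:

```python
import math

def norm_reference(values, p=2.0):
    # Mirrors the specialization in norm_kernel_tensor_iterator_impl:
    # dedicated branches for p = 0, 1, 2, +inf, -inf, then a general
    # fallback for any other p.
    if p == 0:
        # 0-"norm": count of nonzero entries
        return float(sum(1 for v in values if v != 0))
    if p == 1:
        return sum(abs(v) for v in values)
    if p == 2:
        return math.sqrt(sum(v * v for v in values))
    if p == math.inf:
        return max(abs(v) for v in values)
    if p == -math.inf:
        return min(abs(v) for v in values)
    return sum(abs(v) ** p for v in values) ** (1.0 / p)

v = [3.0, -4.0]
print(norm_reference(v, 2))         # 5.0
print(norm_reference(v, math.inf))  # 4.0
```

The specialized functors (`NormZeroOps`, `NormOneOps`, `NormTwoOps`, `AbsMaxOps`, `AbsMinOps`) exist because the common p values admit cheaper accumulation than the general `abs(x)**p` path.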
```python
def sample_inputs_norm(op_info, device, dtype, requires_grad, sample_types='default', **kwargs):
    make_arg = partial(make_tensor, device=device, dtype=dtype, requires_grad=requires_grad)

    cases_nuc = (
```
This might be more readable if there was just one sample input func per norm variant.
Another option would be to try and create a more generic generation for norm inputs. That seems tricky, however, and ultimately we plan to be rid of torch.norm, so extending its test coverage isn't especially interesting.
Sure! Will split the sample func.
This is a challenging and significant function to OpInfo and I think this PR is most of the way there, @kshitij12345. I made a few inline comments for readability, and I'm curious to hear your thoughts. Basically I suggest renaming "jit" to "fro" and cutting up the sample inputs for the different OpInfos to make the code a little more straightforward.
@mruberry I have addressed the questions above.

Thanks!
Nice work, @kshitij12345!
@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary:
Reference: pytorch#54261

EDIT: ~~Test takes a whopping 4 mins to run 😓~~ (Filtered tests also included linalg norm.) Newly added tests take around 2 mins.

```
==================================================== 193 passed, 224 skipped, 27224 deselected, 5 warnings in 138.87s (0:02:18) ====================================================
```

Pull Request resolved: pytorch#59259
Reviewed By: jbschlosser
Differential Revision: D28833962
Pulled By: mruberry
fbshipit-source-id: 40b24d6a8cb8b7d231b2f6b34b87cee4f136c5f9