[ROCM] fix bug in #60313 #61073

micmelesse · 2021-06-30T20:54:15Z

This PR fixes a bug in #60313. Where the tensors generated by _generate_valid_rocfft_input are on the cpu instead of the gpu. This was due to using numpy to generate tensors and converting it to pytorch using torch.from_numpy. This leads to the generated tensors staying on the cpu. We now generate the tensors using pytorch itself which carries over the device type of the input tensors to the generated tensor.

facebook-github-bot · 2021-06-30T20:54:20Z

💊 CI failures summary and remediations

As of commit 3b2a6a9 (more details on the Dr. CI page and at hud.pytorch.org/pr/61073):

1/1 failures introduced in this PR

🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

Windows CI (pytorch-win-vs2019-cuda10-cudnn7-py3) / test (default, 2, 2, windows.8xlarge.nvidia.gpu) (1/1)

Step: "Install Cuda" (full log | diagnosis details | 🔁 rerun)

2021-07-09T01:46:52.7093405Z ls: cannot access ...UDA/v10.1/bin/nvcc.exe': No such file or directory

2021-07-09T01:46:51.3709010Z 
2021-07-09T01:46:51.3709476Z Folders: 11
2021-07-09T01:46:51.3709817Z Files: 130
2021-07-09T01:46:51.3710257Z Size:       907512
2021-07-09T01:46:51.3711016Z Compressed: 111420
2021-07-09T01:46:51.3770010Z + mkdir -p 'C:/Program Files/NVIDIA Corporation/NvToolsExt'
2021-07-09T01:46:51.4185990Z + cp -r NvToolsExt/bin NvToolsExt/docs NvToolsExt/include NvToolsExt/lib NvToolsExt/samples 'C:/Program Files/NVIDIA Corporation/NvToolsExt/'
2021-07-09T01:46:52.6904200Z + export 'NVTOOLSEXT_PATH=C:\Program Files\NVIDIA Corporation\NvToolsExt\'
2021-07-09T01:46:52.6905881Z + NVTOOLSEXT_PATH='C:\Program Files\NVIDIA Corporation\NvToolsExt\'
2021-07-09T01:46:52.6907325Z + ls '/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin/nvcc.exe'
2021-07-09T01:46:52.7093405Z ls: cannot access '/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin/nvcc.exe': No such file or directory
2021-07-09T01:46:52.7099154Z + echo 'CUDA installation failed'
2021-07-09T01:46:52.7100189Z + mkdir -p /c/w/build-results
2021-07-09T01:46:52.7101063Z CUDA installation failed
2021-07-09T01:46:52.7292416Z + 7z a 'c:\w\build-results\cuda_install_logs.7z' cuda_install_logs
2021-07-09T01:46:53.1640507Z 
2021-07-09T01:46:53.1641724Z 7-Zip 19.00 (x64) : Copyright (c) 1999-2018 Igor Pavlov : 2019-02-21
2021-07-09T01:46:53.1642317Z 
2021-07-09T01:46:53.1643051Z Scanning the drive:
2021-07-09T01:46:53.1643845Z 1 folder, 2 files, 4189346 bytes (4092 KiB)
2021-07-09T01:46:53.1644159Z

Preview docs built from this PR

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

malfet

This affects ROCm only, so I trust you know what you are doing.
But frankly, it feels like rather than documenting limitations of ROCfft support, PR is trying to avoid testing those corner cases

malfet · 2021-07-12T21:41:09Z

test/test_spectral_ops.py

    # rocFFT requires/assumes that the input to hipfftExecC2R or hipfftExecZ2D
    # is of the form that is a valid output from a real to complex transform
    # (i.e. it cannot be a set of random numbers)
    # So for ROCm, call np.fft.rfftn and use its output as the input


This comment seems out of date, isn't it?

facebook-github-bot · 2021-07-12T21:42:02Z

@malfet has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2021-07-13T14:09:48Z

@malfet merged this pull request in ac6ec0e.

force tensor to cuda

54e51a3

facebook-github-bot added the cla signed label Jun 30, 2021

github-actions bot added the module: rocm AMD GPU support for Pytorch label Jun 30, 2021

pytorchbot added the open source label Jun 30, 2021

micmelesse added 5 commits June 30, 2021 18:44

match input tensor device

d81f039

fix typo

74861e6

Merge branch 'master' into force_to_gpu

c28213f

Merge branch 'master' into force_to_gpu

4a9bfee

gen input if fft_size is odd

c4e6086

micmelesse changed the title ~~[ROCM] force tensor to cuda~~ [ROCM] fix bug in #60313 Jul 9, 2021

fix white space

3b2a6a9

micmelesse marked this pull request as ready for review July 9, 2021 17:38

ezyang requested a review from malfet July 12, 2021 13:05

ezyang added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Jul 12, 2021

malfet approved these changes Jul 12, 2021

View reviewed changes

malfet reviewed Jul 12, 2021

View reviewed changes

facebook-github-bot closed this in ac6ec0e Jul 13, 2021

facebook-github-bot added the Merged label Jul 13, 2021

micmelesse mentioned this pull request Nov 19, 2021

[ROCM] enable fft tests #60313

Closed

micmelesse mentioned this pull request Dec 8, 2021

c2r fft input generation #69627

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ROCM] fix bug in #60313 #61073

[ROCM] fix bug in #60313 #61073

micmelesse commented Jun 30, 2021 •

edited

Loading

Uh oh!

facebook-github-bot commented Jun 30, 2021 •

edited

Loading

Uh oh!

malfet left a comment

Uh oh!

malfet Jul 12, 2021

Uh oh!

facebook-github-bot commented Jul 12, 2021

Uh oh!

facebook-github-bot commented Jul 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[ROCM] fix bug in #60313 #61073

[ROCM] fix bug in #60313 #61073

Conversation

micmelesse commented Jun 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented Jun 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CI failures summary and remediations

🕵️ 1 new failure recognized by patterns

Windows CI (pytorch-win-vs2019-cuda10-cudnn7-py3) / test (default, 2, 2, windows.8xlarge.nvidia.gpu) (1/1)

Uh oh!

malfet left a comment

Choose a reason for hiding this comment

Uh oh!

malfet Jul 12, 2021

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Jul 12, 2021

Uh oh!

facebook-github-bot commented Jul 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

micmelesse commented Jun 30, 2021 •

edited

Loading

facebook-github-bot commented Jun 30, 2021 •

edited

Loading