[c10d] Start deprecating *_multigpu APIs #85961

kwen2501 · 2022-09-30T08:44:18Z

Deprecation reasons:

For most users training is on one GPU per process so these APIs are rarely used
They added one more API dimension
They can be expressed in a composed manner
They are not abstracted – specific to GPU
They caused backend APIs and implementations to have nested std::vector<std::vector<Tensor>>, which is hard to read or maintain

pytorch-bot · 2022-09-30T08:44:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/85961

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures, 1 Pending

As of commit 217d2c5:

The following jobs have failed:

linux-bionic-cuda11.6-py3.10-gcc7 / test (default, 1, 4, linux.4xlarge.nvidia.gpu)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

kwen2501 · 2022-09-30T15:29:06Z

The "CUDA out of memory" CI error is unrelated

XilunWu

lgtm. thx for working on it!

H-Huang · 2022-09-30T20:21:06Z

torch/distributed/distributed_c10d.py


    """
+    warnings.warn(
+        "torch.distributed.broadcast_multigpu will be deprecated. If you must "


link to docs? (https://pytorch.org/docs/master/distributed.html#collective-functions)

Thanks. Added

kwen2501 · 2022-10-01T00:55:47Z

@pytorchbot merge -f 'The CUDA out of memory error is unrelated to this change'

pytorchmergebot · 2022-10-01T00:59:37Z

@pytorchbot successfully started a merge job. Check the current status here.
The merge job was triggered with the force (-f) flag. This means your change will be merged immediately, bypassing any CI checks (ETA: 1-5 minutes). If this is not the intended behavior, feel free to use some of the other merge options in the wiki.
Please reach out to the PyTorch DevX Team with feedback or questions!

### Deprecation reasons: - For most users training is on one GPU per process so these APIs are rarely used - They added one more API dimension - They can be expressed in a composed manner - They are not abstracted – specific to GPU - They caused backend APIs and implementations to have nested `std::vector<std::vector<Tensor>>`, which is hard to read or maintain Pull Request resolved: #85961 Approved by: https://github.com/XilunWu, https://github.com/H-Huang

Inject deprecation messages for *_multigpu APIs

0e2c4b5

kwen2501 requested review from H-Huang, awgu, mingzhe09088, mrshenli, pritamdamania87, rohan-varma and zhaojuanmao as code owners September 30, 2022 08:44

pytorch-bot bot added the release notes: distributed (c10d) release notes category label Sep 30, 2022

facebook-github-bot added cla signed oncall: distributed Add this issue/PR to distributed oncall triage queue labels Sep 30, 2022

XilunWu approved these changes Sep 30, 2022

View reviewed changes

H-Huang reviewed Sep 30, 2022

View reviewed changes

H-Huang approved these changes Sep 30, 2022

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 30, 2022

Add doc url

217d2c5

kwen2501 added the topic: deprecation topic category label Oct 1, 2022

pytorchmergebot added the Merged label Oct 1, 2022

pytorchmergebot closed this in 05d1128 Oct 1, 2022

github-actions bot deleted the c10d_deprecate_multigpu branch April 1, 2024 01:53

H-Huang mentioned this pull request Jul 31, 2025

[C10D] Document barrier interaction with device_id #159389

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[c10d] Start deprecating *_multigpu APIs #85961

[c10d] Start deprecating *_multigpu APIs #85961

Uh oh!

kwen2501 commented Sep 30, 2022

Uh oh!

pytorch-bot bot commented Sep 30, 2022 •

edited

Loading

Uh oh!

kwen2501 commented Sep 30, 2022

Uh oh!

XilunWu left a comment

Uh oh!

H-Huang Sep 30, 2022

Uh oh!

kwen2501 Oct 1, 2022

Uh oh!

kwen2501 commented Oct 1, 2022

Uh oh!

pytorchmergebot commented Oct 1, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[c10d] Start deprecating *_multigpu APIs #85961

[c10d] Start deprecating *_multigpu APIs #85961

Uh oh!

Conversation

kwen2501 commented Sep 30, 2022

Deprecation reasons:

Uh oh!

pytorch-bot bot commented Sep 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/85961

❌ 1 Failures, 1 Pending

Uh oh!

kwen2501 commented Sep 30, 2022

Uh oh!

XilunWu left a comment

Choose a reason for hiding this comment

Uh oh!

H-Huang Sep 30, 2022

Choose a reason for hiding this comment

Uh oh!

kwen2501 Oct 1, 2022

Choose a reason for hiding this comment

Uh oh!

kwen2501 commented Oct 1, 2022

Uh oh!

pytorchmergebot commented Oct 1, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

pytorch-bot bot commented Sep 30, 2022 •

edited

Loading