[TP][Inference] Enable DTensor TP inference #110751

fduwjj · 2023-10-06T19:32:07Z

Stack from ghstack (oldest at bottom):

-> [TP][Inference] Enable DTensor TP inference #110751

In #109977, we observed that during inference mode, aten.Linear does not get decomposed. So instead of enabling sharding propagation for linear op, we use func.decompose so that it gets decomposed to matmul and mm.

[ghstack-poisoned]

pytorch-bot · 2023-10-06T19:32:11Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110751

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (4 Unrelated Failures)

As of commit 7e716a8 with merge base a3e5ec4 ():

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torch/distributed/_tensor/api.py

test/distributed/tensor/parallel/test_tp_examples.py

In #109977, we observed that during inference mode, aten.Linear does not get decomposed. So instead of enabling sharding propagation for linear op, we use func.decompose so that it gets decomposed to matmul and mm. [ghstack-poisoned]

ghstack-source-id: 502aaf6 Pull Request resolved: #110751

torch/distributed/_tensor/api.py

bdhirsh

left light comments but lgtm!

In #109977, we observed that during inference mode, aten.Linear does not get decomposed. So instead of enabling sharding propagation for linear op, we use func.decompose so that it gets decomposed to matmul and mm. [ghstack-poisoned]

fduwjj · 2023-10-07T00:06:00Z

@pytorchbot merge

pytorchmergebot · 2023-10-07T00:07:46Z

Merge failed

Reason: This PR needs a release notes: label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Details for Dev Infra team

Raised by workflow job

In #109977, we observed that during inference mode, aten.Linear does not get decomposed. So instead of enabling sharding propagation for linear op, we use func.decompose so that it gets decomposed to matmul and mm. [ghstack-poisoned]

ghstack-source-id: 462029d Pull Request resolved: #110751

fduwjj · 2023-10-07T18:55:31Z

@pytorchbot merge -f "The failing test are not related to this PR."

pytorchmergebot · 2023-10-07T18:57:18Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Enable DTensor TP inference

8db1ec1

[ghstack-poisoned]

fduwjj requested review from H-Huang, awgu, d4l3k, fegin, kiukchung, kwen2501, mrshenli, rohan-varma, wanchaol, wz337 and zhaojuanmao as code owners October 6, 2023 19:32

wanchaol reviewed Oct 6, 2023

View reviewed changes

torch/distributed/_tensor/api.py Outdated Show resolved Hide resolved

wanchaol reviewed Oct 6, 2023

View reviewed changes

test/distributed/tensor/parallel/test_tp_examples.py Outdated Show resolved Hide resolved

fduwjj changed the title ~~Enable DTensor TP inference~~ [TP][Inference] Enable DTensor TP inference Oct 6, 2023

fduwjj added a commit that referenced this pull request Oct 6, 2023

Enable DTensor TP inference

7fdc5c5

ghstack-source-id: 502aaf6 Pull Request resolved: #110751

fduwjj requested a review from wanchaol October 6, 2023 21:25

bdhirsh reviewed Oct 6, 2023

View reviewed changes

torch/distributed/_tensor/api.py Show resolved Hide resolved

bdhirsh approved these changes Oct 6, 2023

View reviewed changes

wanchaol approved these changes Oct 6, 2023

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 7, 2023

pytorchmergebot added the merging label Oct 7, 2023

pytorchmergebot removed the merging label Oct 7, 2023

fduwjj mentioned this pull request Oct 7, 2023

[TP][Inference] Add aten.linear.default op implementation #109977

Closed

fduwjj added module: dtensor distributed tensor tag release notes: distributed (dtensor) release notes category labels Oct 7, 2023

fduwjj added the ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR label Oct 7, 2023

fduwjj added a commit that referenced this pull request Oct 7, 2023

Enable DTensor TP inference

e498c74

ghstack-source-id: 462029d Pull Request resolved: #110751

pytorchmergebot added the merging label Oct 7, 2023

pytorchmergebot added Merged and removed merging labels Oct 7, 2023

pytorchmergebot closed this in 2dc5e16 Oct 7, 2023

fduwjj mentioned this pull request Oct 8, 2023

[TP][Inference] Add decompose for matmul op #110833

Closed

facebook-github-bot deleted the gh/fduwjj/105/head branch October 11, 2023 14:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[TP][Inference] Enable DTensor TP inference #110751

[TP][Inference] Enable DTensor TP inference #110751

Uh oh!

fduwjj commented Oct 6, 2023 •

edited

Loading

Uh oh!

pytorch-bot bot commented Oct 6, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bdhirsh left a comment

Uh oh!

fduwjj commented Oct 7, 2023

Uh oh!

pytorchmergebot commented Oct 7, 2023

Uh oh!

fduwjj commented Oct 7, 2023

Uh oh!

pytorchmergebot commented Oct 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[TP][Inference] Enable DTensor TP inference #110751

[TP][Inference] Enable DTensor TP inference #110751

Uh oh!

Conversation

fduwjj commented Oct 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110751

✅ You can merge normally! (4 Unrelated Failures)

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bdhirsh left a comment

Choose a reason for hiding this comment

Uh oh!

fduwjj commented Oct 7, 2023

Uh oh!

pytorchmergebot commented Oct 7, 2023

Merge failed

Uh oh!

fduwjj commented Oct 7, 2023

Uh oh!

pytorchmergebot commented Oct 7, 2023

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fduwjj commented Oct 6, 2023 •

edited

Loading

pytorch-bot bot commented Oct 6, 2023 •

edited

Loading