KEMBAR78
Move compile into AutoParallel API by wconstab · Pull Request #77 · meta-pytorch/autoparallel · GitHub
Skip to content

Conversation

@wconstab
Copy link
Contributor

@wconstab wconstab commented Aug 4, 2025

Requires upstream pytorch/pytorch#159814 to land first. (done)

also updates joint_with_descriptors meta info to reflect that arg/placeholder shapes have changed (inputs and params are now sharded, not global, as they were when first traced).

kicked off a verification run from this PR @ acaaf9c: https://www.internalfb.com/mlhub/pipelines/runs/mast/torchtitan-64-whc-kf1llhnr

tbm FSDP_eager:torchtitan-64-whc-p3s1bn compile_pr:torchtitan-64-whc-kf1llhnr compile_noac_from_update_post:torchtitan-64-fmassa-r4rvfnf6

hmm. this compile PR has higher memory usage than the compile_noac job we had last week.. what changed..
rerun w/ memory profiling: https://www.internalfb.com/mlhub/pipelines/runs/mast/torchtitan-64-whc-k3l2mcd

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 4, 2025
Copy link
Contributor

@fmassa fmassa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

Copy link
Contributor

@fmassa fmassa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@wconstab wconstab force-pushed the whc/compile branch 2 times, most recently from b07c2a6 to 27583b4 Compare August 7, 2025 16:58
@wconstab
Copy link
Contributor Author

wconstab commented Aug 7, 2025

ah crap. this change actually isn't possible to land in isolation becuase the meta info needs to be updated before compile will even run. I'll merge my meta-fixup PR into this PR.

Update node metas after parallelization
@wconstab wconstab merged commit 75cef61 into main Aug 8, 2025
6 checks passed
@wconstab wconstab deleted the whc/compile branch August 8, 2025 03:06
@fmassa fmassa mentioned this pull request Aug 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants