KEMBAR78
support fast small T for broadcasting by brad-mengchi · Pull Request #1902 · pytorch/FBGEMM · GitHub
Skip to content

Conversation

@brad-mengchi
Copy link
Contributor

Summary: As titled, speedup B=1 and small T (T<=320) case, since current implementation only benefits large B * T.

Differential Revision: D47899778

Summary: As titled, speedup B=1 and small T (T<=320) case, since current implementation only benefits large B * T.

Differential Revision: D47899778

fbshipit-source-id: 79eacc67fa1e7e5aaf4128e3a70677e99e388f03
@netlify
Copy link

netlify bot commented Jul 30, 2023

Deploy Preview for pytorch-fbgemm-docs canceled.

Name Link
🔨 Latest commit 4e3debc
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/64c5bfe3f3b5450008973f87

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D47899778

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 26cc4df.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants