Split zeta_kernel out of BinaryMiscOpsKernel.cu by peterbell10 · Pull Request #62261 · pytorch/pytorch · GitHub

@peterbell10
Collaborator

BinaryMiscOpsKernel.cu takes 4 m 30 s to compile on my machine, making it the second-slowest file after PowKernel.cu. After moving the zeta kernel into its own file, the new file takes 3 m 30 s to compile and BinaryMiscOpsKernel.cu drops to 1 m.
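
The two new compile times add up to the original 4 m 30 s, so the split likely helps most when files are compiled in parallel. The sketch below works through that arithmetic using the times quoted above. The label `zeta kernel file` is a placeholder (the new file's name is not given in the description), and the assumption that each file compiles as an independent, concurrent build job is mine, not something the PR states.

```python
# Compile times quoted in the PR description, in seconds.
before = {"BinaryMiscOpsKernel.cu": 4 * 60 + 30}
after = {
    "BinaryMiscOpsKernel.cu": 1 * 60,
    "zeta kernel file": 3 * 60 + 30,  # placeholder label; real file name not stated
}


def serial_cost(units):
    """Total compile time if the files are built one after another."""
    return sum(units.values())


def parallel_cost(units):
    """Wall-clock time if every file gets its own build job (assumption)."""
    return max(units.values())


print("serial:  ", serial_cost(before), "->", serial_cost(after))
print("parallel:", parallel_cost(before), "->", parallel_cost(after))
```

Under these assumptions the serial total is unchanged, while the longest single compile job shrinks by one minute.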

BinaryMiscOpsKernel.cu takes 4 m 30 s to compile on my machine.
After moving the zeta kernel into its own file, that file takes 3 m 30 s
and BinaryMiscOpsKernel.cu compile time drops to 1 m.
@peterbell10 peterbell10 added module: cuda Related to torch.cuda, and CUDA support in general open source labels Jul 27, 2021
@peterbell10 peterbell10 requested a review from ngimel July 27, 2021 13:45
@facebook-github-bot
Contributor

facebook-github-bot commented Jul 27, 2021

💊 CI failures summary and remediations

As of commit ef25eb3 (more details on the Dr. CI page):


  • 1/1 failures introduced in this PR

1 failure not recognized by patterns:

Job | Step | Action
GitHub Actions Lint / mypy | Run mypy | 🔁 rerun

@ejguan ejguan added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Jul 27, 2021
@facebook-github-bot
Contributor

@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@ngimel merged this pull request in fcc7fbe.

Labels

cla signed Merged module: cuda Related to torch.cuda, and CUDA support in general open source triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module

4 participants