KEMBAR78
cp: `Fix the load checkpointing issue -- onelogger callback gets called multiple time in some case. (14945)` into `r2.5.0` by chtruong814 · Pull Request #14948 · NVIDIA-NeMo/NeMo · GitHub
Skip to content

Conversation

@chtruong814
Copy link
Collaborator

beep boop [🤖]: Hi @liquor233 👋,

we've cherry picked #14945 into  for you! 🚀

Please review and approve this cherry pick by your convenience!

…ltiple time in some case. (#14945)

* Update modelPT.py

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Update one_logger_callback.py

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix the test for error handling

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* update the dependency version

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

---------

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
@PeiyuanQi
Copy link
Collaborator

PeiyuanQi commented Oct 17, 2025

LGTM approved as MLWFO

@PeiyuanQi PeiyuanQi removed their request for review October 17, 2025 16:02
@github-actions github-actions bot removed the Run CICD label Oct 17, 2025
@chtruong814
Copy link
Collaborator Author

The import test is failing because of how the workflow is written using the main branch instead to verify dependency installs. Will go ahead and merge this.

@chtruong814 chtruong814 merged commit f53c2cd into r2.5.0 Oct 17, 2025
267 of 276 checks passed
@chtruong814 chtruong814 deleted the cherry-pick-14945-r2.5.0 branch October 17, 2025 19:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cherry-pick core Changes to NeMo Core

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants