KEMBAR78
feat: cache refit_param_info_mcore by yuki-97 · Pull Request #698 · NVIDIA-NeMo/RL · GitHub
Skip to content

Conversation

yuki-97
Copy link
Contributor

@yuki-97 yuki-97 commented Jul 21, 2025

After dtype issue fixed:

  1. refit_param_info_mcore can be cached and no need to call get_param_info each time in prepare_weights_for_ipc. This won't offer any speed optimization (or maybe too small to see).
  2. refit_param_info_hf is no need to cache anymore, we used it to compare dtype before and now it's fixed.

Signed-off-by: Yuki Huang <yukih@nvidia.com>
@yuki-97 yuki-97 requested a review from ZhiyuLi-Nvidia July 21, 2025 02:41
@ZhiyuLi-Nvidia ZhiyuLi-Nvidia merged commit 9e57877 into zhiyul/yukih/refit-optimization-minimal-serialization Jul 21, 2025
5 checks passed
@ZhiyuLi-Nvidia ZhiyuLi-Nvidia deleted the yukih/cache-param-info branch July 21, 2025 04:43
ZhiyuLi-Nvidia pushed a commit that referenced this pull request Jul 21, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Zhiyu Li <zhiyul@nvidia.com>
ZhiyuLi-Nvidia pushed a commit that referenced this pull request Jul 21, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Zhiyu Li <zhiyul@nvidia.com>
ZhiyuLi-Nvidia pushed a commit that referenced this pull request Jul 29, 2025
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Zhiyu Li <zhiyul@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants