Updated _load_pretrained_model_low_mem to check if keys are in the state_dict #16643
Conversation
I am wondering what the correct place is to add a test for this function.
Thanks for tackling this; the fix could be a tiny bit better, I believe.
@stas00 It looks like the whole `low_cpu_mem_usage` path is not tested at present? Maybe we can take care of tests in a separate PR, for both a whole and a sharded checkpoint, so this one can be merged fast for the RegNet PR?
src/transformers/modeling_utils.py (Outdated)

```python
if isinstance(getattr(submodule, param_name), torch.nn.Parameter):
    new_val = torch.nn.Parameter(new_val)
setattr(submodule, param_name, new_val)
if k in state_dict:
```
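For context, a minimal sketch of the low-memory loading loop around this hunk, reconstructed from the snippet above and the discussion below; `resolved_archive_file` and the exact surrounding lines are assumptions, not quoted from the PR:

```python
# Sketch only: not the literal upstream code.
state_dict = torch.load(resolved_archive_file, map_location="cpu")

for k in loaded_state_dict_keys:
    submodule, param_name = find_submodule_and_param_name(model, k)
    if submodule is not None:
        # A sharded checkpoint file may not contain every key, so check
        # membership before indexing to avoid a KeyError.
        if k in state_dict:
            param_dtype = getattr(submodule, param_name).dtype
            new_val = state_dict[k].to(param_dtype)
            if isinstance(getattr(submodule, param_name), torch.nn.Parameter):
                new_val = torch.nn.Parameter(new_val)
            setattr(submodule, param_name, new_val)
```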
This test should go above, on line 2165, with a `continue` if it's not true, to avoid looking for the param when we don't need it.
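In sketch form, the suggested reordering would look roughly like this (a reconstruction, not the reviewer's literal code):

```python
for k in loaded_state_dict_keys:
    # Skip keys this shard does not contain before resolving the submodule.
    if k not in state_dict:
        continue
    submodule, param_name = find_submodule_and_param_name(model, k)
    # ... the rest of the loop body stays unchanged ...
```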
Updated. The only difference from your suggestion is that `setattr(submodule, param_name, new_val)` comes after the check for the key.
There is nothing on line 2165; are you sure you pushed your update? The goal is to avoid spending any time in this block (starting at `submodule, param_name = find_submodule_and_param_name(model, k)`) when there is no need to.
Apologies, updated. There's no need for an ugly `continue` when you can do everything with a positive conditional flow.
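That is, roughly this shape (a sketch under the same assumptions as the loop above):

```python
for k in loaded_state_dict_keys:
    if k in state_dict:  # positive check instead of an early `continue`
        submodule, param_name = find_submodule_and_param_name(model, k)
        # ... the rest of the loop body, nested one level deeper ...
```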
Prefilter?

```python
keys_to_load = [k for k in loaded_state_dict_keys if k in state_dict]
```
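In the loop, that prefiltering would be used roughly like this (a sketch, not a literal suggestion from the thread):

```python
keys_to_load = [k for k in loaded_state_dict_keys if k in state_dict]
for k in keys_to_load:
    submodule, param_name = find_submodule_and_param_name(model, k)
    # ... materialize the parameter as before, now without any membership check ...
```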
It won't be the same if `loaded_state_dict_keys` doesn't include all `state_dict` keys. I'm pretty sure it does right now, but that may change. Note this warning:

> src/transformers/modeling_utils.py, line 2121 at 10131af:
> Currently, it doesn't handle missing_keys, unexpected_keys, mismatched_keys. It can't handle deepspeed.

It was a quick hack to enable an urgent use case, so it still needs to be completed for full support, in which case not all keys from the `state_dict` might be loaded.
I only suggested the comprehension as another way to avoid too much conditional nesting. `continue` exists for this exact reason, as does the functional-programming approach.
You would still have to put your `continue` inside an `if` statement. For me it's the same; feel free to suggest the change that fits your coding style preference and I will happily make it. But let's avoid unneeded nitpicking.
I suggested a simple alternative to deep conditional nesting here: #16643 (comment)
But I'm fine with the code the way it is now as well.
Sure. What I meant is that prefiltering is the same as just iterating over the loaded `state_dict` keys, which is the cleanest solution.
Your plan works for me, Sylvain. I will work on the low-mem test today, then.
Thanks!
LGTM, thank you for fixing this bug, @FrancescoSaverioZuppichini
What does this PR do?
This PR checks whether a key is present in the `state_dict` before attempting to load it. With multiple (sharded) checkpoint files, not all keys are present in every file.

TODO
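A minimal, self-contained illustration of the failure mode this PR guards against (hypothetical key names and tensors, not taken from the PR):

```python
import torch

# Hypothetical shards of one checkpoint: each file holds only part of the weights.
shard_1 = {"encoder.weight": torch.zeros(2, 2)}
shard_2 = {"decoder.weight": torch.ones(2, 2)}

# Keys of the full model, gathered across all shards.
loaded_state_dict_keys = ["encoder.weight", "decoder.weight"]

for state_dict in (shard_1, shard_2):
    for k in loaded_state_dict_keys:
        # Without this check, state_dict[k] would raise a KeyError on the
        # shard that does not contain k.
        if k in state_dict:
            print(f"loading {k} from this shard")
```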