Enable dreambooth lora finetune example on other devices by jiqing-feng · Pull Request #10602 · huggingface/diffusers · GitHub

Conversation

@jiqing-feng
Contributor

@jiqing-feng jiqing-feng commented Jan 17, 2025

This PR makes two main changes to enable the example on other devices:

  1. Replace torch.cuda.amp.autocast() with torch.amp.autocast(device) so other devices can use it as well (see the sketch after this list).
  2. Empty the cache for XPU.
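
A minimal sketch of the device-agnostic pattern being described (the device_type selection below is an assumption for illustration; the actual script derives the device from accelerate):

```python
import torch

# Pick a device type string that torch.amp.autocast understands.
if torch.cuda.is_available():
    device_type = "cuda"
elif hasattr(torch, "xpu") and torch.xpu.is_available():
    device_type = "xpu"
else:
    device_type = "cpu"

# torch.amp.autocast takes the device type, so the same code path works on
# CUDA, XPU, and CPU instead of being tied to torch.cuda.amp.autocast.
with torch.amp.autocast(device_type):
    pass  # forward pass / inference goes here
```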

@jiqing-feng jiqing-feng marked this pull request as draft January 17, 2025 06:56
@kaixuanliu
Contributor

LGTM

@jiqing-feng jiqing-feng marked this pull request as ready for review January 20, 2025 01:04
@jiqing-feng
Contributor Author

Hi @sayakpaul, would you please review this PR? Thanks!

Member

@sayakpaul sayakpaul left a comment


Thanks for the contribution! I just left some comments, LMK if they make sense.

Comment on lines 836 to 838
torch.cuda.empty_cache()
if hasattr(torch, "xpu") and torch.xpu.is_available():
    torch.xpu.empty_cache()
Member


Same as above.
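
For reference, a fully device-agnostic variant of the snippet above could be factored into a small helper (the name clear_device_cache is hypothetical and not part of this PR):

```python
import gc

import torch


def clear_device_cache() -> None:
    """Hypothetical helper: release cached allocator memory on whichever accelerator is present."""
    gc.collect()
    if torch.cuda.is_available():
        torch.cuda.empty_cache()
    elif hasattr(torch, "xpu") and torch.xpu.is_available():
        torch.xpu.empty_cache()
```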

@jiqing-feng
Contributor Author

Hi @sayakpaul, I have addressed your comments. For mixed precision, I am not sure, because we have both torch_dtype and weight_dtype here. In my view, mixed precision should apply to the LoRA module but not to the SD base model, since the base model's weights have already been cast to the mixed-precision weight_dtype.
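
Roughly, the dtype split being described looks like this sketch (split_precision is a hypothetical helper for illustration, not the script's actual code; the real example works with weight_dtype and the LoRA parameters' requires_grad flags):

```python
import torch


def split_precision(model: torch.nn.Module, device: torch.device, weight_dtype: torch.dtype) -> None:
    """Hypothetical helper: keep the frozen base model in weight_dtype (e.g. fp16/bf16)
    while the trainable LoRA parameters stay in float32 for mixed-precision training."""
    model.to(device, dtype=weight_dtype)
    for param in model.parameters():
        if param.requires_grad:  # only the LoRA adapter weights are trainable
            param.data = param.data.to(torch.float32)
```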

Member

@sayakpaul sayakpaul left a comment


Just one comment and we should be good to go.

Have you verified if it works effectively? If so, could you share some results?

@jiqing-feng
Contributor Author

jiqing-feng commented Jan 21, 2025

Just one comment and we should be good to go.

Have you verified if it works effectively? If so, could you share some results?

For CUDA, the change only enables DDP finetuning; the results are unchanged from before.

The result on an Intel Xeon CPU is:
[sample image from the Intel Xeon CPU run]

Sorry, I cannot release the latency performance data because it is not allowed by Intel.

@sayakpaul
Member

For CUDA, the change only enables DDP finetuning; the results are unchanged from before.

What do you mean by this? I don't see the dog from the training set appear here. Am I missing something?

@jiqing-feng
Contributor Author

For CUDA, the change only enables DDP finetuning; the results are unchanged from before.

What do you mean by this? I don't see the dog from the training set appear here. Am I missing something?

Sorry, I didn't upload the CUDA result image successfully. I will run it again to get the result and give you feedback ASAP.

@jiqing-feng
Contributor Author

Hi @sayakpaul, here is the CUDA result, run on 2x A100 cards:
[sample image from the 2x A100 run]

@sayakpaul
Member

And how about the XPU result?

@jiqing-feng
Contributor Author

And how about the XPU result?

Just got the XPU result, with DDP across 2 cards:
[sample image from the XPU run]

The XPU run still needs some argument tuning to reach a high-quality result.

@sayakpaul
Member

Nice, this is good.

@jiqing-feng
Contributor Author

The failed test seems unrelated to my changes. Please let me know if anything else needs to change, or request other reviewers before merging. Thanks!

@sayakpaul sayakpaul merged commit 012d08b into huggingface:main Jan 21, 2025
11 of 12 checks passed
@sayakpaul
Member

Thanks much!

