KEMBAR78
Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` · Issue #34893 · huggingface/transformers · GitHub
Skip to content

Mimic adamw_torch_4bit and have adamw_torch_8bit #34893

@fzyzcjy

Description

@fzyzcjy

Feature request

Hi thanks for the lib! Currently there is adamw_torch_4bit, but I hope to mimic it to have a adamw_torch_8bit that uses 8bit torchao adamw.

The reason is that, I would like to use deepspeed cpu offload for the optimizer, and also use 8bit adamw. However, the 8bit one in current hf transformers does not support cpu, so I need to use the torchao one.

Motivation

Your contribution

yes, willing to PR

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions