[`DPO`] fix DPO + GC issues #927

younesbelkada · 2023-10-30T17:08:59Z

What does this PR do?

For DPO, when `gradient_checkpointing` is enabled, we need to attach hooks so that the inputs have `requires_grad=True`; otherwise training will either fail silently or fail outright.
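As an illustration of the failure mode described above, here is a minimal PyTorch-only sketch, not the PR's actual diff. `TinyModel`, `make_inputs_require_grad`, and `loss_requires_grad` are hypothetical names chosen for this example. The sketch assumes frozen input embeddings (as when only adapter weights are trained) and reentrant checkpointing. In that setup the checkpointed block receives inputs that do not require grad, so its output is detached from the autograd graph.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class TinyModel(nn.Module):
    """Hypothetical stand-in: frozen embeddings followed by a trainable, checkpointed block."""

    def __init__(self, vocab_size=16, hidden=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.block = nn.Linear(hidden, hidden)
        # Assumption: embeddings are frozen, as when only adapter weights train.
        self.embed.weight.requires_grad_(False)

    def forward(self, input_ids):
        hidden_states = self.embed(input_ids)
        # Reentrant checkpointing only builds a graph if an input requires grad.
        return checkpoint(self.block, hidden_states, use_reentrant=True)


def make_inputs_require_grad(module, inputs, output):
    # Forward hook: mark the embedding output as requiring grad, in place.
    output.requires_grad_(True)


def loss_requires_grad(model, use_hook):
    """Run one forward pass and report whether the loss is attached to the graph."""
    handle = None
    if use_hook:
        handle = model.embed.register_forward_hook(make_inputs_require_grad)
    loss = model(torch.tensor([[1, 2, 3]])).sum()
    if handle is not None:
        handle.remove()
    return loss.requires_grad


model = TinyModel()
print(loss_requires_grad(model, use_hook=False))  # loss is detached without the hook
print(loss_requires_grad(model, use_hook=True))   # loss is attached with the hook
```

Without the hook, calling `backward()` on the loss would raise an error because the loss has no `grad_fn`. For `transformers` models, `PreTrainedModel.enable_input_require_grads()` registers a comparable hook on the input embeddings, so it is likely the simpler option when available.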

cc @lvwerra

HuggingFaceDocBuilderDev · 2023-10-30T17:14:45Z

The documentation is not available anymore as the PR was closed or merged.

fix DPO + GC issues

a662835

younesbelkada mentioned this pull request Oct 30, 2023

DPO and accelerate #801

Closed

younesbelkada requested a review from lvwerra October 30, 2023 17:28

lvwerra approved these changes Oct 31, 2023

younesbelkada merged commit b89b712 into main Oct 31, 2023
8 checks passed

younesbelkada deleted the dpo-fix branch October 31, 2023 09:55

younesbelkada mentioned this pull request Oct 31, 2023

fix backward error in stack llama2 DPO example if checkpointing is used huggingface/peft#1056

Closed

lapp0 pushed a commit to lapp0/trl that referenced this pull request May 10, 2024

fix DPO + GC issues (huggingface#927)

703d8c7

RUFFY-369 mentioned this pull request May 20, 2024

error when using PPO in Gemma #1663

Closed
