-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
error when using PPO in Gemma #1663
Comments
@mostafamdy I also think the same, |
Do you know how we can set it to False without changing the source code? |
@mostafamdy here is the code to get a PPOTrainer instance which you may have used as I don't know about your script:
So, The So, try this simple line of code for changing the value of
|
Thanks @RUFFY-369 I tried this but not working with me
it worked after adding this code.
here is the full code
Have a nice day 😄 |
I don't know is it correct or not I found this in test ppo trainer |
@mostafamdy Yeah I checked that test script out while finding the value changes in So, the above code that you mentioned, are you using all the code from test file or just bits of it to make your script work? |
Thank you so much for your help
|
Your welcome ! Glad I could be of help. 😄 Have a nice day 👍 😄 |
Hey both, is the issue that the newly resized embedding don't require grad even if the rest does? |
Hi @ArthurZucker , what i found out was that there was a PR with DPO+Gradient checkpoiting issue where if "one uses gradient_checkpointing we need to attach hooks to enable inputs to have requires grad to true, otherwise the training will either silently fail or completely fail".
It's the same as what @mostafamdy found in test_ppo_trainer. |
cc @younesbelkada 🤗 |
This is a TRL issue so transferring it here ! |
#1664 should fix the issue |
System Info
Hi,
I tried using ppo with gemma model but I get this error
I think the issue is here is_encoder_decoder
Who can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
.
Expected behavior
.
The text was updated successfully, but these errors were encountered: