Skip to content

[Multi-Adapter PPO] Fix and Refactor reward model adapter #982

Merged
younesbelkada merged 4 commits intohuggingface:mainfrom mnoukhov:reward-model-adapterNov 21, 2023

Commits

Commits on Nov 10, 2023

Commits on Nov 20, 2023