diff --git a/docs/source/index.mdx b/docs/source/index.mdx
index b1de84afb1..bdddc9b6f2 100644
--- a/docs/source/index.mdx
+++ b/docs/source/index.mdx
@@ -38,28 +38,39 @@ Check the appropriate sections of the documentation depending on your needs:
Published on July 10, 2024
Preference Optimization for Vision Language Models with TRL
-
-
-Illustrating Reinforcement Learning from Human Feedback
+
+
+Published on June 12, 2024
+Putting RL back in RLHF
-
-
-Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
+
+
+Published on September 29, 2023
+Finetune Stable Diffusion Models with DDPO via TRL
+
+
+
+Published on August 8, 2023
+Fine-tune Llama 2 with DPO
-
+
+Published on April 5, 2023
StackLLaMA: A hands-on guide to train LLaMA with RLHF
-
-
-Fine-tune Llama 2 with DPO
+
+
+Published on March 9, 2023
+Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
-
-
-Finetune Stable Diffusion Models with DDPO via TRL
+
+
+Published on December 9, 2022
+Illustrating Reinforcement Learning from Human Feedback