diff --git a/docs/source/index.mdx b/docs/source/index.mdx
index b1de84afb1..bdddc9b6f2 100644
--- a/docs/source/index.mdx
+++ b/docs/source/index.mdx
@@ -38,28 +38,39 @@ Check the appropriate sections of the documentation depending on your needs:
- thumbnail
+ thumbnail
+

Published on July 10, 2024

Preference Optimization for Vision Language Models with TRL

-
- thumbnail
-

Illustrating Reinforcement Learning from Human Feedback

+
+ thumbnail
+

Published on June 12, 2024

+

Putting RL back in RLHF

-
- thumbnail
-

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

+
+ thumbnail
+

Published on September 29, 2023

+

Finetune Stable Diffusion Models with DDPO via TRL

+
+
+ thumbnail
+

Published on August 8, 2023

+

Fine-tune Llama 2 with DPO

- thumbnail
+ thumbnail
+

Published on April 5, 2023

StackLLaMA: A hands-on guide to train LLaMA with RLHF

-
- thumbnail
-

Fine-tune Llama 2 with DPO

+
+ thumbnail
+

Published on March 9, 2023

+

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

-
- thumbnail
-

Finetune Stable Diffusion Models with DDPO via TRL

+
+ thumbnail
+

Published on December 9, 2022

+

Illustrating Reinforcement Learning from Human Feedback