[Docs] Add unsloth optimizations in TRL's documentation #1119
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Cool work @younesbelkada and thanks! Looks good - I'll add a few changes in a bit!
docs/source/sft_trainer.mdx
Outdated

### Accelerate fine-tuning using `unsloth` library

You can further accelerate QLoRA / LoRA and even full-finetuning using [`unsloth`](https://github.com/unslothai/unsloth) library that is compatible with `SFTTrainer`. Currently `unsloth` supports only Llama and Mistral architectures.
"You can further accelerate QLoRA / LoRA and even full-finetuning using" maybe to
You can further accelerate QLoRA / LoRA 2.2x faster and use 60% less memroy and even full-finetuning (1.1x speed boost) using unsloth
library that is compatible with SFTTrainer
. Currently unsloth
supports only Llama (Yi, TinyLlama etc) and Mistral architectures.
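For context on what "compatible with `SFTTrainer`" means in practice, here is a minimal sketch of how an unsloth-loaded model would plug into TRL, assuming unsloth's `FastLanguageModel` API and the `SFTTrainer` signature from around the time of this PR; the model name, dataset, and hyperparameters below are illustrative placeholders, not part of the docs change itself.

```python
# Sketch only: assumes unsloth's FastLanguageModel API and the SFTTrainer signature
# of this era; "unsloth/mistral-7b", "imdb" and the hyperparameters are placeholders.
from datasets import load_dataset
from transformers import TrainingArguments
from trl import SFTTrainer
from unsloth import FastLanguageModel

max_seq_length = 2048

# Load a 4-bit quantized base model through unsloth instead of transformers.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/mistral-7b",
    max_seq_length=max_seq_length,
    dtype=None,         # auto-detect (bfloat16 on recent GPUs)
    load_in_4bit=True,  # QLoRA-style 4-bit base weights
)

# Attach LoRA adapters via unsloth's patched PEFT path.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    lora_alpha=16,
    lora_dropout=0,
    bias="none",
    use_gradient_checkpointing=True,
)

dataset = load_dataset("imdb", split="train")

# The unsloth-patched model drops straight into SFTTrainer.
trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",
    max_seq_length=max_seq_length,
    args=TrainingArguments(
        output_dir="./sft-unsloth",
        per_device_train_batch_size=2,
        max_steps=60,
    ),
)
trainer.train()
```

The point of the sketch is that only the model and adapter loading goes through unsloth; the `SFTTrainer` call itself is unchanged, which is what makes the speedup "out of the box".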
docs/source/sft_trainer.mdx
Outdated

@@ -410,6 +410,61 @@ We have tested NEFTune by training `mistralai/Mistral-7B-v0.1` on the [OpenAssis
</div>

Note however, that the amount of performance gain is _dataset dependent_ and in particular, applying NEFTune on synthetic datasets like [UltraChat](https://huggingface.co/datasets/stingning/ultrachat) typically produces smaller gains.

### Accelerate fine-tuning using `unsloth` library
"Accelerate fine-tuning using `unsloth` library" maybe to:

"Accelerate fine-tuning using `unsloth`"

Actually, instead of comments, I just edited the docs in place here: add-unsloth-docs...danielhanchen:trl:patch-1
Co-authored-by: Daniel Han <[email protected]>
Thanks a lot @danielhanchen! I just merged your PR with your suggestions :)
…e#1119)

* add unsloth

* Update sft_trainer.mdx (huggingface#1124)

Co-authored-by: Daniel Han <[email protected]>

---------

Co-authored-by: Daniel Han <[email protected]>
What does this PR do?
This PR adds details about the unsloth library: https://github.com/unslothai/unsloth, which can bring very interesting out-of-the-box speedups to users. In the future, I'll make sure users that use unsloth + SFTTrainer get the `unsloth` tag when they push models on the Hub with unsloth.

cc @danielhanchen @lvwerra

@danielhanchen let me know if the content looks good to you! Feel free to add any suggestion directly in the PR
Related: unslothai/unsloth#34