Actions: huggingface/trl

Build PR Documentation

211 workflow run results

[Document] Minor fixes of sft_trainer document
Build PR Documentation #1779: Pull request #1029 opened by mutichung
November 23, 2023 11:28 3m 18s clarify_sfttrainer_doc
[DPO] IPO Training loss
Build PR Documentation #1775: Pull request #1022 synchronize by kashif
November 22, 2023 15:15 3m 35s kashif:ipo
[DPO] IPO Training loss
Build PR Documentation #1774: Pull request #1022 synchronize by kashif
November 22, 2023 15:13 2m 21s kashif:ipo
[DPO] IPO Training loss
Build PR Documentation #1773: Pull request #1022 synchronize by kashif
November 22, 2023 14:07 3m 27s kashif:ipo
[DPO] IPO Training loss
Build PR Documentation #1772: Pull request #1022 synchronize by kashif
November 22, 2023 13:25 3m 22s kashif:ipo
[DPO] IPO Training loss
Build PR Documentation #1771: Pull request #1022 opened by kashif
November 22, 2023 13:12 3m 27s kashif:ipo
Fixes reward and text gathering in distributed training
Build PR Documentation #1770: Pull request #850 synchronize by vwxyzjn
November 22, 2023 04:57 3m 43s fix-reward-gather
Fixes reward and text gathering in distributed training
Build PR Documentation #1769: Pull request #850 synchronize by vwxyzjn
November 22, 2023 04:56 1m 3s fix-reward-gather
Remove duplicate data loading in rl_training.py
Build PR Documentation #1768: Pull request #1020 opened by viethoangtranduong
November 21, 2023 20:29 3m 43s viethoangtranduong:patch-1
[DPO] use ref model logprobs if it exists in the data
Build PR Documentation #1767: Pull request #885 synchronize by kashif
November 20, 2023 18:15 3m 43s kashif:reference-logprobs
[DPO] use ref model logprobs if it exists in the data
Build PR Documentation #1766: Pull request #885 synchronize by kashif
November 20, 2023 17:36 3m 50s kashif:reference-logprobs
[Multi-Adapter PPO] Fix and Refactor reward model adapter
Build PR Documentation #1765: Pull request #982 synchronize by mnoukhov
November 20, 2023 16:59 3m 54s mnoukhov:reward-model-adapter
Fixes reward and text gathering in distributed training
Build PR Documentation #1764: Pull request #850 synchronize by edbeeching
November 20, 2023 12:33 4m 29s fix-reward-gather
Update utils.py
Build PR Documentation #1763: Pull request #1012 opened by ZihanWang314
November 20, 2023 11:41 3m 39s ZihanWang314:patch-1
Fixes reward and text gathering in distributed training
Build PR Documentation #1762: Pull request #850 synchronize by edbeeching
November 20, 2023 11:25 3m 35s fix-reward-gather
Fixes reward and text gathering in distributed training
Build PR Documentation #1761: Pull request #850 synchronize by edbeeching
November 20, 2023 09:59 3m 26s fix-reward-gather
[DPO] use ref model logprobs if it exists in the data
Build PR Documentation #1760: Pull request #885 synchronize by kashif
November 17, 2023 15:48 3m 38s kashif:reference-logprobs
Update how_to_train.md
Build PR Documentation #1747: Pull request #1003 synchronize by halfrot
November 17, 2023 11:01 3m 44s patch-3
[DPO] use ref model logprobs if it exists in the data
Build PR Documentation #1746: Pull request #885 synchronize by kashif
November 17, 2023 09:23 3m 27s kashif:reference-logprobs
Adds requires_grad to input for non-quantized peft models
Build PR Documentation #1743: Pull request #1006 synchronize by younesbelkada
November 16, 2023 17:31 3m 34s younesbelkada-patch-sft-trainer-gc
Adds requires_grad to input for non-quantized peft models
Build PR Documentation #1742: Pull request #1006 synchronize by younesbelkada
November 16, 2023 17:28 3m 27s younesbelkada-patch-sft-trainer-gc