Actions: hiyouga/LLaMA-Factory

Showing runs from all workflows
3,341 workflow runs

Difference between the quantization level and the export quantization level
label_issue #1946: Issue #6510 opened by TLL1213
January 2, 2025 08:49 10s
Qwen2.5 3B sft: GPU utilization is very low
label_issue #1944: Issue #6508 opened by ATP-BME
January 2, 2025 06:31 12s
Abnormal prediction results
label_issue #1943: Issue #6507 opened by bisque-qwe
January 2, 2025 05:37 13s
January 1, 2025 10:18 10s
How to modify the prompt at the inference stage
label_issue #1941: Issue #6502 opened by HHTao16
December 31, 2024 08:26 13s
support DeepSeek-VL2 finetune?
label_issue #1940: Issue #6501 opened by xiadingZ
December 31, 2024 07:51 14s
December 31, 2024 06:52 10s
refactor(data): rework the masking approach; sharegpt supports finer-grained mask control
tests #1726: Pull request #6498 opened by zzc0430
December 31, 2024 06:31 Action required zzc0430:refactor/mask_history
How to use multiple images in multimodal dataset?
label_issue #1937: Issue #6497 opened by xiadingZ
December 31, 2024 03:52 10s
December 31, 2024 02:35 14s
How to avoid saving the optimizer state when saving a checkpoint during training
label_issue #1935: Issue #6494 opened by EmeryBAI
December 30, 2024 16:04 11s
Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3
tests #1725: Commit 2382a5f pushed by hiyouga
December 30, 2024 13:50 6m 44s main
[model] add deepseek3 model
tests #1724: Pull request #6492 opened by hiyouga
December 30, 2024 13:40 8m 9s hiyouga/add_deepseek3
Merge pull request #5507 from piamo/main
tests #1723: Commit 91467ed pushed by hiyouga
December 30, 2024 13:08 7m 51s main
bug
label_issue #1932: Issue #6488 opened by liwenewil
December 30, 2024 09:00 11s
Using GaLore_AdamW and LoRA
label_issue #1931: Issue #6487 opened by YajieW99
December 30, 2024 08:47 11s
Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer
tests #1722: Commit 40805b0 pushed by hiyouga
December 30, 2024 08:34 7m 46s main
resume_from_checkpoint oom killed
label_issue #1930: Issue #6486 opened by sunrise224
December 30, 2024 08:22 9s
How to freeze only the LLM when training a multimodal model?
label_issue #1929: Issue #6484 opened by Ben81828
December 30, 2024 06:09 11s
[model] update vllm & fix paligemma dtype
tests #1721: Pull request #6483 synchronize by hiyouga
December 30, 2024 06:03 7m 13s hiyouga/fix_paligemma_infer
[model] update vllm & fix paligemma dtype
tests #1720: Pull request #6483 opened by hiyouga
December 30, 2024 05:56 7m 36s hiyouga/fix_paligemma_infer