Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Qwen2.5 3B sft GPU利用率很低 #6508

Closed
1 task done
ATP-BME opened this issue Jan 2, 2025 · 1 comment
Closed
1 task done

Qwen2.5 3B sft GPU利用率很低 #6508

ATP-BME opened this issue Jan 2, 2025 · 1 comment
Labels
invalid This doesn't seem right

Comments

@ATP-BME
Copy link

ATP-BME commented Jan 2, 2025

Reminder

  • I have read the README and searched the existing issues.

System Info

  • llamafactory version: 0.9.2.dev0
  • Platform: Windows-10-10.0.19045-SP0
  • Python version: 3.10.13
  • PyTorch version: 2.2.2 (GPU)
  • Transformers version: 4.43.2
  • Datasets version: 2.20.0
  • Accelerate version: 1.0.1
  • PEFT version: 0.11.1
  • TRL version: 0.9.6
  • GPU type: Quadro P6000
  • DeepSpeed version: 0.14.2+unknown
  • Bitsandbytes version: 0.44.1

Reproduction

model

model_name_or_path: F:/LM/model_zoo/qwen2.5-3B-instruct

method

stage: sft
do_train: true
finetuning_type: lora
lora_target: all
lora_rank: 16
lora_alpha: 16
lora_dropout: 0.05

dataset

dataset: qwen_train_data
template: qwen
cutoff_len: 3072
overwrite_cache: true
preprocessing_num_workers: 16

output

output_dir: saves/qwen2.5-3b/lora/sft
logging_steps: 100
save_steps: 100
plot_loss: true
overwrite_output_dir: true

train

resume_from_checkpoint: F:/LM/Qwen2/LLaMA-Factory-main/LLaMA-Factory-main/saves/qwen2.5-3b/lora/sft/checkpoint-1000
per_device_train_batch_size: 1
gradient_accumulation_steps: 16
learning_rate: 1.0e-4
num_train_epochs: 1.0
lr_scheduler_type: cosine
warmup_ratio: 0.1

bf16: true

fp16: true
ddp_timeout: 180000000

eval

val_size: 0.01
per_device_eval_batch_size: 1
eval_strategy: steps
eval_steps: 500

Expected behavior

sft微调Qwen 2.5 3B,可以进行训练但是显卡利用率几乎为0.请问这是正常的吗?有没有方法可以加速训练。

image
image

Others

No response

@github-actions github-actions bot added the pending This problem is yet to be addressed label Jan 2, 2025
@hiyouga hiyouga added invalid This doesn't seem right and removed pending This problem is yet to be addressed labels Jan 2, 2025
@hiyouga
Copy link
Owner

hiyouga commented Jan 2, 2025

请使用 nvidia-smi 查看利用率

@hiyouga hiyouga closed this as completed Jan 2, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
invalid This doesn't seem right
Projects
None yet
Development

No branches or pull requests

2 participants