We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
llamafactory
这是训练的推理文件
### model model_name_or_path: mergd_llama2_7b # trust_remote_code: true ### method stage: dpo do_train: true finetuning_type: lora lora_target: q_proj,v_proj pref_beta: 0.1 pref_loss: sigmoid # choices: [sigmoid (dpo), orpo, simpo] ### dataset dataset: dataset template: llama2 # cutoff_len: 2048 # max_samples: 1000 overwrite_cache: true preprocessing_num_workers: 16 ### output output_dir: new_path logging_steps: 10 save_steps: 500 plot_loss: true overwrite_output_dir: true ### train per_device_train_batch_size: 4 gradient_accumulation_steps: 4 learning_rate: 5.0e-5 num_train_epochs: 1.0 lr_scheduler_type: cosine warmup_ratio: 0.1 bf16: true ddp_timeout: 180000000 # ### eval # val_size: 0.1 # per_device_eval_batch_size: 1 # eval_strategy: steps # eval_steps: 500(chatkbqa)
这是推理的配置文件
### model model_name_or_path: mergd_llama2_7b adapter_name_or_path: new_path ### method stage: dpo do_predict: true finetuning_type: lora lora_target: q_proj,v_proj ### dataset template: llama2 eval_dataset: eval_dataset overwrite_cache: true preprocessing_num_workers: 16 ### output output_dir: output_dir logging_steps: 10 save_steps: 1000 plot_loss: true overwrite_output_dir: true
这样子运行后,无法得到输出文件,所以想请教一下DPO之后应该如何进行推理呢?
No response
The text was updated successfully, but these errors were encountered:
https://github.com/hiyouga/LLaMA-Factory/tree/main/examples#batch-generation-using-vllm-tensor-parallel
Sorry, something went wrong.
No branches or pull requests
Reminder
System Info
llamafactory
version: 0.9.2.dev0Reproduction
这是训练的推理文件
这是推理的配置文件
这样子运行后,无法得到输出文件,所以想请教一下DPO之后应该如何进行推理呢?
Others
No response
The text was updated successfully, but these errors were encountered: