Dear authors,

I used the script to fine-tune the zh-en pre-trained model you provided. After setting up the GPUs, dictionaries, and binarized data, the following error appeared:
Traceback (most recent call last):
  File "/opt/conda/lib/python3.6/site-packages/torch/multiprocessing/spawn.py", line 20, in _wrap
    fn(i, *args)
  File "/opt/conda/lib/python3.6/site-packages/fairseq_cli/train.py", line 265, in distributed_main
    main(args, init_distributed=True)
  File "/opt/conda/lib/python3.6/site-packages/fairseq_cli/train.py", line 68, in main
    extra_state, epoch_itr = checkpoint_utils.load_checkpoint(args, trainer)
  File "/opt/conda/lib/python3.6/site-packages/fairseq/checkpoint_utils.py", line 107, in load_checkpoint
    reset_meters=args.reset_meters,
  File "/opt/conda/lib/python3.6/site-packages/fairseq/trainer.py", line 154, in load_checkpoint
    'Cannot load model parameters from checkpoint, '
Exception: Cannot load model parameters from checkpoint, please ensure that the architectures match.
I also noticed that the pre-trained model is much larger (6,425,203,122 bytes) than the checkpoints I obtain when training from scratch (~3,220,000,000 bytes). Any advice on how to load the pre-trained model successfully?
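One way to narrow this down is to inspect the checkpoint directly: fairseq checkpoints bundle the model weights together with the training arguments, so comparing the stored architecture against the `--arch` used for fine-tuning usually pinpoints the mismatch (a roughly 2x size difference can also come from extra saved state, such as an optimizer state, though that is only a guess). Below is a minimal sketch; the file name and the exact fields are assumptions for illustration, not the authors' actual checkpoint:

```python
import torch

# Simulate a checkpoint with fields like those fairseq saves
# (the field names and arch value here are assumptions).
torch.save(
    {
        "args": {"arch": "transformer_wmt_en_de_big"},
        "model": {"encoder.embed_tokens.weight": torch.zeros(8, 4)},
    },
    "checkpoint_demo.pt",
)

# Load on CPU and inspect what the checkpoint actually contains.
ckpt = torch.load("checkpoint_demo.pt", map_location="cpu")
print("stored arch:", ckpt["args"]["arch"])  # compare with your --arch flag
print("top-level keys:", sorted(ckpt.keys()))  # extra keys can explain extra size

# Count parameters in the saved model state dict.
n_params = sum(t.numel() for t in ckpt["model"].values())
print("parameter count:", n_params)
```

Running the same inspection on both the pre-trained model and one of your own checkpoints, and diffing the stored architectures and state-dict keys, should show exactly where the two disagree.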