Hi, I’m struggling to reproduce the work. When I start training following the process in this repository, the loss decreases rapidly and seems to be approaching convergence, yet the model fails to reconstruct images. Does this make sense?
Based on the information and the screenshot you provided:
Are you using images of size 128? The VQGAN provided is not robust for images below 256.
Even though the loss drops quickly, it seems you are showing results after only 7,500 iterations. The model needs many more updates to generate good-quality images. The loss drops quickly at the beginning mainly because the model first learns to copy-paste the unmasked tokens. Of course, I don't know the other hyperparameters you are using, but factors such as batch size, learning rate, or model size can drastically influence training.
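To make the "copy-paste" point concrete, here is a minimal sketch (not the code from this repo) of a MaskGIT-style masked-token training step, under the assumption that the cross-entropy is averaged over all token positions; the function and variable names (`mask_token_id`, `min_mask_ratio`, etc.) are illustrative. Unmasked positions are trivial to reproduce from the input, so the averaged loss falls fast long before the masked-token predictions are any good; monitoring the loss on masked positions only gives a more honest picture.

```python
import torch
import torch.nn.functional as F

def training_step(model, tokens, mask_token_id, min_mask_ratio=0.1):
    """Sketch of one masked-token modeling step.

    tokens: (B, N) long tensor of discrete VQGAN indices.
    model:  transformer returning logits of shape (B, N, vocab_size).
    """
    B, N = tokens.shape

    # Sample a masking ratio per sample and build a random boolean mask.
    ratio = torch.empty(B, 1, device=tokens.device).uniform_(min_mask_ratio, 1.0)
    mask = torch.rand(B, N, device=tokens.device) < ratio  # True = masked

    # Replace masked positions with the special mask token.
    inputs = tokens.clone()
    inputs[mask] = mask_token_id

    logits = model(inputs)  # (B, N, vocab_size)

    # Averaged over ALL positions: the unmasked ones are trivial copies of the
    # input, so this loss drops quickly early in training.
    loss_all = F.cross_entropy(logits.transpose(1, 2), tokens)

    # Restricted to masked positions: reflects how well the model actually
    # infers missing tokens, which is what matters for generation quality.
    loss_masked = F.cross_entropy(logits[mask], tokens[mask])

    return loss_all, loss_masked
```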
Thanks for your reply. Indeed, the image size is set to 128 for faster training. I will follow the technical report you released and make another attempt. Thanks again!