Random experiments with VQVAE and friends, i.e. autoencoder models that pass through a discrete latent variable bottleneck, whose discrete codes are then easy to plug into existing infrastructure for modeling sequences of discrete variables (GPT and friends). I didn't get a chance to make the code pretty and consume and propagate all the proper args etc., so currently this is very much not a "pass arguments in and watch it work" kind of repo; this is "read the entire code and hack things inline" code.
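To make the overall idea concrete, here is a minimal, self-contained sketch of the discrete bottleneck (not the code in this repo; shapes and names are illustrative): a continuous encoder output is snapped to its nearest codebook vector, gradients pass straight through, and the resulting code indices form a token sequence that a GPT-like prior can model.

```python
# Toy sketch of a vector-quantized bottleneck (illustrative only, not this repo's code).
import torch
import torch.nn as nn

class ToyDiscreteBottleneck(nn.Module):
    def __init__(self, num_codes=512, dim=64):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, dim)  # the discrete latent "vocabulary"

    def forward(self, z_e):
        # z_e: (B, dim, H, W) continuous encoder output
        B, D, H, W = z_e.shape
        flat = z_e.permute(0, 2, 3, 1).reshape(-1, D)        # (B*H*W, D)
        dists = torch.cdist(flat, self.codebook.weight)      # distance to every code
        idx = dists.argmin(dim=1)                            # nearest-code indices
        z_q = self.codebook(idx).view(B, H, W, D).permute(0, 3, 1, 2)
        # straight-through estimator: copy gradients from z_q back to z_e
        z_q = z_e + (z_q - z_e).detach()
        tokens = idx.view(B, H * W)  # discrete sequence, ready for a GPT-style prior
        return z_q, tokens
```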
### DeepMind's VQVAE
Should train out of the box as `python train_vqvae.py --gpus 1 --vq_flavor vqvae`.
I am able to get what I think are expected results on CIFAR-10 using VQVAE (judging by the reconstruction loss achieved). However, I had to resort to a data-driven initialization scheme with k-means, which the sonnet repo does not use, possibly because it treats model initialization more carefully. When I do not use the data-driven init, training exhibits catastrophic index collapse.
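For illustration, a rough sketch of the kind of data-driven k-means init meant here (the function name, sample budget, and data-loader format are assumptions, not this repo's API): run the encoder over some data, cluster the resulting vectors, and copy the centroids into the codebook so that every code starts near the data manifold and actually gets used.

```python
# Hedged sketch of a k-means codebook init (illustrative, not the exact routine in this repo).
import torch
from sklearn.cluster import KMeans

@torch.no_grad()
def kmeans_init_codebook(codebook, encoder, data_loader, device="cuda", max_samples=20000):
    # collect a buffer of encoder outputs; assumes the loader yields (image, label) pairs
    feats = []
    n = 0
    for x, _ in data_loader:
        z_e = encoder(x.to(device))  # assumed shape (B, D, H, W)
        f = z_e.permute(0, 2, 3, 1).reshape(-1, z_e.shape[1]).cpu()
        feats.append(f)
        n += f.shape[0]
        if n >= max_samples:
            break
    feats = torch.cat(feats).numpy()
    # cluster encoder vectors and overwrite the codebook with the centroids
    km = KMeans(n_clusters=codebook.num_embeddings, n_init=10).fit(feats)
    codebook.weight.copy_(torch.from_numpy(km.cluster_centers_).to(codebook.weight))
```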
### Jang et al. Gumbel-Softmax VQVAE

Also allegedly the version used in DALL-E, though we have not seen the details yet.
Should train out of the box as `python train_vqvae.py --gpus 1 --vq_flavor gumbel`.
Trains and converges to a slightly higher reconstruction loss, but tuning the scale of the KL divergence loss, the temperature decay rate, and the version of gumbel (soft/hard) has so far proved a bit finicky. The whole thing also trains much slower, and requires a more thorough hyperparameter search than a few one-off guesses.
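For reference, a hedged sketch of what those knobs control in the Gumbel-Softmax flavor (the function name, weight, and annealing schedule below are illustrative assumptions, not this repo's settings): the encoder predicts logits over the codebook, codes are sampled with gumbel-softmax at a decaying temperature, and a KL term against a uniform prior over codes regularizes code usage.

```python
# Illustrative sketch of a gumbel-softmax quantization step (not this repo's exact code).
import math
import torch
import torch.nn.functional as F

def gumbel_quantize(logits, codebook, tau=1.0, hard=False, kl_weight=5e-4):
    # logits: (B, num_codes, H, W) scores over the codebook at each spatial position
    soft_one_hot = F.gumbel_softmax(logits, tau=tau, hard=hard, dim=1)
    # weighted sum of code vectors -> (B, D, H, W)
    z_q = torch.einsum('bnhw,nd->bdhw', soft_one_hot, codebook.weight)
    # KL(q || uniform) over codes encourages all codes to get used
    qy = F.softmax(logits, dim=1)
    log_n = math.log(logits.shape[1])
    kl = (qy * (qy.clamp(min=1e-10).log() + log_n)).sum(dim=1).mean()
    return z_q, kl_weight * kl

# the temperature tau is typically annealed towards a small floor over training, e.g.
# tau = max(0.0625, math.exp(-1e-5 * step))
```

The `hard=True` variant uses a straight-through one-hot sample in the forward pass while keeping soft gradients; that soft/hard choice, the KL weight, and the decay rate are exactly the knobs that turned out to be finicky.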