T2I-with-quantitative-embeddings

Results

Architecture

Steps to reproduce the code

Part1: Data preparation and feature extraction (Including BERT Text Embeddings)

Step 1: Run the notebook. This notebook will work only on Colab https://colab.research.google.com/drive/1C9wzPjyYUb0mOiuDlVCS8l86TsaQBdsD
Step2: Run the script dataset_curation_part1.py . To do this ; the dataset needs to be downloaded from this link: https://drive.google.com/drive/folders/1FhQARl68A9NjjIbpQ6ib28UQ6pcLUSXJ?usp=sharing
- Out of the Step 1 is used in this python file
- And the path for animal images needs to be changed to the path of animal_images folder in your local. Code line 53.

this neeeds to be replaced with the path where the images are saved

destination = '/Volumes/GoogleDrive/Shared drives/MSML612 DeepLearningProject/data/animal_images'

Part 2: CGAN training

Step 1: Open the Congif.ini, setup an experiment with desired values and give an experiment name. The data_path variable should point to the relative path where the pickle file from the previous Part was generated.
Step 2: Install the required packages and run the command $ python CGAN.py. This should start the training and should start saving the models, plots and generated images under the Experiments/<exp_name> Folder.

Please note that since this a complex model, training was done on NVIDIA 16GB GPU enabled High Performance Cluster system and it still took about 1 hour to train per epoch and to see minimal results, atleast about 30 epochs of training needs to be done.

Part 3: CGAN predictions

The best model we trained was about 15 epochs due to the hardware constraints. Given that our qualitative dataset is relatively small, the results were vague but starting to form.
To generate image on a desired text, in the notebook CGAN_Predicition.ipynb change the variable ‘TEXT’ to the desired sentence and run the entire notebook to generate the predicted image at the end.

Special Mention: Lafite Code

We tried implementing the Lafite code which is open source. It can be found in the Lafite directory. The pickle can be processed using dataset_tool.py by giving its path to the source in the script. The data has been generated in the ldata folder.
Kindly go through the documentation of Lafite to setup the requirements.
To run the Lafite training script, you need to run $ python train.py --outdir=./training_runs --data=./ldata --test_data=./ldata_test. (We faced issues running this due to CUDA issues).
For Predicitons using Lafite, open the notebook Lafite/generate.ipynb and change the text to the required text. Run the notebook and the predictions should be loaded.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
results		results
.gitignore		.gitignore
Architecture.png		Architecture.png
CGAN.py		CGAN.py
CGAN_Config.ini		CGAN_Config.ini
CGAN_DataTransforms.py		CGAN_DataTransforms.py
CGAN_Dataset.py		CGAN_Dataset.py
CGAN_Model.py		CGAN_Model.py
CGAN_Prediciton.py		CGAN_Prediciton.py
CGAN_utils.py		CGAN_utils.py
Deep Learning Project Final Report.pdf		Deep Learning Project Final Report.pdf
Final Group Project PPT.pptx		Final Group Project PPT.pptx
README.md		README.md
data_curation.ipynb		data_curation.ipynb
dataset_curation_part1.py		dataset_curation_part1.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

T2I-with-quantitative-embeddings

Results

Architecture

Steps to reproduce the code

Part 2: CGAN training

Part 3: CGAN predictions

Special Mention: Lafite Code

About

Releases

Packages

Contributors 2

Languages

sandeeppvn/Text-to-Image-Transformer-GAN

Folders and files

Latest commit

History

Repository files navigation

T2I-with-quantitative-embeddings

Results

Architecture

Steps to reproduce the code

Part 2: CGAN training

Part 3: CGAN predictions

Special Mention: Lafite Code

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages