An attempt at designing a network which generates Movie Scripts (how cool is that!)
I have shortlisted some candidates for accomplishing this task:
- Seq2seq LSTM with encoders. The idea behind using this is to get multiple levels of abstraction out of a seq2seq model. We might use LSTMs with around 700 memory cells, a number that comes from experience.
- Variational Autoencoders. Unlike plain autoencoders, variational autoencoders have the special property of producing continuous RVs. I want to use this model analogously to how 'thought vectors' are used. With continuous RVs instead of discrete ones, we can get scripts with accurate and enriched meaning. Again, meaning at what level of abstraction? What kind of meaning? These are questions I don't have answers to yet, and I hope to find out soon.
- A GAN where the Generator produces dialogues and the Cop (discriminator) tells whether they come close to the storyline (this will be cool!). We'll need a different loss function here, penalising the network based on the cosine similarity with subsequent story segments for each character.
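The seq2seq bullet above rests on an LSTM encoder; here is a minimal NumPy sketch of one LSTM step and an encoder pass. Everything except the ~700 memory cells is my assumption (input size, sequence length, initialization), and a real model would stack encoder/decoder layers with learned weights:

```python
import numpy as np

def lstm_step(x, h, c, W, U, b):
    """One LSTM step: x is the input, (h, c) the previous hidden/cell state.
    W, U, b pack the input, forget, output, and candidate gates (4*H rows)."""
    H = h.shape[0]
    z = W @ x + U @ h + b                      # all four gate pre-activations
    i = 1 / (1 + np.exp(-z[0:H]))              # input gate
    f = 1 / (1 + np.exp(-z[H:2*H]))            # forget gate
    o = 1 / (1 + np.exp(-z[2*H:3*H]))          # output gate
    g = np.tanh(z[3*H:4*H])                    # candidate cell update
    c_new = f * c + i * g
    h_new = o * np.tanh(c_new)
    return h_new, c_new

# Toy encoder pass: 700 memory cells (from the note above), 300-dim inputs
# and a 5-token sequence are illustrative guesses.
rng = np.random.default_rng(0)
H, D, T = 700, 300, 5
W = rng.normal(0, 0.01, (4 * H, D))
U = rng.normal(0, 0.01, (4 * H, H))
b = np.zeros(4 * H)
h, c = np.zeros(H), np.zeros(H)
for t in range(T):
    h, c = lstm_step(rng.normal(size=D), h, c, W, U, b)
print(h.shape)  # final encoder state, which a decoder would condition on
```

The final `h` is the summary the decoder would start generating from; stacking several such layers is one way to get the multiple abstraction levels mentioned above.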
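The continuous latent in the VAE bullet comes from the reparameterization trick plus a KL regularizer; a small sketch of just those two pieces (the 64-dim latent size is a placeholder, not something from the plan above):

```python
import numpy as np

def reparameterize(mu, logvar, rng):
    """Sample z ~ N(mu, sigma^2) via z = mu + sigma * eps, which keeps
    the sampling step differentiable with respect to mu and logvar."""
    eps = rng.normal(size=mu.shape)
    return mu + np.exp(0.5 * logvar) * eps

def kl_to_standard_normal(mu, logvar):
    """KL(N(mu, sigma^2) || N(0, 1)): the VAE term that keeps the
    latent space continuous and smooth instead of a scattered codebook."""
    return -0.5 * np.sum(1 + logvar - mu**2 - np.exp(logvar))

rng = np.random.default_rng(0)
mu = np.zeros(64)        # a 64-dim 'thought vector' (size is a guess)
logvar = np.zeros(64)    # log-variance of 0 means unit variance
z = reparameterize(mu, logvar, rng)
kl = kl_to_standard_normal(mu, logvar)
print(z.shape, kl)       # KL is exactly 0 when mu=0, logvar=0
```

Because `z` is a continuous random variable, nearby latents decode to nearby meanings, which is what makes the 'thought vector' analogy plausible.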
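For the GAN bullet, the extra cosine-similarity penalty might look like the sketch below. The embeddings standing in for the generated dialogue and the subsequent story segments, and the simple mean over segments, are assumptions on my part:

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def storyline_penalty(dialogue_emb, next_segment_embs):
    """Hypothetical extra generator loss: penalize dialogue whose embedding
    drifts from a character's subsequent story segments (1 - cosine each)."""
    sims = [cosine_similarity(dialogue_emb, s) for s in next_segment_embs]
    return float(np.mean([1.0 - s for s in sims]))

v = np.array([1.0, 0.0, 0.0])
aligned = storyline_penalty(v, [v, v])                       # same direction
orthogonal = storyline_penalty(v, [np.array([0.0, 1.0, 0.0])])
print(aligned, orthogonal)  # 0.0 for aligned, 1.0 for orthogonal
```

This term would be added to the usual adversarial loss for the generator, so the Cop still judges realism while the penalty pulls dialogue toward each character's storyline.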
For the dataset, two obvious candidates:
- Cornell Movie-Dialogs Corpus
- I love Elementary. It's a modern take on /Sherlock Holmes/. Drama, mystery, perfect. So I am going to try to curate a dataset from Elementary Season 1 or 5 subtitles. Unfortunately, I haven't been able to find Subtitles for the Deaf and Hard-of-Hearing (SDH), which have character-labeled dialogue instead of just the dialogue text. Still looking; if you guys find any, let me know.
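A starting point for curating the subtitle dataset: a minimal parser for plain SubRip (.srt) files that strips cue indices, timestamps, and basic tags, leaving just the dialogue lines. The sample text is made up, and note that without SDH subtitles these lines come out unlabeled, exactly the limitation mentioned above:

```python
import re

def parse_srt(text):
    """Extract dialogue lines from SubRip (.srt) subtitle text.
    Drops cue indices, timestamp lines, and simple HTML tags like <i>."""
    lines = []
    for block in re.split(r"\n\s*\n", text.strip()):
        for line in block.splitlines():
            line = line.strip()
            if not line or line.isdigit() or "-->" in line:
                continue                    # skip index and timing rows
            lines.append(re.sub(r"</?\w+>", "", line))
    return lines

# Tiny fabricated example in SRT layout.
sample = """1
00:00:01,000 --> 00:00:03,000
<i>Watson, look at this.</i>

2
00:00:03,500 --> 00:00:05,000
It's a three-pipe problem."""

dialogue = parse_srt(sample)
print(dialogue)  # ["Watson, look at this.", "It's a three-pipe problem."]
```

From here, consecutive lines could be paired into (prompt, reply) examples for the seq2seq model, though speaker attribution would still need SDH subtitles or manual labeling.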