Implementing clstm's network architecture in Keras #115

Zerithious · 2016-12-05T17:32:22Z

I wish to implement clstm's network architecture in Keras, to facilitate extension and modifications to the network, and also to add gpu support.

I'm pretty much a newbie to NNs, and I'm trying to wrap my head around the code and the inner workings of clstm. It would greatly help my effort if someone can answer these questions:

What is the exact architecture/topology of the network implemented in clstm? (I mean, what layers, what do they consist of and in which order).
How does it encode the images to be used as input to the network? Is it just the raw pixels of the image with varying width/height?
What other things would I need to know in order to implement this in Keras or a similar framework like tensorflow?

zuphilip · 2016-12-05T18:26:25Z

@tmbdev Tom, maybe you can share some insights for this questions. Moreover, I think you are already doing something with tensorflow.

amitdo · 2016-12-06T07:00:30Z

How does it encode the images to be used as input to the network?
Is it just the raw pixels of the image with varying width/height?

Yes. it takes raw pixels as input. The width is always 1 and the height is fixed - by default to 48.

amitdo · 2016-12-06T07:11:23Z

You might want to look at ocropy's lstm.py code, because I think it will be easier to understand. It was written by the same author who wrote clstm.

mittagessen · 2016-12-06T15:29:34Z

Keras already implements/utilizes LSTM nets (the network architecture) and Connectionist Temporal Classification (the loss function used for training) from both theano and tensorflow. See keras-team/keras#3436 for more details.

The only thing that's not part of keras is the preprocessing of line images, chiefly dewarping (see lineest.py) whose utility is somewhat questionable as I trained perfectly working models without it.

tmbdev · 2016-12-07T14:49:06Z

I already have an LSTM-based OCR for TensorFlow that works like CLSTM, but TensorFlow (and by extension, Keras) support for LSTMs is still less than ideal.

Zerithious · 2016-12-07T14:57:20Z

@tmbdev Can you link to the repo containing the tensorflow code? Did you mean this?

tmbdev · 2016-12-08T05:52:38Z

No, that's a multidimensional add-on to TensorFlow. The LSTM code is experimental and in a bunch of iPython notebooks. I'm working on a Torch version right now, which I think will work better.

wanghaisheng · 2017-03-29T09:38:18Z

@tmbdev any progress ?

rayush7 · 2017-04-03T11:02:05Z

@tmbdev I am also interested in torch or tensorflow/keras based implementation of clstm. Is there any update from your end?

chinakook · 2017-04-18T08:36:18Z

A tensorflow implementation of popular library CLSTM and OCROPY line reader
https://github.com/ferjad/tlstm
Does anyone want to modify the repository to the cudnnLSTM or cudnnPersistentRNN implementation, by which the training could speed up about 6x to 10x times ?

zuphilip added the question label Dec 5, 2016

kba mentioned this issue Dec 6, 2016

CTC in both Theano and Tensorflow along with image OCR example kba/awesome-ocr#42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementing clstm's network architecture in Keras #115

Implementing clstm's network architecture in Keras #115

Zerithious commented Dec 5, 2016

zuphilip commented Dec 5, 2016

amitdo commented Dec 6, 2016 •

edited

Loading

amitdo commented Dec 6, 2016

mittagessen commented Dec 6, 2016 •

edited

Loading

tmbdev commented Dec 7, 2016

Zerithious commented Dec 7, 2016

tmbdev commented Dec 8, 2016

wanghaisheng commented Mar 29, 2017

rayush7 commented Apr 3, 2017

chinakook commented Apr 18, 2017 •

edited

Loading

Implementing clstm's network architecture in Keras #115

Implementing clstm's network architecture in Keras #115

Comments

Zerithious commented Dec 5, 2016

zuphilip commented Dec 5, 2016

amitdo commented Dec 6, 2016 • edited Loading

amitdo commented Dec 6, 2016

mittagessen commented Dec 6, 2016 • edited Loading

tmbdev commented Dec 7, 2016

Zerithious commented Dec 7, 2016

tmbdev commented Dec 8, 2016

wanghaisheng commented Mar 29, 2017

rayush7 commented Apr 3, 2017

chinakook commented Apr 18, 2017 • edited Loading

amitdo commented Dec 6, 2016 •

edited

Loading

mittagessen commented Dec 6, 2016 •

edited

Loading

chinakook commented Apr 18, 2017 •

edited

Loading