Predictive-Corrective Networks for Action Detection

This is the source code for training and evaluating "Predictive-Corrective Networks".

Please file an issue if you run into any problems, or contact me.

Download models and data

To download the models and data, run:

```
bash download.sh
```

This will create a directory with the following structure:

```
data/
    <dataset>/
        models/
            vgg16-init.t7: Initial VGG-16 model pre-trained on ImageNet.
            vgg16-trained.t7: Trained VGG-16 single-frame model.
            pc_c33-1_fc7-8.t7: Trained predictive-corrective model.
        labels/
            trainval.h5: Train labels.
            test.h5: Test labels.
```
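As a sanity check after the download, a small script can verify that the expected files are in place. This is just a sketch following the layout above; the `check_dataset_dir` helper and the `data/multithumos` example path are my own, not part of the repository:

```python
import os

# Expected files under data/<dataset>/, following the layout above.
EXPECTED = [
    "models/vgg16-init.t7",
    "models/vgg16-trained.t7",
    "models/pc_c33-1_fc7-8.t7",
    "labels/trainval.h5",
    "labels/test.h5",
]

def check_dataset_dir(root):
    """Return the list of expected files missing under `root`."""
    return [p for p in EXPECTED if not os.path.isfile(os.path.join(root, p))]

# Example: check_dataset_dir("data/multithumos") should return []
# after a successful download.
```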

Currently only models for THUMOS/MultiTHUMOS are included, but we will release Charades models as soon as possible.

Dumping frames

Before running on any videos, you will need to dump frames (resized to 256x256) into a root directory which contains one subdirectory for each video. Each video subdirectory should contain frames of the form frame%04d.png (e.g. frame0012.png), extracted at 10 frames per second. If you would like to train or evaluate models at different frame rates, please file an issue or contact me and I can point you in the right direction.
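The naming convention above makes it easy to map between timestamps and frame filenames. A minimal sketch (the helper names are my own, and the 1-based numbering is an assumption based on the frame0012.png example and ffmpeg's default):

```python
FPS = 10  # frames are extracted at 10 frames per second

def frame_filename(index):
    """Filename for the given frame number, in frame%04d.png form."""
    return "frame%04d.png" % index

def frame_index_at(seconds):
    """Frame number covering the given timestamp, assuming
    1-based numbering (an assumption, matching ffmpeg's default)."""
    return int(seconds * FPS) + 1

# e.g. frame_filename(12) == "frame0012.png"
```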

You may find my dump_frames and resize_images scripts useful for this.

Running a pre-trained model

Store frames from your videos in one directory frames_root, with frames at frames_root/video_name/frame%04d.png as described above.

To evaluate the predictive-corrective model, run:

```
th scripts/evaluate_model.lua \
    --model data/multithumos/models/pc_c33-1_fc7-8.t7 \
    --frames /path/to/frames_root \
    --output_log /path/to/output.log \
    --sequence_length 8 \
    --step_size 1 \
    --batch_size 16 \
    --output_hdf5 /path/to/output_predictions.h5
```
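For reference, `--sequence_length` and `--step_size` control how frames are grouped into input windows. This sketch shows my reading of those flags; it is not the repository's actual data loader:

```python
def evaluation_windows(num_frames, sequence_length=8, step_size=1):
    """Start/end frame indices (0-indexed, end-exclusive) of each
    input window, sliding the window by `step_size` frames."""
    windows = []
    start = 0
    while start + sequence_length <= num_frames:
        windows.append((start, start + sequence_length))
        start += step_size
    return windows

# With the defaults above, a 10-frame video yields windows
# (0, 8), (1, 9), (2, 10).
```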

Training a model

Single-frame

To train a single-frame model, look at config/config-vgg.yaml. Documentation for each config parameter is available in main.lua, but the only ones you really need to change are the paths to the training and test frames:

```
train_source_options:
    frames_root: '/path/to/multithumos/trainval/frames'
    labels_hdf5: 'data/multithumos/labels/trainval.h5'

val_source_options:
    frames_root: '/path/to/multithumos/test/frames'
    labels_hdf5: 'data/multithumos/labels/test.h5'
```

Once you have updated these, run:

```
th main.lua config/config-vgg.yaml /path/to/output/directory
```

Predictive Corrective

First, generate a predictive-corrective model initialized from a trained single-frame model, as follows:

```
th scripts/make_predictive_corrective.lua \
    --model data/multithumos/models/vgg16-trained.t7 \
    --output data/multithumos/models/pc_c33-1_fc7-8-init.t7
```
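For context, the predictive-corrective recurrence computes activations directly on a "reinitialization" frame and then corrects them using responses to frame differences. A toy sketch with a scalar function standing in for a network layer (the function names are my own, not the repository's):

```python
def predictive_corrective(frames, f, reinit_period=8):
    """Toy predictive-corrective recurrence. On a reinitialization
    frame, compute y_t = f(x_t) from scratch; otherwise, correct the
    previous output with the response to the frame difference:
        y_t = y_{t-1} + f(x_t - x_{t-1}).
    Here `f` plays the role of a network layer."""
    outputs = []
    for t, x in enumerate(frames):
        if t % reinit_period == 0:
            y = f(x)
        else:
            y = outputs[-1] + f(x - frames[t - 1])
        outputs.append(y)
    return outputs

# For a linear f, the corrected outputs match computing f on every
# frame from scratch, while only the reinitialization frames require
# a full forward pass.
```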

Next, update config/config-predictive-corrective.yaml to point to your dumped frames, as described above. Then, run:

```
th main.lua config/config-predictive-corrective.yaml /path/to/output/directory
```

This usually takes 2-3 days to run on 4 GPUs.

Required packages

Note: This is an incomplete list! TODO(achald): Document all required packages.

  • argparse
  • classic
  • cudnn
  • cutorch
  • luaposix
  • lyaml
  • nnlr
  • rnn

Caution: Other configs/scripts

Please note that there are a number of other scripts and configs in this repository that are not well documented. I am sharing them in case any of them are useful, for example to see how I use the model, but beware that they may be broken and I may not be able to help you fix them.

Extra: Generate labels hdf5 files

For convenience, we provide the labels for the datasets we use as HDF5 files. However, it is possible to generate these yourself. Here is the script I used to generate MultiTHUMOS labels HDF5, and here is a similar script for Charades. These are not very well documented, but feel free to contact me if you run into any issues.
