Code for benchmarking contrastive VLMs on zero- and few-shot activity recognition. This codebase is used for evaluating methods in our paper: Few-Shot Classification of Interactive Activities of Daily Living (InteractADL).
For information on how to download InteractADL, please see the official InteractADL page.
The setup instructions for our method evaluation suite (this codebase) and for other activity recognition datasets can be found in SETUP.md.
If you create a GitHub issue in this repo, we will do our best to help you resolve it!
The main entry point for our experiments is hyperparam_search.py. We launch a run with a given VLM, classifier, dataset, and number of shots with:

```
python hyperparam_search.py <VLM> <classifier> --dataset <dataset> --n_shots <number_of_shots>
```

This is how we report all results in our paper, unless specified otherwise.
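For example, a hypothetical invocation might look like the one below; the argument values are placeholders, and the identifiers the codebase actually accepts are documented in hyperparam_search.py:

```
python hyperparam_search.py clip linear_probe --dataset interactadl --n_shots 5
```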
By default, for all classifiers and datasets, we find the optimal set of hyperparameters on the validation set and then report results from a fresh run of those hyperparameters on the test set, which avoids spurious results from testing many hyperparameter settings for our methods. We include our explicit hyperparameter tuning process in this codebase for transparency and reproducibility, and note that we only search over a small number of hyperparameters (<5 for each classifier) for fair comparison. We use default hyperparameter settings from papers and codebases whenever they are available.
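To make the protocol concrete, here is a minimal Python sketch of the tuning loop. The function `train_and_evaluate`, the hyperparameter grid, and the split names are illustrative assumptions, not the actual interface of hyperparam_search.py; the point is only that the best configuration is chosen purely on the validation split, and a single fresh run of that configuration on the test split is what gets reported.

```python
from itertools import product


def train_and_evaluate(split: str, lr: float, weight_decay: float, seed: int = 0) -> float:
    """Hypothetical stand-in: train a few-shot classifier and return accuracy on `split`.

    Returns a dummy score here so the sketch runs end to end; it is not a real metric.
    """
    return 1.0 / (1.0 + abs(lr - 1e-3) + weight_decay)


def run_search() -> float:
    # Small grid (<5 hyperparameters per classifier), searched only on the validation set.
    search_space = {
        "lr": [1e-4, 1e-3, 1e-2],
        "weight_decay": [0.0, 1e-4],
    }
    keys = list(search_space)

    best_config, best_val_acc = None, float("-inf")
    for values in product(*(search_space[k] for k in keys)):
        config = dict(zip(keys, values))
        val_acc = train_and_evaluate("val", **config)
        if val_acc > best_val_acc:
            best_config, best_val_acc = config, val_acc

    # Fresh run with the selected hyperparameters; only this number is reported,
    # so the test set is never used to pick hyperparameters.
    return train_and_evaluate("test", **best_config, seed=1)


if __name__ == "__main__":
    print(f"test accuracy: {run_search():.3f}")
```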