A Pytorch implementation for "Your Interest, Your Summaries: Query-Focused Long Video Summarization (FCSNA-QFVS)" [ICARCV, 2024]
Note: This repo still requires some work and will keep updating it.
Your Interest, Your Summaries: Query-Focused Long Video Summarization
Nirav Patel
Payal Prajapati
Maitrik Shah
L. D. College of Engineering
ICARCV 2024
Generating a concise and informative video summary from a long video is important, yet subjective due to varying scene importance. Users' ability to specify scene importance through text queries enhances the relevance of such summaries. This paper introduces an approach for query-focused video summarization, aiming to align video summaries closely with user queries. To this end, we propose the Fully Convolutional Sequence Network with Attention (FCSNA-QFVS), a novel approach designed for this task. Leveraging temporal convolutional and attention mechanisms, our model effectively extracts and highlights relevant content based on user-specified queries. Experimental validation on a benchmark dataset for query-focused video summarization demonstrates the effectiveness of our approach.
The evaluation code is being borrowed from EgoVLPv2.
The code is inspired by CHAN and FCSN implementation.
If you find our work helpful, please cite:
@misc{patel2024interestsummariesqueryfocusedlong,
title={Your Interest, Your Summaries: Query-Focused Long Video Summarization},
author={Nirav Patel and Payal Prajapati and Maitrik Shah},
year={2024},
eprint={2410.14087},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2410.14087},
}