Introduction

Official Pytorch implementation for Neural Video and Image Compression including:

Neural Video Codec
- DCVC: Deep Contextual Video Compression, NeurIPS 2021, in this folder.
- DCVC-TCM: Temporal Context Mining for Learned Video Compression, in IEEE Transactions on Multimedia, and arxiv, in this folder.
- DCVC-HEM: Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression, ACM MM 2022, in this folder.
  - The first end-to-end neural video codec to exceed H.266 (VTM) using the highest compression ratio configuration, in terms of both PSNR and MS-SSIM.
  - The first end-to-end neural video codec to achieve rate adjustment in single model.
- DCVC-DC: Neural Video Compression with Diverse Contexts, CVPR 2023, in this folder.
  - The first end-to-end neural video codec to exceed ECM using the highest compression ratio low delay configuration with a intra refresh period roughly to one second (32 frames), in terms of PSNR and MS-SSIM for RGB content.
  - The first end-to-end neural video codec to exceed ECM using the highest compression ratio low delay configuration with a intra refresh period roughly to one second (32 frames), in terms of PSNR for YUV420 content.
- DCVC-FM: Neural Video Compression with Feature Modulation, CVPR 2024, in this folder.
  - The first end-to-end neural video codec to exceed ECM using the highest compression ratio low delay configuration with only one intra frame, in terms of PSNR for both YUV420 content and RGB content in a single model.
  - The first end-to-end neural video codec that support a large quality and bitrate range in a single model.
Neural Image Codec
- EVC: Towards Real-Time Neural Image Compression with Mask Decay, ICLR 2023, in this folder.

Pretrained models

As a backup, all the pretrained models could be found here.

On the comparison

Please note that different methods may use different configurations to test different models, such as

Source video may be different, e.g., cropped or padded to the desired resolution.
Intra period may be different, e.g., 96, 32, 12, or 10.
Number of encoded frames may be different.

So, it does not make sense to compare the numbers in different methods directly, unless making sure they are using same test conditions.

Please find more details on the test conditions.

Acknowledgement

The implementation is based on CompressAI and PyTorchVideoCompression.

Citation

If you find this work useful for your research, please cite:

@article{li2021deep,
  title={Deep Contextual Video Compression},
  author={Li, Jiahao and Li, Bin and Lu, Yan},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

@article{sheng2022temporal,
  title={Temporal context mining for learned video compression},
  author={Sheng, Xihua and Li, Jiahao and Li, Bin and Li, Li and Liu, Dong and Lu, Yan},
  journal={IEEE Transactions on Multimedia},
  year={2022},
  publisher={IEEE}
}

@inproceedings{li2022hybrid,
  title={Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression},
  author={Li, Jiahao and Li, Bin and Lu, Yan},
  booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
  year={2022}
}

@inproceedings{li2023neural,
  title={Neural Video Compression with Diverse Contexts},
  author={Li, Jiahao and Li, Bin and Lu, Yan},
  booktitle={{IEEE/CVF} Conference on Computer Vision and Pattern Recognition,
             {CVPR} 2023, Vancouver, Canada, June 18-22, 2023},
  year={2023}
}

@inproceedings{li2024neural,
  title={Neural Video Compression with Feature Modulation},
  author={Li, Jiahao and Li, Bin and Lu, Yan},
  booktitle={{IEEE/CVF} Conference on Computer Vision and Pattern Recognition,
             {CVPR} 2024, Seattle, WA, USA, June 17-21, 2024},
  year={2024}
}

@inproceedings{wang2023EVC,
  title={EVC: Towards Real-Time Neural Image Compression with Mask Decay},
  author={Wang, Guo-Hua and Li, Jiahao and Li, Bin and Lu, Yan},
  booktitle={International Conference on Learning Representations},
  year={2023}
}

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft’s Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.

Name	Name	Last commit message	Last commit date
Latest commit yaohualibin Update model link on the front page. Jan 20, 2025 b67129d · Jan 20, 2025 History 25 Commits
.github/workflows	.github/workflows	Create codeql-analysis.yml	Mar 30, 2022
DCVC-DC	DCVC-DC	bug fix for batch size other than 1 in offset diversity.	Feb 28, 2024
DCVC-FM	DCVC-FM	code and model for the paper Neural Video Compression with Feature Mo…	Feb 28, 2024
DCVC-HEM	DCVC-HEM	rename folders to align with model names.	Mar 14, 2023
DCVC-TCM	DCVC-TCM	rename folders to align with model names.	Mar 14, 2023
DCVC	DCVC	rename folders to align with model names.	Mar 14, 2023
EVC	EVC	rename folders to align with model names.	Mar 14, 2023
assets	assets	add more details on the test pipeline.	Mar 1, 2023
.flake8	.flake8	code and model for the paper EVC: Towards Real-Time Neural Image Comp…	Feb 14, 2023
.gitignore	.gitignore	code and model for the paper Hybrid Spatial-Temporal Entropy Modellin…	Jul 14, 2022
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	initial commit of DCVC.	Mar 22, 2022
CONTRIBUTING.md	CONTRIBUTING.md	initial commit of DCVC.	Mar 22, 2022
LICENSE.txt	LICENSE.txt	initial commit of DCVC.	Mar 22, 2022
NOTICE .txt	NOTICE .txt	add license notice for PyTorchVideoCompression	Mar 29, 2022
README.md	README.md	Update model link on the front page.	Jan 20, 2025
SECURITY.md	SECURITY.md	initial commit of DCVC.	Mar 22, 2022
azure-pipelines.yml	azure-pipelines.yml	Update azure-pipelines.yml	Mar 30, 2022
test_conditions.md	test_conditions.md	add more details on the test pipeline.	Mar 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Pretrained models

On the comparison

Acknowledgement

Citation

Trademarks

About

Releases

Packages

Contributors 4

Languages

License

microsoft/DCVC

Folders and files

Latest commit

History

Repository files navigation

Introduction

Pretrained models

On the comparison

Acknowledgement

Citation

Trademarks

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages