My research revolves around multi-model understanding (image, video and text).
- Shanghai, China
- https://scholar.google.com/citations?user=xZ-0R3cAAAAJ
Pinned Loading
-
Sign-Language-Datasets
Sign-Language-Datasets PublicIntro of some sign language datasets suitable for research
-
statistical_learning_homework
statistical_learning_homework PublicThis is the programming exercises for statistical learning in USTC. (统计学习)
Python 3
-
-
-
Visual-AI/PruneVid
Visual-AI/PruneVid PublicThe official repository for paper "PruneVid: Visual Token Pruning for Efficient Video Large Language Models".
Python 32
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.