Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

大佬请问这个视频理解可以定位视频时间或者帧吗 #233

Open
tppqt opened this issue Jan 24, 2025 · 1 comment
Open

大佬请问这个视频理解可以定位视频时间或者帧吗 #233

tppqt opened this issue Jan 24, 2025 · 1 comment

Comments

@tppqt
Copy link

tppqt commented Jan 24, 2025

比如我希望让AI帮我找到某个场景或者某个物体出现在我上传视频中的时间点或者帧,这个能做到吗?

@leexinhao
Copy link
Collaborator

可以的,不过目前不算很精准,可以试试VideoChat-Flash或者InternVideo2.5-Chat,用这个prompt:

    pre_prompt: "Please find the visual event described by a sentence in the video, determining its starting and ending times. The format should be: 'The event happens in the start time - end time'. For example, The event 'person turn a light on' happens in the 24.3 - 30.4 seonds. Now I will give you the textual sentence: "
    post_prompt: "Please return its start time and end time."

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants