Inquiry About Comparison with WF-VAE #3

Open
LAW1223 opened this issue Dec 26, 2024 · 1 comment

LAW1223 commented Dec 26, 2024

Hello, I’ve been following your work, and I find it particularly interesting and helpful. May I ask two questions?

  1. I'd like to learn more about how your work compares to the approach used in WF-VAE (https://github.com/PKU-YuanGroup/WF-VAE). Would you be willing to share some insights on this?

  2. How about the inference speed?

Collaborator

annitang1997 commented Dec 28, 2024

Thank you for your interest in our work.

  1. We have tested the open-source WF-VAE model. The comparison results under the same setting (causal model, video compression ratio: 4×8×8, input shape: 17×256×256, test data: MCL-JCV, 30 FPS) are as follows:

| Method | Param. | PSNR | SSIM | LPIPS | FVD |
| --- | --- | --- | --- | --- | --- |
| WF-VAE-L-16chn | 317M | 33.76 | 0.928 | 0.091 | 90.8 |
| VidTok-16chn | 157M | 35.04 | 0.942 | 0.047 | 78.9 |

  2. As for the inference time, we will provide updates on it later.
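
For reference, here is a rough sketch of what the 4×8×8 compression ratio implies for a 17×256×256 input. It assumes the common causal-VAE convention in which the first frame is encoded on its own and the remaining frames are compressed in groups of four. That convention and the helper name `causal_latent_shape` are assumptions for illustration and are not taken from either repository.

```python
def causal_latent_shape(frames, height, width, t_ratio=4, s_ratio=8):
    """Return the latent (T, H, W) for a causal video tokenizer.

    Assumed convention (hypothetical, not confirmed by the thread):
    the first frame maps to one latent frame, and every further group
    of `t_ratio` frames maps to one more latent frame.
    """
    if frames < 1 or (frames - 1) % t_ratio != 0:
        raise ValueError("frames must equal 1 + k * t_ratio")
    if height % s_ratio != 0 or width % s_ratio != 0:
        raise ValueError("height and width must be divisible by s_ratio")
    latent_t = 1 + (frames - 1) // t_ratio
    return (latent_t, height // s_ratio, width // s_ratio)


# The setting quoted above: 17 frames at 256x256 with ratio 4x8x8.
print(causal_latent_shape(17, 256, 256))  # (5, 32, 32)
```

Under this assumption both 16-channel models in the table would produce a 16×5×32×32 latent for each clip, which is why the comparison is reported at a matched compression ratio.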
