Title: Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality

Oct 07, 2024

Ge Ya Luo, Gian Favero, Zhi Hao Luo, Alexia Jolicoeur-Martineau, Christopher Pal

Abstract: The Fréchet Video Distance (FVD) is a widely adopted metric for evaluating the distributional quality of generated video. However, its effectiveness relies on critical assumptions. Our analysis reveals three significant limitations: (1) the non-Gaussianity of the Inflated 3D Convnet (I3D) feature space; (2) the insensitivity of I3D features to temporal distortions; (3) the impractical sample sizes required for reliable estimation. These findings undermine FVD's reliability and show that FVD falls short as a standalone metric for video generation evaluation. After extensive analysis of a wide range of metrics and backbone architectures, we propose JEDi, the JEPA Embedding Distance, which is based on features derived from a Joint Embedding Predictive Architecture and measured using Maximum Mean Discrepancy with a polynomial kernel. Our experiments on multiple open-source datasets show clear evidence that JEDi is a superior alternative to the widely used FVD metric: it requires only 16% of the samples to reach its steady value while increasing alignment with human evaluation by 34%, on average.
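To illustrate the kind of statistic the abstract describes, the following is a minimal sketch of an unbiased squared Maximum Mean Discrepancy estimate with a polynomial kernel, computed between two sets of feature vectors. It is not the authors' implementation. The function names are hypothetical, the kernel settings (degree 3, gamma = 1/dimension, offset 1) are assumed defaults rather than values stated in the abstract, and random vectors stand in for the JEPA embeddings of real and generated videos.

```python
import numpy as np


def polynomial_kernel(x, y, degree=3, gamma=None, coef0=1.0):
    """Polynomial kernel k(a, b) = (gamma * <a, b> + coef0) ** degree.

    The defaults are assumptions for illustration, not values from the paper.
    """
    if gamma is None:
        gamma = 1.0 / x.shape[1]  # scale the dot product by feature dimension
    return (gamma * (x @ y.T) + coef0) ** degree


def mmd2_unbiased(real_feats, gen_feats, **kernel_args):
    """Unbiased estimate of squared MMD between two feature sets.

    real_feats has shape (m, d) and gen_feats has shape (n, d).
    The estimate can be slightly negative when the two sets match closely.
    """
    m, n = real_feats.shape[0], gen_feats.shape[0]
    k_xx = polynomial_kernel(real_feats, real_feats, **kernel_args)
    k_yy = polynomial_kernel(gen_feats, gen_feats, **kernel_args)
    k_xy = polynomial_kernel(real_feats, gen_feats, **kernel_args)
    # Within-set terms exclude the diagonal (each sample paired with itself).
    term_xx = (k_xx.sum() - np.trace(k_xx)) / (m * (m - 1))
    term_yy = (k_yy.sum() - np.trace(k_yy)) / (n * (n - 1))
    # The cross term averages over all (real, generated) pairs.
    term_xy = k_xy.mean()
    return term_xx + term_yy - 2.0 * term_xy


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Placeholder features: in practice these would be video embeddings.
    real = rng.normal(size=(200, 16))
    matched = rng.normal(size=(200, 16))
    shifted = rng.normal(loc=0.5, size=(200, 16))
    print("matched distribution:", mmd2_unbiased(real, matched))
    print("shifted distribution:", mmd2_unbiased(real, shifted))
```

Unlike a Fréchet distance, this estimate does not fit a Gaussian to either feature set, which is relevant to the first limitation listed in the abstract. In the demo, the shifted set should score noticeably higher than the matched one.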
