Abstract: Modern visual effects (VFX) software has made it possible for skilled artists to create imagery of virtually anything. However, the creation process remains laborious, complex, and largely inaccessible to everyday users. In this work, we present AutoVFX, a framework that automatically creates realistic and dynamic VFX videos from a single video and natural language instructions. By carefully integrating neural scene modeling, LLM-based code generation, and physical simulation, AutoVFX provides physically grounded, photorealistic editing effects that can be controlled directly through natural language instructions. We conduct extensive experiments to validate AutoVFX's efficacy across a diverse spectrum of videos and instructions. Quantitative and qualitative results suggest that AutoVFX outperforms all competing methods by a large margin in generative quality, instruction alignment, editing versatility, and physical plausibility.
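The abstract describes a three-stage pipeline (neural scene modeling, LLM-based program generation, physical simulation) without implementation detail. The sketch below is a hypothetical illustration of that control flow, not the released AutoVFX code: every name (Scene, reconstruct_scene, llm_generate_program, simulate_and_render) is an assumed placeholder, and each stage is stubbed so the flow runs end to end.

```python
# Hypothetical sketch of the pipeline the abstract describes; none of these
# names come from the AutoVFX release. Each stage is a stub standing in for
# the real component.
from dataclasses import dataclass, field

@dataclass
class Scene:
    """Stand-in for a reconstructed neural scene model."""
    meshes: list = field(default_factory=list)

def reconstruct_scene(video_path: str) -> Scene:
    # Placeholder for neural scene modeling (geometry + appearance from video).
    return Scene()

def llm_generate_program(instruction: str) -> str:
    # Placeholder: an LLM would emit code against a documented editing API here.
    return "scene.meshes.append('falling_ball')"

def simulate_and_render(scene: Scene) -> list:
    # Placeholder for physical simulation and photorealistic rendering.
    return [f"frame with {scene.meshes}"]

def auto_vfx(video_path: str, instruction: str) -> list:
    scene = reconstruct_scene(video_path)
    program = llm_generate_program(instruction)
    exec(program, {"scene": scene})       # run the generated editing program
    return simulate_and_render(scene)

print(auto_vfx("input.mp4", "drop a bouncing ball onto the table"))
```

The key design point the abstract implies is that the LLM writes an editing program rather than pixels, so the physical simulator, not the language model, guarantees plausibility of the resulting effect.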
Abstract: We present Neural Mixtures of Planar Experts (NeurMiPs), a novel planar-based scene representation for modeling geometry and appearance. NeurMiPs leverages a collection of local planar experts in 3D space as the scene representation. Each planar expert consists of the parameters of a local rectangular shape, representing geometry, and a neural radiance field modeling color and opacity. We render novel views by calculating ray-plane intersections and compositing the output colors and densities at the intersected points into the image. NeurMiPs blends the efficiency of explicit mesh rendering with the flexibility of neural radiance fields. Experiments demonstrate the superior performance and speed of our method compared to other 3D representations in novel view synthesis.
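The rendering procedure in the abstract (ray-plane intersection against bounded rectangles, then front-to-back compositing of per-expert colors and opacities) can be made concrete with a short sketch. This is a minimal illustration under assumed interfaces, not the authors' implementation; ray_plane_hit, render_ray, radiance_fn, and the expert dictionary layout are all hypothetical.

```python
# Minimal sketch of the NeurMiPs rendering idea: intersect a camera ray with
# each rectangular planar expert, query a per-expert radiance field at the hit
# points, and alpha-composite front to back. All names are hypothetical.
import numpy as np

def ray_plane_hit(origin, direction, center, normal, axes, half_sizes):
    """Return (hit point, depth) if the ray meets the bounded rectangle, else None."""
    denom = direction @ normal
    if abs(denom) < 1e-8:                     # ray parallel to the plane
        return None
    t = ((center - origin) @ normal) / denom
    if t <= 0:                                # intersection behind the camera
        return None
    hit = origin + t * direction
    local = hit - center
    u, v = local @ axes[0], local @ axes[1]   # coordinates within the plane
    if abs(u) > half_sizes[0] or abs(v) > half_sizes[1]:
        return None                           # outside the rectangle
    return hit, t

def render_ray(origin, direction, experts, radiance_fn):
    """Composite colors/opacities of all intersected experts, sorted by depth."""
    hits = []
    for e in experts:
        res = ray_plane_hit(origin, direction, e["center"], e["normal"],
                            e["axes"], e["half_sizes"])
        if res is not None:
            hits.append((res[1], res[0], e))  # (depth, hit point, expert)
    hits.sort(key=lambda h: h[0])             # front-to-back order
    color, transmittance = np.zeros(3), 1.0
    for t, point, e in hits:
        c, alpha = radiance_fn(e, point, direction)  # per-expert radiance field
        color += transmittance * alpha * c
        transmittance *= 1.0 - alpha
        if transmittance < 1e-3:              # early exit once nearly opaque
            break
    return color

# Toy usage: one unit rectangle facing the camera, a constant dummy radiance field.
expert = {"center": np.array([0., 0., 2.]),
          "normal": np.array([0., 0., -1.]),
          "axes": np.array([[1., 0., 0.], [0., 1., 0.]]),
          "half_sizes": (1.0, 1.0)}
constant = lambda e, p, d: (np.array([1., 0., 0.]), 0.8)
print(render_ray(np.zeros(3), np.array([0., 0., 1.]), [expert], constant))
```

Compositing only at a sparse set of ray-plane intersections, rather than at dense samples along the ray, is what gives the representation its mesh-like rendering efficiency while the per-expert radiance fields retain view-dependent appearance.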