Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ahmet Bilican

FG-DFPN: Flow Guided Deformable Frame Prediction Network

Mar 14, 2025

M. Akın Yılmaz, Ahmet Bilican, A. Murat Tekalp

Abstract:Video frame prediction remains a fundamental challenge in computer vision with direct implications for autonomous systems, video compression, and media synthesis. We present FG-DFPN, a novel architecture that harnesses the synergy between optical flow estimation and deformable convolutions to model complex spatio-temporal dynamics. By guiding deformable sampling with motion cues, our approach addresses the limitations of fixed-kernel networks when handling diverse motion patterns. The multi-scale design enables FG-DFPN to simultaneously capture global scene transformations and local object movements with remarkable precision. Our experiments demonstrate that FG-DFPN achieves state-of-the-art performance on eight diverse MPEG test sequences, outperforming existing methods by 1dB PSNR while maintaining competitive inference speeds. The integration of motion cues with adaptive geometric transformations makes FG-DFPN a promising solution for next-generation video processing systems that require high-fidelity temporal predictions. The model and instructions to reproduce our results will be released at: https://github.com/KUIS-AI-Tekalp-Research Group/frame-prediction

* Submitted to 33th European Signal Processing Conference (EUSIPCO) 2025

Via

Access Paper or Ask Questions

Motion-Adaptive Inference for Flexible Learned B-Frame Compression

Feb 13, 2024

M. Akin Yilmaz, O. Ugur Ulas, Ahmet Bilican, A. Murat Tekalp

Figure 1 for Motion-Adaptive Inference for Flexible Learned B-Frame Compression

Figure 2 for Motion-Adaptive Inference for Flexible Learned B-Frame Compression

Figure 3 for Motion-Adaptive Inference for Flexible Learned B-Frame Compression

Figure 4 for Motion-Adaptive Inference for Flexible Learned B-Frame Compression

Abstract:While the performance of recent learned intra and sequential video compression models exceed that of respective traditional codecs, the performance of learned B-frame compression models generally lag behind traditional B-frame coding. The performance gap is bigger for complex scenes with large motions. This is related to the fact that the distance between the past and future references vary in hierarchical B-frame compression depending on the level of hierarchy, which causes motion range to vary. The inability of a single B-frame compression model to adapt to various motion ranges causes loss of performance. As a remedy, we propose controlling the motion range for flow prediction during inference (to approximately match the range of motions in the training data) by downsampling video frames adaptively according to amount of motion and level of hierarchy in order to compress all B-frames using a single flexible-rate model. We present state-of-the-art BD rate results to demonstrate the superiority of our proposed single-model motion-adaptive inference approach to all existing learned B-frame compression models.

* 7 pages, submitted to IEEE ICIP 2024

Via

Access Paper or Ask Questions