Picture for Meng Wang

Meng Wang

School of Electronic and Information Engineering Liaoning Technical University Xingcheng City, Liaoning Province, P. R. China

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding

Add code
Dec 17, 2024
Viaarxiv icon

Repetitive Action Counting with Hybrid Temporal Relation Modeling

Add code
Dec 10, 2024
Viaarxiv icon

Towards Pixel-Level Prediction for Gaze Following: Benchmark and Approach

Add code
Nov 30, 2024
Viaarxiv icon

LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis

Add code
Nov 29, 2024
Figure 1 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 2 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 3 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 4 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Viaarxiv icon

Multimodal Crash Likelihood Prediction: A Complexity-Infused Approach Integrating Semantic, Contextual, and Driving Features

Add code
Nov 26, 2024
Viaarxiv icon

An Information-Theoretic Regularizer for Lossy Neural Image Compression

Add code
Nov 23, 2024
Viaarxiv icon

Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning

Add code
Nov 21, 2024
Viaarxiv icon

Compact Visual Data Representation for Green Multimedia -- A Human Visual System Perspective

Add code
Nov 21, 2024
Viaarxiv icon

Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Towards Open-Vocabulary Audio-Visual Event Localization

Add code
Nov 18, 2024
Viaarxiv icon