Picture for Meiqi Wu

Meiqi Wu

Artifact-Aware Evaluation for High-Quality Video Generation

Add code
Jan 28, 2026
Viaarxiv icon

Latent Temporal Discrepancy as Motion Prior: A Loss-Weighting Strategy for Dynamic Fidelity in T2V

Add code
Jan 28, 2026
Viaarxiv icon

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Add code
Dec 30, 2025
Viaarxiv icon

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Add code
Dec 30, 2025
Viaarxiv icon

Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation

Add code
Aug 12, 2025
Viaarxiv icon

VS-LLM: Visual-Semantic Depression Assessment based on LLM for Drawing Projection Test

Add code
Aug 07, 2025
Viaarxiv icon

VMBench: A Benchmark for Perception-Aligned Video Motion Generation

Add code
Mar 13, 2025
Viaarxiv icon

Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World

Add code
Dec 27, 2024
Figure 1 for Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
Figure 2 for Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
Figure 3 for Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
Figure 4 for Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
Viaarxiv icon

How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking

Add code
Nov 23, 2024
Figure 1 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 2 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 3 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 4 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Viaarxiv icon

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM

Add code
Oct 03, 2024
Figure 1 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 2 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 3 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 4 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Viaarxiv icon