Picture for Mingyu Liu

Mingyu Liu

PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training

Add code
Mar 09, 2025
Viaarxiv icon

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Add code
Feb 25, 2025
Viaarxiv icon

TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes

Add code
Feb 04, 2025
Figure 1 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 2 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 3 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 4 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Viaarxiv icon

MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation

Add code
Nov 22, 2024
Figure 1 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Figure 2 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Figure 3 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Figure 4 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Viaarxiv icon

WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection

Add code
Jul 30, 2024
Viaarxiv icon

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Add code
Jul 29, 2024
Viaarxiv icon

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence

Add code
Jul 23, 2024
Figure 1 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Figure 2 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Figure 3 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Figure 4 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Viaarxiv icon

Look Within, Why LLMs Hallucinate: A Causal Perspective

Add code
Jul 14, 2024
Viaarxiv icon

A Transformer variant for multi-step forecasting of water level and hydrometeorological sensitivity analysis based on explainable artificial intelligence technology

Add code
May 22, 2024
Viaarxiv icon

GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs

Add code
May 10, 2024
Figure 1 for GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs
Figure 2 for GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs
Figure 3 for GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs
Figure 4 for GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs
Viaarxiv icon