Picture for Sicheng Zhao

Sicheng Zhao

LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs

Add code
Mar 14, 2025
Viaarxiv icon

FastVID: Dynamic Density Pruning for Fast Video Large Language Models

Add code
Mar 14, 2025
Viaarxiv icon

Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation

Add code
Dec 18, 2024
Figure 1 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Figure 2 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Figure 3 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Figure 4 for Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation
Viaarxiv icon

From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision

Add code
Dec 15, 2024
Viaarxiv icon

HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator

Add code
Nov 26, 2024
Figure 1 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 2 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 3 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 4 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Viaarxiv icon

TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval

Add code
Sep 02, 2024
Viaarxiv icon

Multi-source Domain Adaptation for Panoramic Semantic Segmentation

Add code
Aug 29, 2024
Figure 1 for Multi-source Domain Adaptation for Panoramic Semantic Segmentation
Figure 2 for Multi-source Domain Adaptation for Panoramic Semantic Segmentation
Figure 3 for Multi-source Domain Adaptation for Panoramic Semantic Segmentation
Figure 4 for Multi-source Domain Adaptation for Panoramic Semantic Segmentation
Viaarxiv icon

A Survey of Embodied Learning for Object-Centric Robotic Manipulation

Add code
Aug 21, 2024
Figure 1 for A Survey of Embodied Learning for Object-Centric Robotic Manipulation
Figure 2 for A Survey of Embodied Learning for Object-Centric Robotic Manipulation
Figure 3 for A Survey of Embodied Learning for Object-Centric Robotic Manipulation
Figure 4 for A Survey of Embodied Learning for Object-Centric Robotic Manipulation
Viaarxiv icon

LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image

Add code
Aug 14, 2024
Figure 1 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 2 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 3 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 4 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Viaarxiv icon

Local Manifold Learning for No-Reference Image Quality Assessment

Add code
Jun 27, 2024
Figure 1 for Local Manifold Learning for No-Reference Image Quality Assessment
Figure 2 for Local Manifold Learning for No-Reference Image Quality Assessment
Figure 3 for Local Manifold Learning for No-Reference Image Quality Assessment
Figure 4 for Local Manifold Learning for No-Reference Image Quality Assessment
Viaarxiv icon