Xuecheng Wu

HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs

Jun 16, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

May 22, 2025

ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations

May 20, 2025

TokenFocus-VQA: Enhancing Text-to-Image Alignment with Position-Aware Focus and Multi-Perspective Aggregations on LVLMs

Apr 10, 2025

3A-YOLO: New Real-Time Object Detectors with Triple Discriminative Awareness and Coordinated Representations

Dec 10, 2024

eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos

Nov 29, 2023

Emotion Recognition by Video: A review

Oct 26, 2023

Research on Mask Wearing Detection of Natural Population Based on Improved YOLOv4

Aug 24, 2022

ICANet: A Method of Short Video Emotion Recognition Driven by Multimodal Data

Aug 24, 2022