Hao Yin

Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference

Mar 17, 2025

ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large Language Models

Mar 17, 2025

A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions

Feb 05, 2025

Perturbation Ontology based Graph Attention Networks

Nov 27, 2024

Navigating Spatial Inequities in Freight Truck Crash Severity via Counterfactual Inference in Los Angeles

Nov 26, 2024

EvSign: Sign Language Recognition and Translation with Streaming Events

Jul 17, 2024

Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation

Jun 11, 2024

Learning for Semantic Knowledge Base-Guided Online Feature Transmission in Dynamic Channels

Nov 30, 2023

PromptSpeaker: Speaker Generation Based on Text Descriptions

Oct 08, 2023

Ground-Challenge: A Multi-sensor SLAM Dataset Focusing on Corner Cases for Ground Robots

Jul 08, 2023