Picture for Richang Hong

Richang Hong

StgcDiff: Spatial-Temporal Graph Condition Diffusion for Sign Language Transition Generation

Add code
Jun 16, 2025
Viaarxiv icon

Understanding and Benchmarking the Trustworthiness in Multimodal LLMs for Video Understanding

Add code
Jun 14, 2025
Viaarxiv icon

Wi-CBR: WiFi-based Cross-domain Behavior Recognition via Multimodal Collaborative Awareness

Add code
Jun 13, 2025
Viaarxiv icon

SignAligner: Harmonizing Complementary Pose Modalities for Coherent Sign Language Generation

Add code
Jun 13, 2025
Viaarxiv icon

DragNeXt: Rethinking Drag-Based Image Editing

Add code
Jun 09, 2025
Viaarxiv icon

Learning Speaker-Invariant Visual Features for Lipreading

Add code
Jun 09, 2025
Viaarxiv icon

Rebalancing Contrastive Alignment with Learnable Semantic Gaps in Text-Video Retrieval

Add code
May 18, 2025
Viaarxiv icon

VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection

Add code
May 05, 2025
Viaarxiv icon

Invariance Matters: Empowering Social Recommendation via Graph Invariant Learning

Add code
Apr 14, 2025
Viaarxiv icon

A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions

Add code
Apr 12, 2025
Viaarxiv icon