Picture for Hongchen Luo

Hongchen Luo

University of Science and Technology of China, China

Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning

Add code
Mar 10, 2025
Viaarxiv icon

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

Add code
Nov 29, 2024
Viaarxiv icon

Leverage Task Context for Object Affordance Ranking

Add code
Nov 25, 2024
Viaarxiv icon

Visual-Geometric Collaborative Guidance for Affordance Learning

Add code
Oct 15, 2024
Viaarxiv icon

VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection

Add code
Sep 30, 2024
Viaarxiv icon

Grounding 3D Scene Affordance From Egocentric Interactions

Add code
Sep 29, 2024
Figure 1 for Grounding 3D Scene Affordance From Egocentric Interactions
Figure 2 for Grounding 3D Scene Affordance From Egocentric Interactions
Figure 3 for Grounding 3D Scene Affordance From Egocentric Interactions
Figure 4 for Grounding 3D Scene Affordance From Egocentric Interactions
Viaarxiv icon

PEAR: Phrase-Based Hand-Object Interaction Anticipation

Add code
Jul 31, 2024
Viaarxiv icon

Bidirectional Progressive Transformer for Interaction Intention Anticipation

Add code
May 09, 2024
Viaarxiv icon

Intention-driven Ego-to-Exo Video Generation

Add code
Mar 17, 2024
Figure 1 for Intention-driven Ego-to-Exo Video Generation
Figure 2 for Intention-driven Ego-to-Exo Video Generation
Figure 3 for Intention-driven Ego-to-Exo Video Generation
Figure 4 for Intention-driven Ego-to-Exo Video Generation
Viaarxiv icon

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images

Add code
Dec 14, 2023
Viaarxiv icon