Hang Yu

Improved Dimension Dependence for Bandit Convex Optimization with Gradient Variations

Feb 04, 2026
Via arXiv

Learning Geometrically-Grounded 3D Visual Representations for View-Generalizable Robotic Manipulation

Jan 30, 2026
Via arXiv

From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion

Jan 15, 2026
Via arXiv

C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling

Dec 24, 2025
Via arXiv

Point What You Mean: Visually Grounded Instruction Policy

Dec 22, 2025
Via arXiv

SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving

Dec 11, 2025
Via arXiv

HYPE: Hybrid Planning with Ego Proposal-Conditioned Predictions

Oct 14, 2025
Via arXiv

F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data

Oct 02, 2025
Via arXiv

FinZero: Launching Multi-modal Financial Time Series Forecast with Large Reasoning Model

Sep 10, 2025
Via arXiv

MMGraphRAG: Bridging Vision and Language with Interpretable Multimodal Knowledge Graphs

Jul 28, 2025
Via arXiv