Picture for Jian Wang

Jian Wang

Top-Down Semantic Refinement for Image Captioning

Add code
Oct 25, 2025
Viaarxiv icon

Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation

Add code
Oct 16, 2025
Viaarxiv icon

USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots

Add code
Oct 09, 2025
Viaarxiv icon

FocusMed: A Large Language Model-based Framework for Enhancing Medical Question Summarization with Focus Identification

Add code
Oct 06, 2025
Viaarxiv icon

Towards Better Optimization For Listwise Preference in Diffusion Models

Add code
Oct 02, 2025
Viaarxiv icon

Signed Graph Learning with Hidden Nodes

Add code
Sep 11, 2025
Viaarxiv icon

Rethinking Domain-Specific LLM Benchmark Construction: A Comprehensiveness-Compactness Approach

Add code
Aug 13, 2025
Viaarxiv icon

DIVER: A Multi-Stage Approach for Reasoning-intensive Information Retrieval

Add code
Aug 12, 2025
Viaarxiv icon

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Add code
Aug 11, 2025
Viaarxiv icon

GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving

Add code
Aug 08, 2025
Viaarxiv icon