Picture for Hao Yang

Hao Yang

PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks

Add code
Apr 12, 2025
Viaarxiv icon

Kimi-VL Technical Report

Add code
Apr 10, 2025
Viaarxiv icon

Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement

Add code
Apr 08, 2025
Viaarxiv icon

DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation

Add code
Apr 07, 2025
Viaarxiv icon

MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX

Add code
Mar 27, 2025
Viaarxiv icon

Adaptive Weighted Parameter Fusion with CLIP for Class-Incremental Learning

Add code
Mar 25, 2025
Viaarxiv icon

Sensorless Remote Center of Motion Misalignment Estimation

Add code
Mar 17, 2025
Viaarxiv icon

Improving LLM-based Document-level Machine Translation with Multi-Knowledge Fusion

Add code
Mar 15, 2025
Viaarxiv icon

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Add code
Mar 12, 2025
Viaarxiv icon

Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos

Add code
Feb 28, 2025
Viaarxiv icon