Picture for Dong Yu

Dong Yu

Save the Good Prefix: Precise Error Penalization via Process-Supervised RL to Enhance LLM Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-channel Multi-speaker ASR on Arbitrary Microphone Arrays

Add code
Jan 25, 2026
Viaarxiv icon

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Add code
Jan 22, 2026
Viaarxiv icon

Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects

Add code
Jan 12, 2026
Viaarxiv icon

A Versatile Multimodal Agent for Multimedia Content Generation

Add code
Jan 06, 2026
Viaarxiv icon

PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

Add code
Dec 30, 2025
Viaarxiv icon

Stable and Efficient Single-Rollout RL for Multimodal Reasoning

Add code
Dec 20, 2025
Figure 1 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Figure 2 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Figure 3 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Figure 4 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Viaarxiv icon

RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing

Add code
Dec 18, 2025
Figure 1 for RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing
Figure 2 for RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing
Figure 3 for RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing
Figure 4 for RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing
Viaarxiv icon

N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

Add code
Dec 18, 2025
Viaarxiv icon

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Add code
Dec 17, 2025
Figure 1 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Figure 2 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Figure 3 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Figure 4 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Viaarxiv icon