Picture for Feng Zhang

Feng Zhang

Efficient Paths and Dense Rewards: Probabilistic Flow Reasoning for Large Language Models

Add code
Jan 14, 2026
Viaarxiv icon

UserLM-R1: Modeling Human Reasoning in User Language Models with Multi-Reward Reinforcement Learning

Add code
Jan 14, 2026
Viaarxiv icon

SHIELD: Spherical-Projection Hybrid-Frontier Integration for Efficient LiDAR-based Drone Exploration

Add code
Dec 30, 2025
Viaarxiv icon

ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning

Add code
Dec 15, 2025
Figure 1 for ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning
Figure 2 for ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning
Figure 3 for ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning
Figure 4 for ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning
Viaarxiv icon

Hybrid Attribution Priors for Explainable and Robust Model Training

Add code
Dec 09, 2025
Viaarxiv icon

Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin

Add code
Nov 08, 2025
Figure 1 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Figure 2 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Figure 3 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Figure 4 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Viaarxiv icon

Chain-of-Thought Re-ranking for Image Retrieval Tasks

Add code
Sep 18, 2025
Figure 1 for Chain-of-Thought Re-ranking for Image Retrieval Tasks
Figure 2 for Chain-of-Thought Re-ranking for Image Retrieval Tasks
Figure 3 for Chain-of-Thought Re-ranking for Image Retrieval Tasks
Figure 4 for Chain-of-Thought Re-ranking for Image Retrieval Tasks
Viaarxiv icon

Cross-Layer Vision Smoothing: Enhancing Visual Understanding via Sustained Focus on Key Objects in Large Vision-Language Models

Add code
Sep 16, 2025
Viaarxiv icon

Learning Unpaired Image Dehazing with Physics-based Rehazy Generation

Add code
Jun 15, 2025
Figure 1 for Learning Unpaired Image Dehazing with Physics-based Rehazy Generation
Figure 2 for Learning Unpaired Image Dehazing with Physics-based Rehazy Generation
Figure 3 for Learning Unpaired Image Dehazing with Physics-based Rehazy Generation
Figure 4 for Learning Unpaired Image Dehazing with Physics-based Rehazy Generation
Viaarxiv icon

Optimizing Recall or Relevance? A Multi-Task Multi-Head Approach for Item-to-Item Retrieval in Recommendation

Add code
Jun 06, 2025
Viaarxiv icon