Yisheng Lv

Context-Aware Probabilistic Modeling with LLM for Multimodal Time Series Forecasting

May 16, 2025

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

Apr 21, 2025

Offline Reinforcement Learning with Discrete Diffusion Skills

Mar 26, 2025

Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving

Mar 09, 2025

UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility

Jan 04, 2025

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Oct 01, 2024

MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving

Sep 11, 2024

MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering

Aug 21, 2024

Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models

May 08, 2024

SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

Mar 20, 2024