Yi Wu

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Mar 14, 2025

Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation

Mar 13, 2025

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization

Feb 07, 2025

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Dec 20, 2024

NPC: Neural Predictive Control for Fuel-Efficient Autonomous Trucks

Dec 18, 2024

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Dec 17, 2024

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

Nov 20, 2024

SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging With Neural Networks Based on Ballistocardiograms

Oct 30, 2024

Multi-UAV Behavior-based Formation with Static and Dynamic Obstacles Avoidance via Reinforcement Learning

Oct 24, 2024

Few-shot In-Context Preference Learning Using Large Language Models

Oct 22, 2024