Wenhao Wu

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

Oct 15, 2024

AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories

Oct 10, 2024

Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement

Jun 17, 2024

Dense Connector for MLLMs

May 22, 2024

FreeVA: Offline MLLM as Training-Free Video Assistant

May 13, 2024

Long Context Alignment with Short Instructions and Synthesized Positions

May 07, 2024

Retrieval Head Mechanistically Explains Long-Context Factuality

Apr 24, 2024

LongEmbed: Extending Embedding Models for Long Context Retrieval

Apr 18, 2024

CoUDA: Coherence Evaluation via Unified Data Augmentation

Mar 31, 2024

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Mar 19, 2024