Wenhao Wu

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Dec 24, 2024

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression

Dec 17, 2024

DistinctAD: Distinctive Audio Description Generation in Contexts

Nov 27, 2024

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

Oct 15, 2024

AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories

Oct 10, 2024

Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement

Jun 17, 2024

Dense Connector for MLLMs

May 22, 2024

FreeVA: Offline MLLM as Training-Free Video Assistant

May 13, 2024

Long Context Alignment with Short Instructions and Synthesized Positions

May 07, 2024

Retrieval Head Mechanistically Explains Long-Context Factuality

Apr 24, 2024