Yutao Zhu

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning

Nov 06, 2024

Little Giants: Synthesizing High-Quality Embedding Data at Scale

Oct 24, 2024

A Survey of Conversational Search

Oct 21, 2024

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Oct 12, 2024

Root Defence Strategies: Ensuring Safety of LLM at the Decoding Level

Oct 09, 2024

From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models

Oct 09, 2024

LLMs + Persona-Plug = Personalized LLMs

Sep 18, 2024

FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving

Aug 13, 2024

Towards Effective and Efficient Continual Pre-training of Large Language Models

Jul 26, 2024

Query-oriented Data Augmentation for Session Search

Jul 04, 2024