Jiaxing Wu

Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward

Apr 04, 2025

Deliberation in Latent Space via Differentiable Cache Augmentation

Dec 23, 2024

RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs

Sep 06, 2024

User-LLM: Efficient LLM Contextualization with User Embeddings

Feb 21, 2024