Picture for Wendi Li

Wendi Li

Free Process Rewards without Process Labels

Add code
Dec 02, 2024
Viaarxiv icon

FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks

Add code
Oct 28, 2024
Viaarxiv icon

Process Reward Model with Q-Value Rankings

Add code
Oct 15, 2024
Viaarxiv icon

Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue

Add code
Jun 04, 2024
Figure 1 for Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue
Figure 2 for Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue
Figure 3 for Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue
Figure 4 for Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue
Viaarxiv icon

Reinforcement Learning with Token-level Feedback for Controllable Text Generation

Add code
Mar 18, 2024
Figure 1 for Reinforcement Learning with Token-level Feedback for Controllable Text Generation
Figure 2 for Reinforcement Learning with Token-level Feedback for Controllable Text Generation
Figure 3 for Reinforcement Learning with Token-level Feedback for Controllable Text Generation
Figure 4 for Reinforcement Learning with Token-level Feedback for Controllable Text Generation
Viaarxiv icon

TREA: Tree-Structure Reasoning Schema for Conversational Recommendation

Add code
Jul 20, 2023
Figure 1 for TREA: Tree-Structure Reasoning Schema for Conversational Recommendation
Figure 2 for TREA: Tree-Structure Reasoning Schema for Conversational Recommendation
Figure 3 for TREA: Tree-Structure Reasoning Schema for Conversational Recommendation
Figure 4 for TREA: Tree-Structure Reasoning Schema for Conversational Recommendation
Viaarxiv icon

Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning

Add code
May 04, 2023
Viaarxiv icon

DDG-DA: Data Distribution Generation for Predictable Concept Drift Adaptation

Add code
Jan 11, 2022
Figure 1 for DDG-DA: Data Distribution Generation for Predictable Concept Drift Adaptation
Figure 2 for DDG-DA: Data Distribution Generation for Predictable Concept Drift Adaptation
Figure 3 for DDG-DA: Data Distribution Generation for Predictable Concept Drift Adaptation
Figure 4 for DDG-DA: Data Distribution Generation for Predictable Concept Drift Adaptation
Viaarxiv icon