Picture for Xiaojiang Liu

Xiaojiang Liu

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs

Add code
Oct 18, 2024
Viaarxiv icon

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

Add code
Oct 06, 2024
Figure 1 for TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights
Figure 2 for TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights
Figure 3 for TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights
Figure 4 for TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights
Viaarxiv icon

Exploring Format Consistency for Instruction Tuning

Add code
Jul 28, 2023
Viaarxiv icon

Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation

Add code
Jun 06, 2022
Figure 1 for Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
Figure 2 for Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
Figure 3 for Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
Figure 4 for Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
Viaarxiv icon

Dialogue Generation on Infrequent Sentence Functions via Structured Meta-Learning

Add code
Oct 04, 2020
Figure 1 for Dialogue Generation on Infrequent Sentence Functions via Structured Meta-Learning
Figure 2 for Dialogue Generation on Infrequent Sentence Functions via Structured Meta-Learning
Figure 3 for Dialogue Generation on Infrequent Sentence Functions via Structured Meta-Learning
Figure 4 for Dialogue Generation on Infrequent Sentence Functions via Structured Meta-Learning
Viaarxiv icon

Profile Consistency Identification for Open-domain Dialogue Agents

Add code
Sep 30, 2020
Figure 1 for Profile Consistency Identification for Open-domain Dialogue Agents
Figure 2 for Profile Consistency Identification for Open-domain Dialogue Agents
Figure 3 for Profile Consistency Identification for Open-domain Dialogue Agents
Figure 4 for Profile Consistency Identification for Open-domain Dialogue Agents
Viaarxiv icon

Enhancing Dialogue Generation via Multi-Level Contrastive Learning

Add code
Sep 19, 2020
Figure 1 for Enhancing Dialogue Generation via Multi-Level Contrastive Learning
Figure 2 for Enhancing Dialogue Generation via Multi-Level Contrastive Learning
Figure 3 for Enhancing Dialogue Generation via Multi-Level Contrastive Learning
Figure 4 for Enhancing Dialogue Generation via Multi-Level Contrastive Learning
Viaarxiv icon

Interpretable Real-Time Win Prediction for Honor of Kings, a Popular Mobile MOBA Esport

Add code
Sep 04, 2020
Figure 1 for Interpretable Real-Time Win Prediction for Honor of Kings, a Popular Mobile MOBA Esport
Figure 2 for Interpretable Real-Time Win Prediction for Honor of Kings, a Popular Mobile MOBA Esport
Figure 3 for Interpretable Real-Time Win Prediction for Honor of Kings, a Popular Mobile MOBA Esport
Figure 4 for Interpretable Real-Time Win Prediction for Honor of Kings, a Popular Mobile MOBA Esport
Viaarxiv icon

A Batch Normalized Inference Network Keeps the KL Vanishing Away

Add code
Jun 01, 2020
Figure 1 for A Batch Normalized Inference Network Keeps the KL Vanishing Away
Figure 2 for A Batch Normalized Inference Network Keeps the KL Vanishing Away
Figure 3 for A Batch Normalized Inference Network Keeps the KL Vanishing Away
Figure 4 for A Batch Normalized Inference Network Keeps the KL Vanishing Away
Viaarxiv icon

Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation

Add code
May 13, 2020
Figure 1 for Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation
Figure 2 for Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation
Figure 3 for Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation
Figure 4 for Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation
Viaarxiv icon