Picture for Liangyou Li

Liangyou Li

ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis

Add code
Oct 24, 2024
Viaarxiv icon

Subtle Errors Matter: Preference Learning via Error-injected Self-editing

Add code
Oct 09, 2024
Figure 1 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Figure 2 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Figure 3 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Figure 4 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Viaarxiv icon

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Add code
Oct 07, 2024
Figure 1 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Figure 2 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Figure 3 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Figure 4 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Viaarxiv icon

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

Add code
Aug 14, 2024
Viaarxiv icon

Chain-of-Probe: Examing the Necessity and Accuracy of CoT Step-by-Step

Add code
Jun 23, 2024
Viaarxiv icon

Mitigating Large Language Model Hallucination with Faithful Finetuning

Add code
Jun 17, 2024
Figure 1 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 2 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 3 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 4 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Viaarxiv icon

Learning to Edit: Aligning LLMs with Knowledge Editing

Add code
Feb 19, 2024
Figure 1 for Learning to Edit: Aligning LLMs with Knowledge Editing
Figure 2 for Learning to Edit: Aligning LLMs with Knowledge Editing
Figure 3 for Learning to Edit: Aligning LLMs with Knowledge Editing
Figure 4 for Learning to Edit: Aligning LLMs with Knowledge Editing
Viaarxiv icon

MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models

Add code
Jan 30, 2024
Viaarxiv icon

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Add code
Nov 14, 2023
Viaarxiv icon

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models

Add code
Oct 30, 2023
Viaarxiv icon