Lifeng Shang

DAST: Difficulty-Aware Self-Training on Large Language Models

Mar 12, 2025
Via arXiv

Learning to Align Multi-Faceted Evaluation: A Unified and Robust Framework

Feb 26, 2025
Via arXiv

DocPuzzle: A Process-Aware Benchmark for Evaluating Realistic Long-Context Reasoning Capabilities

Feb 25, 2025
Via arXiv

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Feb 18, 2025
Via arXiv

Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving

Feb 17, 2025
Via arXiv

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression

Dec 17, 2024
Via arXiv

ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis

Oct 24, 2024
Via arXiv

Subtle Errors Matter: Preference Learning via Error-injected Self-editing

Oct 09, 2024
Via arXiv

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Oct 07, 2024
Via arXiv

Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape

Sep 22, 2024
Via arXiv