Picture for Chuyi Tan

Chuyi Tan

Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation

Add code
Feb 19, 2025
Viaarxiv icon

From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN

Add code
Feb 19, 2025
Viaarxiv icon

UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization

Add code
Feb 17, 2025
Viaarxiv icon

InsBank: Evolving Instruction Subset for Ongoing Alignment

Add code
Feb 17, 2025
Viaarxiv icon

Focused Large Language Models are Stable Many-Shot Learners

Add code
Aug 26, 2024
Figure 1 for Focused Large Language Models are Stable Many-Shot Learners
Figure 2 for Focused Large Language Models are Stable Many-Shot Learners
Figure 3 for Focused Large Language Models are Stable Many-Shot Learners
Figure 4 for Focused Large Language Models are Stable Many-Shot Learners
Viaarxiv icon