Picture for Yueqi Zhang

Yueqi Zhang

Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation

Add code
Feb 19, 2025
Viaarxiv icon

From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN

Add code
Feb 19, 2025
Viaarxiv icon

InsBank: Evolving Instruction Subset for Ongoing Alignment

Add code
Feb 17, 2025
Viaarxiv icon

UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization

Add code
Feb 17, 2025
Viaarxiv icon

A Robust Quadruped Robot with Twisting Waist for Flexible Motions

Add code
Oct 08, 2024
Viaarxiv icon

Focused Large Language Models are Stable Many-Shot Learners

Add code
Aug 26, 2024
Figure 1 for Focused Large Language Models are Stable Many-Shot Learners
Figure 2 for Focused Large Language Models are Stable Many-Shot Learners
Figure 3 for Focused Large Language Models are Stable Many-Shot Learners
Figure 4 for Focused Large Language Models are Stable Many-Shot Learners
Viaarxiv icon

Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning

Add code
Aug 24, 2024
Figure 1 for Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
Figure 2 for Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
Figure 3 for Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
Figure 4 for Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
Viaarxiv icon