Picture for Zhiheng Xi

Zhiheng Xi

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

Add code
Mar 19, 2025
Viaarxiv icon

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Add code
Mar 09, 2025
Viaarxiv icon

Better Process Supervision with Bi-directional Rewarding Signals

Add code
Mar 06, 2025
Viaarxiv icon

EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection

Add code
Mar 03, 2025
Viaarxiv icon

CritiQ: Mining Data Quality Criteria from Human Preferences

Add code
Feb 26, 2025
Viaarxiv icon

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training

Add code
Feb 06, 2025
Viaarxiv icon

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Add code
Jan 20, 2025
Viaarxiv icon

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Add code
Jan 07, 2025
Viaarxiv icon

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Add code
Nov 25, 2024
Figure 1 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 2 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 3 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 4 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Viaarxiv icon

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

Add code
Nov 01, 2024
Viaarxiv icon