Picture for Tao Gui

Tao Gui

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

Add code
Mar 19, 2025
Viaarxiv icon

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Add code
Mar 09, 2025
Viaarxiv icon

Better Process Supervision with Bi-directional Rewarding Signals

Add code
Mar 06, 2025
Viaarxiv icon

CritiQ: Mining Data Quality Criteria from Human Preferences

Add code
Feb 26, 2025
Viaarxiv icon

Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric

Add code
Feb 25, 2025
Viaarxiv icon

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Add code
Feb 20, 2025
Viaarxiv icon

Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models

Add code
Feb 13, 2025
Viaarxiv icon

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training

Add code
Feb 06, 2025
Viaarxiv icon

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Add code
Jan 07, 2025
Viaarxiv icon

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Add code
Dec 20, 2024
Figure 1 for TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
Figure 2 for TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
Figure 3 for TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
Figure 4 for TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
Viaarxiv icon