Picture for Guanting Dong

Guanting Dong

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Add code
Oct 30, 2024
Viaarxiv icon

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Add code
Oct 12, 2024
Viaarxiv icon

MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making

Add code
Sep 25, 2024
Viaarxiv icon

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Add code
Sep 05, 2024
Viaarxiv icon

Qwen2 Technical Report

Add code
Jul 16, 2024
Figure 1 for Qwen2 Technical Report
Figure 2 for Qwen2 Technical Report
Figure 3 for Qwen2 Technical Report
Figure 4 for Qwen2 Technical Report
Viaarxiv icon

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Add code
Jul 04, 2024
Viaarxiv icon

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?

Add code
Jul 01, 2024
Viaarxiv icon

Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

Add code
Jun 26, 2024
Viaarxiv icon

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Add code
Jun 19, 2024
Viaarxiv icon

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Add code
Jun 12, 2024
Figure 1 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 2 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 3 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 4 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Viaarxiv icon