Weiran Xu

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Sep 05, 2024

SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models

Aug 05, 2024

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Jun 12, 2024

HFT: Half Fine-Tuning for Large Language Models

Apr 29, 2024

DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations

Mar 31, 2024

Faceptor: A Generalist Model for Face Perception

Mar 14, 2024

Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task

Mar 06, 2024

Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection

Mar 04, 2024

BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses

Mar 02, 2024

PreAct: Predicting Future in ReAct Enhances Agent's Planning Ability

Feb 18, 2024