Picture for Yuchen Tian

Yuchen Tian

CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?

Add code
Aug 20, 2024
Viaarxiv icon

CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification

Add code
Apr 30, 2024
Viaarxiv icon

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

Add code
Apr 15, 2024
Figure 1 for MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems
Figure 2 for MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems
Figure 3 for MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems
Figure 4 for MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems
Viaarxiv icon

Token Alignment via Character Matching for Subword Completion

Add code
Mar 13, 2024
Figure 1 for Token Alignment via Character Matching for Subword Completion
Figure 2 for Token Alignment via Character Matching for Subword Completion
Figure 3 for Token Alignment via Character Matching for Subword Completion
Figure 4 for Token Alignment via Character Matching for Subword Completion
Viaarxiv icon

CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation

Add code
Oct 08, 2023
Viaarxiv icon

A Static Evaluation of Code Completion by Large Language Models

Add code
Jun 05, 2023
Figure 1 for A Static Evaluation of Code Completion by Large Language Models
Figure 2 for A Static Evaluation of Code Completion by Large Language Models
Figure 3 for A Static Evaluation of Code Completion by Large Language Models
Figure 4 for A Static Evaluation of Code Completion by Large Language Models
Viaarxiv icon

Greener yet Powerful: Taming Large Code Generation Models with Quantization

Add code
Mar 09, 2023
Figure 1 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Figure 2 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Figure 3 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Figure 4 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Viaarxiv icon

Multi-lingual Evaluation of Code Generation Models

Add code
Oct 26, 2022
Viaarxiv icon