Yibo Miao

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Jan 03, 2025
Via arXiv

PowerMLP: An Efficient Version of KAN

Dec 18, 2024
Via arXiv

ExecRepoBench: Multi-level Executable Code Completion Evaluation

Dec 16, 2024
Via arXiv

Evaluating and Aligning CodeLLMs on Human Preference

Dec 06, 2024
Via arXiv

Generalizability of Memorization Neural Networks

Nov 01, 2024
Via arXiv

Aligning CodeLLMs with Direct Preference Optimization

Oct 24, 2024
Via arXiv

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Oct 10, 2024
Via arXiv

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Sep 04, 2024
Via arXiv

T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models

Jul 08, 2024
Via arXiv

AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models

Jun 19, 2024
Via arXiv