Picture for Ge Zhang

Ge Zhang

FullStack Bench: Evaluating LLMs as Full Stack Coders

Add code
Dec 03, 2024
Viaarxiv icon

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Add code
Dec 02, 2024
Viaarxiv icon

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Add code
Nov 11, 2024
Figure 1 for OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Figure 2 for OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Figure 3 for OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Figure 4 for OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Viaarxiv icon

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Add code
Nov 07, 2024
Figure 1 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 2 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 3 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 4 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Viaarxiv icon

MdEval: Massively Multilingual Code Debugging

Add code
Nov 04, 2024
Figure 1 for MdEval: Massively Multilingual Code Debugging
Figure 2 for MdEval: Massively Multilingual Code Debugging
Figure 3 for MdEval: Massively Multilingual Code Debugging
Figure 4 for MdEval: Massively Multilingual Code Debugging
Viaarxiv icon

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Add code
Oct 29, 2024
Figure 1 for AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
Figure 2 for AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
Figure 3 for AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
Figure 4 for AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
Viaarxiv icon

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation

Add code
Oct 28, 2024
Figure 1 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Figure 2 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Figure 3 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Figure 4 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Viaarxiv icon

Sparse Decomposition of Graph Neural Networks

Add code
Oct 25, 2024
Figure 1 for Sparse Decomposition of Graph Neural Networks
Figure 2 for Sparse Decomposition of Graph Neural Networks
Figure 3 for Sparse Decomposition of Graph Neural Networks
Figure 4 for Sparse Decomposition of Graph Neural Networks
Viaarxiv icon

Can MLLMs Understand the Deep Implication Behind Chinese Images?

Add code
Oct 17, 2024
Figure 1 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Figure 2 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Figure 3 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Figure 4 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Viaarxiv icon

CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

Add code
Oct 17, 2024
Figure 1 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 2 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 3 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 4 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Viaarxiv icon