Picture for Zhoujun Li

Zhoujun Li

DependEval: Benchmarking LLMs for Repository Dependency Understanding

Add code
Mar 09, 2025
Viaarxiv icon

CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation

Add code
Feb 26, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI

Add code
Dec 30, 2024
Figure 1 for UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Figure 2 for UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Figure 3 for UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Figure 4 for UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Viaarxiv icon

MdEval: Massively Multilingual Code Debugging

Add code
Nov 04, 2024
Figure 1 for MdEval: Massively Multilingual Code Debugging
Figure 2 for MdEval: Massively Multilingual Code Debugging
Figure 3 for MdEval: Massively Multilingual Code Debugging
Figure 4 for MdEval: Massively Multilingual Code Debugging
Viaarxiv icon

Intent-Enhanced Data Augmentation for Sequential Recommendation

Add code
Oct 11, 2024
Figure 1 for Intent-Enhanced Data Augmentation for Sequential Recommendation
Figure 2 for Intent-Enhanced Data Augmentation for Sequential Recommendation
Figure 3 for Intent-Enhanced Data Augmentation for Sequential Recommendation
Figure 4 for Intent-Enhanced Data Augmentation for Sequential Recommendation
Viaarxiv icon

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

Add code
Sep 03, 2024
Figure 1 for FuzzCoder: Byte-level Fuzzing Test via Large Language Model
Figure 2 for FuzzCoder: Byte-level Fuzzing Test via Large Language Model
Figure 3 for FuzzCoder: Byte-level Fuzzing Test via Large Language Model
Figure 4 for FuzzCoder: Byte-level Fuzzing Test via Large Language Model
Viaarxiv icon

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Add code
Aug 17, 2024
Viaarxiv icon

RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL

Add code
Jul 11, 2024
Viaarxiv icon