Picture for Jialun Cao

Jialun Cao

From Informal to Formal -- Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs

Add code
Jan 27, 2025
Viaarxiv icon

How Should I Build A Benchmark?

Add code
Jan 18, 2025
Viaarxiv icon

ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation

Add code
Dec 24, 2024
Viaarxiv icon

CODECLEANER: Elevating Standards with A Robust Data Contamination Mitigation Toolkit

Add code
Nov 16, 2024
Viaarxiv icon

CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution

Add code
Aug 23, 2024
Figure 1 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 2 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 3 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 4 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Viaarxiv icon

DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation

Add code
Aug 23, 2024
Viaarxiv icon

DLLens: Testing Deep Learning Libraries via LLM-aided Synthesis

Add code
Jun 12, 2024
Figure 1 for DLLens: Testing Deep Learning Libraries via LLM-aided Synthesis
Figure 2 for DLLens: Testing Deep Learning Libraries via LLM-aided Synthesis
Figure 3 for DLLens: Testing Deep Learning Libraries via LLM-aided Synthesis
Figure 4 for DLLens: Testing Deep Learning Libraries via LLM-aided Synthesis
Viaarxiv icon

Can AI Beat Undergraduates in Entry-level Java Assignments? Benchmarking Large Language Models on JavaBench

Add code
Jun 10, 2024
Viaarxiv icon

MEMO: Coverage-guided Model Generation For Deep Learning Library Testing

Add code
Aug 02, 2022
Figure 1 for MEMO: Coverage-guided Model Generation For Deep Learning Library Testing
Figure 2 for MEMO: Coverage-guided Model Generation For Deep Learning Library Testing
Figure 3 for MEMO: Coverage-guided Model Generation For Deep Learning Library Testing
Figure 4 for MEMO: Coverage-guided Model Generation For Deep Learning Library Testing
Viaarxiv icon

DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs

Add code
May 04, 2022
Figure 1 for DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs
Figure 2 for DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs
Figure 3 for DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs
Figure 4 for DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs
Viaarxiv icon