Lin Shi

Jack

How Different AI Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games

Dec 16, 2024

Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts

Dec 05, 2024

CodePurify: Defend Backdoor Attacks on Neural Code Models via Entropy-based Purification

Oct 26, 2024

PatUntrack: Automated Generating Patch Examples for Issue Reports without Tracked Insecure Code

Aug 16, 2024

Judging the Judges: A Systematic Investigation of Position Bias in Pairwise Comparative Assessments by LLMs

Jun 12, 2024

Exploring and Evaluating Hallucinations in LLM-Powered Code Generation

Apr 01, 2024

DevBench: A Comprehensive Benchmark for Software Development

Mar 15, 2024

FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models

Mar 12, 2024

A Survey on Query-based API Recommendation

Dec 21, 2023

Curriculum Learning for Relative Overgeneralization

Dec 06, 2022