Picture for Lin Shi

Lin Shi

Jack

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Add code
Jan 17, 2026
Viaarxiv icon

Addressing Overthinking in Large Vision-Language Models via Gated Perception-Reasoning Optimization

Add code
Jan 07, 2026
Viaarxiv icon

ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models

Add code
May 26, 2025
Figure 1 for ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models
Figure 2 for ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models
Figure 3 for ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models
Figure 4 for ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models
Viaarxiv icon

Judging with Many Minds: Do More Perspectives Mean Less Prejudice?

Add code
May 26, 2025
Viaarxiv icon

Detection of Disease on Nasal Breath Sound by New Lightweight Architecture: Using COVID-19 as An Example

Add code
Apr 01, 2025
Figure 1 for Detection of Disease on Nasal Breath Sound by New Lightweight Architecture: Using COVID-19 as An Example
Figure 2 for Detection of Disease on Nasal Breath Sound by New Lightweight Architecture: Using COVID-19 as An Example
Figure 3 for Detection of Disease on Nasal Breath Sound by New Lightweight Architecture: Using COVID-19 as An Example
Figure 4 for Detection of Disease on Nasal Breath Sound by New Lightweight Architecture: Using COVID-19 as An Example
Viaarxiv icon

Vulnerability of Text-to-Image Models to Prompt Template Stealing: A Differential Evolution Approach

Add code
Feb 20, 2025
Viaarxiv icon

How Different AI Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games

Add code
Dec 16, 2024
Viaarxiv icon

Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts

Add code
Dec 05, 2024
Figure 1 for Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts
Figure 2 for Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts
Figure 3 for Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts
Figure 4 for Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts
Viaarxiv icon

CodePurify: Defend Backdoor Attacks on Neural Code Models via Entropy-based Purification

Add code
Oct 26, 2024
Figure 1 for CodePurify: Defend Backdoor Attacks on Neural Code Models via Entropy-based Purification
Figure 2 for CodePurify: Defend Backdoor Attacks on Neural Code Models via Entropy-based Purification
Figure 3 for CodePurify: Defend Backdoor Attacks on Neural Code Models via Entropy-based Purification
Figure 4 for CodePurify: Defend Backdoor Attacks on Neural Code Models via Entropy-based Purification
Viaarxiv icon

PatUntrack: Automated Generating Patch Examples for Issue Reports without Tracked Insecure Code

Add code
Aug 16, 2024
Viaarxiv icon