Picture for Elsie Nallipogu

Elsie Nallipogu

DevBench: A Realistic, Developer-Informed Benchmark for Code Generation Models

Add code
Jan 17, 2026
Viaarxiv icon

Sphinx: Benchmarking and Modeling for LLM-Driven Pull Request Review

Add code
Jan 06, 2026
Viaarxiv icon

SWE-bench Goes Live!

Add code
May 29, 2025
Viaarxiv icon

Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation

Add code
Jan 27, 2025
Figure 1 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 2 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 3 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 4 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Viaarxiv icon

DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale

Add code
Jan 23, 2025
Viaarxiv icon