Picture for Saravan Rajmohan

Saravan Rajmohan

Minerva: A Programmable Memory Test Benchmark for Language Models

Add code
Feb 05, 2025
Viaarxiv icon

MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf

Add code
Feb 05, 2025
Viaarxiv icon

Enabling Autonomic Microservice Management through Self-Learning Agents

Add code
Jan 31, 2025
Figure 1 for Enabling Autonomic Microservice Management through Self-Learning Agents
Figure 2 for Enabling Autonomic Microservice Management through Self-Learning Agents
Figure 3 for Enabling Autonomic Microservice Management through Self-Learning Agents
Figure 4 for Enabling Autonomic Microservice Management through Self-Learning Agents
Viaarxiv icon

Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation

Add code
Jan 27, 2025
Figure 1 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 2 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 3 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 4 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Viaarxiv icon

DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale

Add code
Jan 23, 2025
Viaarxiv icon

AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds

Add code
Jan 12, 2025
Figure 1 for AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds
Figure 2 for AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds
Figure 3 for AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds
Figure 4 for AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds
Viaarxiv icon

WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models

Add code
Dec 23, 2024
Viaarxiv icon

REFA: Reference Free Alignment for multi-preference optimization

Add code
Dec 20, 2024
Viaarxiv icon

Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval

Add code
Dec 15, 2024
Viaarxiv icon

Large Action Models: From Inception to Implementation

Add code
Dec 13, 2024
Figure 1 for Large Action Models: From Inception to Implementation
Figure 2 for Large Action Models: From Inception to Implementation
Figure 3 for Large Action Models: From Inception to Implementation
Figure 4 for Large Action Models: From Inception to Implementation
Viaarxiv icon