Ramayya Krishnan

AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility

Jun 11, 2026
Via arXiv

The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure

May 27, 2026
Via arXiv

Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG

May 27, 2026
Via arXiv

The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning

Mar 30, 2026
Via arXiv

When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models

Mar 23, 2026
Via arXiv

Consistency of Large Reasoning Models Under Multi-Turn Attacks

Feb 16, 2026
Via arXiv

Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning

Feb 10, 2026
Via arXiv

ML Compass: Navigating Capability, Cost, and Compliance Trade-offs in AI Model Deployment

Dec 29, 2025
Via arXiv

Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models

Apr 08, 2025
Via arXiv

Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions

Mar 28, 2025
Via arXiv