Picture for Chetan Bansal

Chetan Bansal

AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds

Add code
Jan 12, 2025
Viaarxiv icon

REFA: Reference Free Alignment for multi-preference optimization

Add code
Dec 20, 2024
Viaarxiv icon

SWEPO: Simultaneous Weighted Preference Optimization for Group Contrastive Alignment

Add code
Dec 05, 2024
Viaarxiv icon

Ensuring Fair LLM Serving Amid Diverse Applications

Add code
Nov 24, 2024
Figure 1 for Ensuring Fair LLM Serving Amid Diverse Applications
Figure 2 for Ensuring Fair LLM Serving Amid Diverse Applications
Figure 3 for Ensuring Fair LLM Serving Amid Diverse Applications
Figure 4 for Ensuring Fair LLM Serving Amid Diverse Applications
Viaarxiv icon

Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC

Add code
Nov 11, 2024
Viaarxiv icon

Unveiling Context-Aware Criteria in Self-Assessing LLMs

Add code
Oct 28, 2024
Figure 1 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Figure 2 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Figure 3 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Figure 4 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Viaarxiv icon

CREAM: Consistency Regularized Self-Rewarding Language Models

Add code
Oct 17, 2024
Figure 1 for CREAM: Consistency Regularized Self-Rewarding Language Models
Figure 2 for CREAM: Consistency Regularized Self-Rewarding Language Models
Figure 3 for CREAM: Consistency Regularized Self-Rewarding Language Models
Figure 4 for CREAM: Consistency Regularized Self-Rewarding Language Models
Viaarxiv icon

Building AI Agents for Autonomous Clouds: Challenges and Design Principles

Add code
Jul 16, 2024
Figure 1 for Building AI Agents for Autonomous Clouds: Challenges and Design Principles
Figure 2 for Building AI Agents for Autonomous Clouds: Challenges and Design Principles
Figure 3 for Building AI Agents for Autonomous Clouds: Challenges and Design Principles
Viaarxiv icon

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Add code
Jun 10, 2024
Figure 1 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 2 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 3 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 4 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Viaarxiv icon

Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection

Add code
May 24, 2024
Viaarxiv icon