Picture for Souradip Chakraborty

Souradip Chakraborty

Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts

Add code
Oct 06, 2025
Figure 1 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 2 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 3 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 4 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Viaarxiv icon

MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models

Add code
Oct 02, 2025
Viaarxiv icon

Enhancing Diversity in Large Language Models via Determinantal Point Processes

Add code
Sep 05, 2025
Viaarxiv icon

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

Add code
Jun 04, 2025
Viaarxiv icon

Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time

Add code
May 29, 2025
Viaarxiv icon

Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

Add code
Apr 02, 2025
Viaarxiv icon

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Add code
Mar 27, 2025
Viaarxiv icon

VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences

Add code
Mar 18, 2025
Viaarxiv icon

BalancedDPO: Adaptive Multi-Metric Alignment

Add code
Mar 16, 2025
Viaarxiv icon

Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment

Add code
Jan 07, 2025
Figure 1 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Figure 2 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Figure 3 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Figure 4 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Viaarxiv icon