Picture for Subhabrata Dutta

Subhabrata Dutta

Many Circuits, One Mechanism: Input Variation and Evaluation Granularity in Circuit Discovery

Add code
Jun 04, 2026
Viaarxiv icon

Co-FactChecker: A Framework for Human-AI Collaborative Claim Verification Using Large Reasoning Models

Add code
Apr 15, 2026
Viaarxiv icon

Patches of Nonlinearity: Instruction Vectors in Large Language Models

Add code
Feb 08, 2026
Viaarxiv icon

Reward Modeling for Scientific Writing Evaluation

Add code
Jan 16, 2026
Viaarxiv icon

Expert Preference-based Evaluation of Automated Related Work Generation

Add code
Aug 11, 2025
Viaarxiv icon

Factual Self-Awareness in Language Models: Representation, Robustness, and Scaling

Add code
May 27, 2025
Viaarxiv icon

Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning

Add code
May 16, 2025
Viaarxiv icon

Mechanistic Behavior Editing of Language Models

Add code
Oct 05, 2024
Viaarxiv icon

Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators

Add code
Sep 21, 2024
Viaarxiv icon

Problem Solving Through Human-AI Preference-Based Cooperation

Add code
Aug 15, 2024
Figure 1 for Problem Solving Through Human-AI Preference-Based Cooperation
Figure 2 for Problem Solving Through Human-AI Preference-Based Cooperation
Figure 3 for Problem Solving Through Human-AI Preference-Based Cooperation
Viaarxiv icon