Picture for Ayush Jain

Ayush Jain

Department of Computer Science and Engineering, Indian Institute of Technology Hyderabad, India

Imbalanced Gradients in RL Post-Training of Multi-Task LLMs

Add code
Oct 22, 2025
Viaarxiv icon

Actor-Free Continuous Control via Structurally Maximizable Q-Functions

Add code
Oct 21, 2025
Viaarxiv icon

Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment

Add code
Sep 18, 2025
Viaarxiv icon

Grounded Reinforcement Learning for Visual Reasoning

Add code
May 29, 2025
Viaarxiv icon

polyGen: A Learning Framework for Atomic-level Polymer Structure Generation

Add code
Apr 24, 2025
Viaarxiv icon

Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D

Add code
Apr 19, 2025
Viaarxiv icon

Unifying 2D and 3D Vision-Language Understanding

Add code
Mar 13, 2025
Viaarxiv icon

LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding

Add code
Feb 27, 2025
Viaarxiv icon

Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions

Add code
Oct 15, 2024
Figure 1 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Figure 2 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Figure 3 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Figure 4 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Viaarxiv icon

Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports

Add code
Sep 18, 2024
Figure 1 for Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports
Figure 2 for Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports
Figure 3 for Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports
Figure 4 for Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports
Viaarxiv icon