Picture for Sumit Jha

Sumit Jha

ACE-RLHF: Automated Code Evaluation and Socratic Feedback Generation Tool using Large Language Models and Reinforcement Learning with Human Feedback

Add code
Apr 07, 2025
Viaarxiv icon

DML-RAM: Deep Multimodal Learning Framework for Robotic Arm Manipulation using Pre-trained Models

Add code
Apr 04, 2025
Viaarxiv icon

Enhanced Penalty-based Bidirectional Reinforcement Learning Algorithms

Add code
Apr 04, 2025
Viaarxiv icon

MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories

Add code
Apr 04, 2025
Viaarxiv icon

NSP: A Neuro-Symbolic Natural Language Navigational Planner

Add code
Sep 10, 2024
Viaarxiv icon

Improving Robustness of Spectrogram Classifiers with Neural Stochastic Differential Equations

Add code
Sep 03, 2024
Viaarxiv icon

Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning

Add code
Oct 29, 2023
Figure 1 for Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
Figure 2 for Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
Figure 3 for Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
Figure 4 for Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
Viaarxiv icon

Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision

Add code
May 31, 2023
Figure 1 for Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision
Figure 2 for Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision
Figure 3 for Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision
Figure 4 for Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision
Viaarxiv icon

On the Robustness of AlphaFold: A COVID-19 Case Study

Add code
Jan 12, 2023
Figure 1 for On the Robustness of AlphaFold: A COVID-19 Case Study
Figure 2 for On the Robustness of AlphaFold: A COVID-19 Case Study
Figure 3 for On the Robustness of AlphaFold: A COVID-19 Case Study
Figure 4 for On the Robustness of AlphaFold: A COVID-19 Case Study
Viaarxiv icon

The Multimodal Driver Monitoring Database: A Naturalistic Corpus to Study Driver Attention

Add code
Dec 23, 2020
Figure 1 for The Multimodal Driver Monitoring Database: A Naturalistic Corpus to Study Driver Attention
Figure 2 for The Multimodal Driver Monitoring Database: A Naturalistic Corpus to Study Driver Attention
Figure 3 for The Multimodal Driver Monitoring Database: A Naturalistic Corpus to Study Driver Attention
Figure 4 for The Multimodal Driver Monitoring Database: A Naturalistic Corpus to Study Driver Attention
Viaarxiv icon