Yash Satsangi

An Unsupervised Method for Estimating Class Separability of Datasets with Application to LLMs Fine-Tuning

May 24, 2023
Via arXiv

Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning

Apr 14, 2023
Via arXiv

Topical: Learning Repository Embeddings from Source Code using Attention

Aug 19, 2022
Via arXiv

Learning to Be Cautious

Oct 29, 2021
Via arXiv

Useful Policy Invariant Shaping from Arbitrary Advice

Nov 02, 2020
Via arXiv

Exploiting Submodular Value Functions For Scaling Up Active Perception

Sep 21, 2020
Via arXiv

Real-Time Resource Allocation for Tracking Systems

Sep 21, 2020
Via arXiv

Maximizing Information Gain in Partially Observable Environments via Prediction Reward

May 11, 2020
Via arXiv

Probably Approximately Correct Greedy Maximization

Feb 25, 2016
Via arXiv