Picture for Sarath Chandar

Sarath Chandar

Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination

Add code
Oct 22, 2024
Viaarxiv icon

Toward Debugging Deep Reinforcement Learning Programs with RLExplorer

Add code
Oct 06, 2024
Figure 1 for Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
Figure 2 for Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
Figure 3 for Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
Figure 4 for Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
Viaarxiv icon

Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models

Add code
Aug 16, 2024
Viaarxiv icon

Exploring Quantization for Efficient Pre-Training of Transformer Language Models

Add code
Jul 16, 2024
Figure 1 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Figure 2 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Figure 3 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Figure 4 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Viaarxiv icon

Why Don't Prompt-Based Fairness Metrics Correlate?

Add code
Jun 09, 2024
Figure 1 for Why Don't Prompt-Based Fairness Metrics Correlate?
Figure 2 for Why Don't Prompt-Based Fairness Metrics Correlate?
Figure 3 for Why Don't Prompt-Based Fairness Metrics Correlate?
Figure 4 for Why Don't Prompt-Based Fairness Metrics Correlate?
Viaarxiv icon

A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques

Add code
Jun 07, 2024
Figure 1 for A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques
Figure 2 for A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques
Figure 3 for A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques
Figure 4 for A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques
Viaarxiv icon

BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning

Add code
Jun 06, 2024
Figure 1 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Figure 2 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Figure 3 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Figure 4 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Viaarxiv icon

Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective

Add code
May 24, 2024
Figure 1 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Figure 2 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Figure 3 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Figure 4 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Viaarxiv icon

Interpretability Needs a New Paradigm

Add code
May 08, 2024
Viaarxiv icon

Sub-goal Distillation: A Method to Improve Small Language Agents

Add code
May 04, 2024
Viaarxiv icon