Joshua Greaves

Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs

Mar 19, 2025
Via arXiv

Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

Nov 21, 2023
Via arXiv

Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks

Apr 25, 2023
Via arXiv

A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces

Dec 08, 2022
Via arXiv

Multi-path Neural Networks for On-device Multi-domain Visual Classification

Oct 10, 2020
Via arXiv