Dong Yin

SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF

Nov 04, 2024

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo

Oct 02, 2024

Apple Intelligence Foundation Language Models

Jul 29, 2024

MindSet: Vision. A toolbox for testing DNNs on key psychological experiments

Apr 08, 2024

Convolutional Neural Networks Trained to Identify Words Provide a Good Account of Visual Form Priming Effects

Mar 02, 2023

Sample Efficient Deep Reinforcement Learning via Local Planning

Jan 29, 2023

Architecture Matters in Continual Learning

Feb 01, 2022

MORSE-STF: A Privacy Preserving Computation System

Sep 24, 2021

Efficient Local Planning with Linear Function Approximation

Aug 12, 2021

A Realistic Simulation Framework for Learning with Label Noise

Jul 23, 2021