Picture for Davit Soselia

Davit Soselia

OPTune: Efficient Online Preference Tuning

Add code
Jun 11, 2024
Figure 1 for OPTune: Efficient Online Preference Tuning
Figure 2 for OPTune: Efficient Online Preference Tuning
Figure 3 for OPTune: Efficient Online Preference Tuning
Figure 4 for OPTune: Efficient Online Preference Tuning
Viaarxiv icon

ODIN: Disentangled Reward Mitigates Hacking in RLHF

Add code
Feb 11, 2024
Viaarxiv icon

Reviving Shift Equivariance in Vision Transformers

Add code
Jun 13, 2023
Viaarxiv icon

Reinforcement Learning finetuned Vision-Code Transformer for UI-to-Code Generation

Add code
May 24, 2023
Viaarxiv icon

RNN-based Online Handwritten Character Recognition Using Accelerometer and Gyroscope Data

Add code
Jul 24, 2019
Figure 1 for RNN-based Online Handwritten Character Recognition Using Accelerometer and Gyroscope Data
Figure 2 for RNN-based Online Handwritten Character Recognition Using Accelerometer and Gyroscope Data
Viaarxiv icon

Reproduction Report on "Learn to Pay Attention"

Add code
Dec 11, 2018
Figure 1 for Reproduction Report on "Learn to Pay Attention"
Figure 2 for Reproduction Report on "Learn to Pay Attention"
Viaarxiv icon