Shiqi Gao

Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs

Jun 28, 2024
Via arXiv

Quality-guided Skin Tone Enhancement for Portrait Photography

Jun 22, 2024
Via arXiv

MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery

Feb 29, 2024
Via arXiv

SSGD: A safe and efficient method of gradient descent

Dec 03, 2020
Via arXiv

Building a Computer Mahjong Player via Deep Convolutional Neural Networks

Jun 07, 2019
Via arXiv