Picture for Shaoteng Liu

Shaoteng Liu

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Add code
Mar 27, 2024
Figure 1 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 2 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 3 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 4 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Viaarxiv icon

RL-GPT: Integrating Reinforcement Learning and Code-as-policy

Add code
Feb 29, 2024
Figure 1 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Figure 2 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Figure 3 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Figure 4 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Viaarxiv icon

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Add code
Oct 19, 2023
Viaarxiv icon

Self-supervised Learning by View Synthesis

Add code
Apr 22, 2023
Viaarxiv icon

Video-P2P: Video Editing with Cross-attention Control

Add code
Mar 08, 2023
Viaarxiv icon

Generative Model Watermarking Based on Human Visual System

Add code
Sep 30, 2022
Figure 1 for Generative Model Watermarking Based on Human Visual System
Figure 2 for Generative Model Watermarking Based on Human Visual System
Figure 3 for Generative Model Watermarking Based on Human Visual System
Figure 4 for Generative Model Watermarking Based on Human Visual System
Viaarxiv icon

On-target Adaptation

Add code
Sep 02, 2021
Figure 1 for On-target Adaptation
Figure 2 for On-target Adaptation
Figure 3 for On-target Adaptation
Figure 4 for On-target Adaptation
Viaarxiv icon

Multi-modal Cooking Workflow Construction for Food Recipes

Add code
Aug 20, 2020
Figure 1 for Multi-modal Cooking Workflow Construction for Food Recipes
Figure 2 for Multi-modal Cooking Workflow Construction for Food Recipes
Figure 3 for Multi-modal Cooking Workflow Construction for Food Recipes
Figure 4 for Multi-modal Cooking Workflow Construction for Food Recipes
Viaarxiv icon

GREEN: a Graph REsidual rE-ranking Network for Grading Diabetic Retinopathy

Add code
Jul 21, 2020
Figure 1 for GREEN: a Graph REsidual rE-ranking Network for Grading Diabetic Retinopathy
Figure 2 for GREEN: a Graph REsidual rE-ranking Network for Grading Diabetic Retinopathy
Figure 3 for GREEN: a Graph REsidual rE-ranking Network for Grading Diabetic Retinopathy
Figure 4 for GREEN: a Graph REsidual rE-ranking Network for Grading Diabetic Retinopathy
Viaarxiv icon

Fully Test-time Adaptation by Entropy Minimization

Add code
Jun 18, 2020
Figure 1 for Fully Test-time Adaptation by Entropy Minimization
Figure 2 for Fully Test-time Adaptation by Entropy Minimization
Figure 3 for Fully Test-time Adaptation by Entropy Minimization
Figure 4 for Fully Test-time Adaptation by Entropy Minimization
Viaarxiv icon