Hongyang Chao

DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion

Jul 17, 2024

TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance

Sep 21, 2023

Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning

Jun 20, 2023

Semantic-Conditional Diffusion Networks for Image Captioning

Dec 06, 2022

Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization

Sep 26, 2022

CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising

Dec 14, 2021

CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

Dec 14, 2021

Searching the Search Space of Vision Transformer

Nov 29, 2021

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers

Nov 05, 2021

Reference-based Defect Detection Network

Aug 10, 2021