Picture for Hongyang Chao

Hongyang Chao

Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention

Add code
Nov 28, 2024
Viaarxiv icon

DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion

Add code
Jul 17, 2024
Figure 1 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 2 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 3 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 4 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Viaarxiv icon

TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance

Add code
Sep 21, 2023
Viaarxiv icon

Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning

Add code
Jun 20, 2023
Figure 1 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Figure 2 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Figure 3 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Figure 4 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Viaarxiv icon

Semantic-Conditional Diffusion Networks for Image Captioning

Add code
Dec 06, 2022
Viaarxiv icon

Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization

Add code
Sep 26, 2022
Figure 1 for Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization
Figure 2 for Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization
Figure 3 for Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization
Figure 4 for Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization
Viaarxiv icon

CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising

Add code
Dec 14, 2021
Figure 1 for CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising
Figure 2 for CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising
Figure 3 for CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising
Figure 4 for CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising
Viaarxiv icon

CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

Add code
Dec 14, 2021
Figure 1 for CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning
Figure 2 for CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning
Figure 3 for CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning
Figure 4 for CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning
Viaarxiv icon

Searching the Search Space of Vision Transformer

Add code
Nov 29, 2021
Figure 1 for Searching the Search Space of Vision Transformer
Figure 2 for Searching the Search Space of Vision Transformer
Figure 3 for Searching the Search Space of Vision Transformer
Figure 4 for Searching the Search Space of Vision Transformer
Viaarxiv icon

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers

Add code
Nov 05, 2021
Figure 1 for Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
Figure 2 for Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
Figure 3 for Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
Figure 4 for Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
Viaarxiv icon