Picture for Zhiyuan Fang

Zhiyuan Fang

Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

Add code
Mar 25, 2024
Viaarxiv icon

End-to-end Knowledge Retrieval with Multi-modal Queries

Add code
Jun 01, 2023
Viaarxiv icon

Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation

Add code
Nov 15, 2022
Viaarxiv icon

Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos

Add code
Apr 28, 2022
Figure 1 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Figure 2 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Figure 3 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Figure 4 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Viaarxiv icon

Injecting Semantic Concepts into End-to-End Image Captioning

Add code
Dec 09, 2021
Figure 1 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 2 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 3 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 4 for Injecting Semantic Concepts into End-to-End Image Captioning
Viaarxiv icon

Compressing Visual-linguistic Model via Knowledge Distillation

Add code
Apr 05, 2021
Figure 1 for Compressing Visual-linguistic Model via Knowledge Distillation
Figure 2 for Compressing Visual-linguistic Model via Knowledge Distillation
Figure 3 for Compressing Visual-linguistic Model via Knowledge Distillation
Figure 4 for Compressing Visual-linguistic Model via Knowledge Distillation
Viaarxiv icon

SEED: Self-supervised Distillation For Visual Representation

Add code
Jan 12, 2021
Figure 1 for SEED: Self-supervised Distillation For Visual Representation
Figure 2 for SEED: Self-supervised Distillation For Visual Representation
Figure 3 for SEED: Self-supervised Distillation For Visual Representation
Figure 4 for SEED: Self-supervised Distillation For Visual Representation
Viaarxiv icon

Weak Supervision and Referring Attention for Temporal-Textual Association Learning

Add code
Jun 27, 2020
Figure 1 for Weak Supervision and Referring Attention for Temporal-Textual Association Learning
Figure 2 for Weak Supervision and Referring Attention for Temporal-Textual Association Learning
Figure 3 for Weak Supervision and Referring Attention for Temporal-Textual Association Learning
Figure 4 for Weak Supervision and Referring Attention for Temporal-Textual Association Learning
Viaarxiv icon

HRDNet: High-resolution Detection Network for Small Objects

Add code
Jun 13, 2020
Figure 1 for HRDNet: High-resolution Detection Network for Small Objects
Figure 2 for HRDNet: High-resolution Detection Network for Small Objects
Figure 3 for HRDNet: High-resolution Detection Network for Small Objects
Figure 4 for HRDNet: High-resolution Detection Network for Small Objects
Viaarxiv icon

ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language

Add code
May 15, 2020
Figure 1 for ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
Figure 2 for ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
Figure 3 for ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
Figure 4 for ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
Viaarxiv icon