Picture for Youpeng Wen

Youpeng Wen

VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation

Add code
Nov 14, 2024
Viaarxiv icon

CapDet: Unifying Dense Captioning and Open-World Detection Pretraining

Add code
Mar 15, 2023
Figure 1 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Figure 2 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Figure 3 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Figure 4 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Viaarxiv icon

DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection

Add code
Sep 20, 2022
Figure 1 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Figure 2 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Figure 3 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Figure 4 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Viaarxiv icon

Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding

Add code
Jul 19, 2022
Figure 1 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Figure 2 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Figure 3 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Figure 4 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Viaarxiv icon