Picture for Yukun Zhu

Yukun Zhu

Neptune: The Long Orbit to Benchmarking Long Video Understanding

Add code
Dec 12, 2024
Viaarxiv icon

Fine-grained Controllable Video Generation via Object Appearance and Context

Add code
Dec 05, 2023
Viaarxiv icon

Video Summarization: Towards Entity-Aware Captions

Add code
Dec 01, 2023
Figure 1 for Video Summarization: Towards Entity-Aware Captions
Figure 2 for Video Summarization: Towards Entity-Aware Captions
Figure 3 for Video Summarization: Towards Entity-Aware Captions
Figure 4 for Video Summarization: Towards Entity-Aware Captions
Viaarxiv icon

Superpixel Transformers for Efficient Semantic Segmentation

Add code
Oct 02, 2023
Viaarxiv icon

MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models

Add code
Oct 04, 2022
Figure 1 for MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Figure 2 for MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Figure 3 for MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Figure 4 for MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Viaarxiv icon

k-means Mask Transformer

Add code
Jul 08, 2022
Figure 1 for k-means Mask Transformer
Figure 2 for k-means Mask Transformer
Figure 3 for k-means Mask Transformer
Figure 4 for k-means Mask Transformer
Viaarxiv icon

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

Add code
Jun 17, 2022
Figure 1 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Figure 2 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Figure 3 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Figure 4 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Viaarxiv icon

Waymo Open Dataset: Panoramic Video Panoptic Segmentation

Add code
Jun 15, 2022
Figure 1 for Waymo Open Dataset: Panoramic Video Panoptic Segmentation
Figure 2 for Waymo Open Dataset: Panoramic Video Panoptic Segmentation
Figure 3 for Waymo Open Dataset: Panoramic Video Panoptic Segmentation
Figure 4 for Waymo Open Dataset: Panoramic Video Panoptic Segmentation
Viaarxiv icon

Federated Multi-Target Domain Adaptation

Add code
Aug 17, 2021
Figure 1 for Federated Multi-Target Domain Adaptation
Figure 2 for Federated Multi-Target Domain Adaptation
Figure 3 for Federated Multi-Target Domain Adaptation
Figure 4 for Federated Multi-Target Domain Adaptation
Viaarxiv icon

DeepLab2: A TensorFlow Library for Deep Labeling

Add code
Jun 17, 2021
Figure 1 for DeepLab2: A TensorFlow Library for Deep Labeling
Figure 2 for DeepLab2: A TensorFlow Library for Deep Labeling
Figure 3 for DeepLab2: A TensorFlow Library for Deep Labeling
Viaarxiv icon