Picture for Ziteng Gao

Ziteng Gao

Factorized Visual Tokenization and Generation

Add code
Nov 25, 2024
Viaarxiv icon

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

Add code
Sep 29, 2024
Figure 1 for One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Figure 2 for One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Figure 3 for One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Figure 4 for One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Viaarxiv icon

Learning Video Context as Interleaved Multimodal Sequences

Add code
Jul 31, 2024
Figure 1 for Learning Video Context as Interleaved Multimodal Sequences
Figure 2 for Learning Video Context as Interleaved Multimodal Sequences
Figure 3 for Learning Video Context as Interleaved Multimodal Sequences
Figure 4 for Learning Video Context as Interleaved Multimodal Sequences
Viaarxiv icon

VideoLLM-online: Online Video Large Language Model for Streaming Video

Add code
Jun 17, 2024
Figure 1 for VideoLLM-online: Online Video Large Language Model for Streaming Video
Figure 2 for VideoLLM-online: Online Video Large Language Model for Streaming Video
Figure 3 for VideoLLM-online: Online Video Large Language Model for Streaming Video
Figure 4 for VideoLLM-online: Online Video Large Language Model for Streaming Video
Viaarxiv icon

Bootstrapping SparseFormers from Vision Foundation Models

Add code
Dec 04, 2023
Figure 1 for Bootstrapping SparseFormers from Vision Foundation Models
Figure 2 for Bootstrapping SparseFormers from Vision Foundation Models
Figure 3 for Bootstrapping SparseFormers from Vision Foundation Models
Figure 4 for Bootstrapping SparseFormers from Vision Foundation Models
Viaarxiv icon

SparseFormer: Sparse Visual Recognition via Limited Latent Tokens

Add code
Apr 07, 2023
Viaarxiv icon

STMixer: A One-Stage Sparse Action Detector

Add code
Mar 28, 2023
Viaarxiv icon

AdaMixer: A Fast-Converging Query-Based Object Detector

Add code
Mar 31, 2022
Figure 1 for AdaMixer: A Fast-Converging Query-Based Object Detector
Figure 2 for AdaMixer: A Fast-Converging Query-Based Object Detector
Figure 3 for AdaMixer: A Fast-Converging Query-Based Object Detector
Figure 4 for AdaMixer: A Fast-Converging Query-Based Object Detector
Viaarxiv icon

Mutual Supervision for Dense Object Detection

Add code
Sep 13, 2021
Figure 1 for Mutual Supervision for Dense Object Detection
Figure 2 for Mutual Supervision for Dense Object Detection
Figure 3 for Mutual Supervision for Dense Object Detection
Figure 4 for Mutual Supervision for Dense Object Detection
Viaarxiv icon

LIP: Local Importance-based Pooling

Add code
Aug 27, 2019
Figure 1 for LIP: Local Importance-based Pooling
Figure 2 for LIP: Local Importance-based Pooling
Figure 3 for LIP: Local Importance-based Pooling
Figure 4 for LIP: Local Importance-based Pooling
Viaarxiv icon