Picture for Junke Wang

Junke Wang

OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation

Add code
Jun 13, 2024
Figure 1 for OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Figure 2 for OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Figure 3 for OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Figure 4 for OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Viaarxiv icon

OmniVid: A Generative Framework for Universal Video Understanding

Add code
Mar 26, 2024
Viaarxiv icon

MouSi: Poly-Visual-Expert Vision-Language Models

Add code
Jan 30, 2024
Viaarxiv icon

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Add code
Nov 29, 2023
Viaarxiv icon

ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System

Add code
Apr 29, 2023
Viaarxiv icon

OmniTracker: Unifying Object Tracking by Tracking-with-Detection

Add code
Mar 21, 2023
Viaarxiv icon

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Add code
Dec 13, 2022
Viaarxiv icon

Fighting Malicious Media Data: A Survey on Tampering Detection and Deepfake Detection

Add code
Dec 12, 2022
Viaarxiv icon

OmniVL:One Foundation Model for Image-Language and Video-Language Tasks

Add code
Sep 15, 2022
Figure 1 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 2 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 3 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 4 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Viaarxiv icon

ObjectFormer for Image Manipulation Detection and Localization

Add code
Mar 29, 2022
Figure 1 for ObjectFormer for Image Manipulation Detection and Localization
Figure 2 for ObjectFormer for Image Manipulation Detection and Localization
Figure 3 for ObjectFormer for Image Manipulation Detection and Localization
Figure 4 for ObjectFormer for Image Manipulation Detection and Localization
Viaarxiv icon