Liang-Chieh Chen

ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

Dec 12, 2024

Randomized Autoregressive Visual Generation

Nov 01, 2024

MaskBit: Embedding-free Image Generation via Bit Tokens

Sep 24, 2024

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models

Jun 13, 2024

An Image is Worth 32 Tokens for Reconstruction and Generation

Jun 11, 2024

Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting

Jun 04, 2024

COCONut: Modernizing COCO Segmentation

Apr 12, 2024

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

Apr 03, 2024

SPFormer: Enhancing Vision Transformer with Superpixel Representation

Jan 05, 2024

MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation

Dec 11, 2023