Xueqing Deng

1.58-bit FLUX

Dec 24, 2024

ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

Dec 12, 2024

Randomized Autoregressive Visual Generation

Nov 01, 2024

MaskBit: Embedding-free Image Generation via Bit Tokens

Sep 24, 2024

An Image is Worth 32 Tokens for Reconstruction and Generation

Jun 11, 2024

Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Apr 16, 2024

COCONut: Modernizing COCO Segmentation

Apr 12, 2024

MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation

Nov 30, 2023

Selective Feature Adapter for Dense Vision Transformers

Oct 03, 2023

Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Aug 04, 2023