Ju He

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Jan 13, 2025

FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching

Dec 19, 2024

Randomized Autoregressive Visual Generation

Nov 01, 2024

LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression

Jun 28, 2024

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models

Jun 13, 2024

Learning Part Segmentation from Synthetic Animals

Nov 30, 2023

MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation

Nov 30, 2023

Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Aug 04, 2023

Compositor: Bottom-up Clustering and Compositing for Robust Part and Object Segmentation

Jun 15, 2023

Learning from Temporal Gradient for Semi-supervised Action Recognition

Dec 06, 2021