Picture for Jason Kuen

Jason Kuen

Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Add code
Dec 16, 2025
Viaarxiv icon

VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction

Add code
Dec 11, 2025
Figure 1 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Figure 2 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Figure 3 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Figure 4 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Viaarxiv icon

OIDA-QA: A Multimodal Benchmark for Analyzing the Opioid Industry Documents Archive

Add code
Nov 14, 2025
Viaarxiv icon

Image Tokenizer Needs Post-Training

Add code
Sep 15, 2025
Viaarxiv icon

Refer to Anything with Vision-Language Prompts

Add code
Jun 05, 2025
Viaarxiv icon

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Add code
May 22, 2025
Viaarxiv icon

Robust Latent Matters: Boosting Image Generation with Sampling Error

Add code
Mar 11, 2025
Figure 1 for Robust Latent Matters: Boosting Image Generation with Sampling Error
Figure 2 for Robust Latent Matters: Boosting Image Generation with Sampling Error
Figure 3 for Robust Latent Matters: Boosting Image Generation with Sampling Error
Figure 4 for Robust Latent Matters: Boosting Image Generation with Sampling Error
Viaarxiv icon

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Add code
Dec 17, 2024
Viaarxiv icon

XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation

Add code
Dec 02, 2024
Figure 1 for XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Figure 2 for XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Figure 3 for XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Figure 4 for XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Viaarxiv icon

ImageFolder: Autoregressive Image Generation with Folded Tokens

Add code
Oct 02, 2024
Figure 1 for ImageFolder: Autoregressive Image Generation with Folded Tokens
Figure 2 for ImageFolder: Autoregressive Image Generation with Folded Tokens
Figure 3 for ImageFolder: Autoregressive Image Generation with Folded Tokens
Figure 4 for ImageFolder: Autoregressive Image Generation with Folded Tokens
Viaarxiv icon