Dazhong Shen

A Comprehensive Survey on Self-Interpretable Neural Networks

Jan 26, 2025

Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation

Jan 24, 2025

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Dec 12, 2024

Pretrained Reversible Generation as Unsupervised Visual Representation Learning

Nov 29, 2024

RIGL: A Unified Reciprocal Approach for Tracing the Independent and Group Learning Processes

Jun 18, 2024

Phased Consistency Model

May 28, 2024

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Apr 19, 2024

Towards Efficient Resume Understanding: A Multi-Granularity Multi-Modal Pre-Training Approach

Apr 13, 2024

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

Apr 08, 2024

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Apr 04, 2024