Picture for Zeyuan Chen

Zeyuan Chen

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models

Add code
Dec 09, 2024
Viaarxiv icon

CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search

Add code
Dec 03, 2024
Figure 1 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 2 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 3 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 4 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Viaarxiv icon

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Add code
Sep 05, 2024
Viaarxiv icon

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Add code
Aug 22, 2024
Figure 1 for xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Figure 2 for xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Figure 3 for xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Figure 4 for xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Viaarxiv icon

Towards Boosting LLMs-driven Relevance Modeling with Progressive Retrieved Behavior-augmented Prompting

Add code
Aug 18, 2024
Viaarxiv icon

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Add code
Aug 16, 2024
Figure 1 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 2 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 3 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 4 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Viaarxiv icon

OmniControlNet: Dual-stage Integration for Conditional Image Generation

Add code
Jun 09, 2024
Viaarxiv icon

SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant

Add code
Mar 17, 2024
Figure 1 for SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
Figure 2 for SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
Figure 3 for SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
Figure 4 for SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
Viaarxiv icon

Bayesian Diffusion Models for 3D Shape Reconstruction

Add code
Mar 11, 2024
Viaarxiv icon

Pattern-wise Transparent Sequential Recommendation

Add code
Feb 29, 2024
Figure 1 for Pattern-wise Transparent Sequential Recommendation
Figure 2 for Pattern-wise Transparent Sequential Recommendation
Figure 3 for Pattern-wise Transparent Sequential Recommendation
Figure 4 for Pattern-wise Transparent Sequential Recommendation
Viaarxiv icon