Rui Zhao

State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Oct 17, 2024
Via arXiv

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Oct 17, 2024
Via arXiv

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

Oct 10, 2024
Via arXiv

CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation

Oct 07, 2024
Via arXiv

Hybrid Mamba for Few-Shot Segmentation

Sep 29, 2024
Via arXiv

Data Pruning via Separability, Integrity, and Model Uncertainty-Aware Importance Sampling

Sep 20, 2024
Via arXiv

SynthDoc: Bilingual Documents Synthesis for Visual Document Understanding

Aug 27, 2024
Via arXiv

QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning

Aug 20, 2024
Via arXiv

Offline RLHF Methods Need More Accurate Supervision Signals

Aug 18, 2024
Via arXiv

CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models

Jul 24, 2024
Via arXiv