Picture for Xiangzi Dai

Xiangzi Dai

Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension

Add code
Oct 18, 2024
Figure 1 for Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
Figure 2 for Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
Figure 3 for Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
Figure 4 for Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
Viaarxiv icon

CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination

Add code
Aug 18, 2024
Figure 1 for CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
Figure 2 for CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
Figure 3 for CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
Figure 4 for CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
Viaarxiv icon

VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling

Add code
Aug 02, 2024
Viaarxiv icon

Multi-label Cluster Discrimination for Visual Representation Learning

Add code
Jul 24, 2024
Figure 1 for Multi-label Cluster Discrimination for Visual Representation Learning
Figure 2 for Multi-label Cluster Discrimination for Visual Representation Learning
Figure 3 for Multi-label Cluster Discrimination for Visual Representation Learning
Figure 4 for Multi-label Cluster Discrimination for Visual Representation Learning
Viaarxiv icon

High-Fidelity Facial Albedo Estimation via Texture Quantization

Add code
Jun 19, 2024
Figure 1 for High-Fidelity Facial Albedo Estimation via Texture Quantization
Figure 2 for High-Fidelity Facial Albedo Estimation via Texture Quantization
Figure 3 for High-Fidelity Facial Albedo Estimation via Texture Quantization
Figure 4 for High-Fidelity Facial Albedo Estimation via Texture Quantization
Viaarxiv icon