Chris Liu

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

May 09, 2024
Via arXiv

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Feb 08, 2024
Via arXiv

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

Nov 13, 2023
Via arXiv

ImageBind-LLM: Multi-modality Instruction Tuning

Sep 11, 2023
Via arXiv

Structured Knowledge Distillation for Semantic Segmentation

Mar 12, 2019
Via arXiv