Picture for Yao Lu

Yao Lu

Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language

Add code
Oct 31, 2024
Figure 1 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 2 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 3 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 4 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Viaarxiv icon

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Add code
Oct 25, 2024
Figure 1 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 2 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 3 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 4 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Viaarxiv icon

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Add code
Oct 15, 2024
Viaarxiv icon

ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification

Add code
Oct 15, 2024
Viaarxiv icon

SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models

Add code
Oct 14, 2024
Figure 1 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Figure 2 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Figure 3 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Figure 4 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Viaarxiv icon

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Add code
Oct 14, 2024
Figure 1 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 2 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 3 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 4 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Viaarxiv icon

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

Add code
Oct 14, 2024
Figure 1 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Figure 2 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Figure 3 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Figure 4 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Viaarxiv icon

Jet Expansions of Residual Computation

Add code
Oct 08, 2024
Figure 1 for Jet Expansions of Residual Computation
Figure 2 for Jet Expansions of Residual Computation
Figure 3 for Jet Expansions of Residual Computation
Figure 4 for Jet Expansions of Residual Computation
Viaarxiv icon

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Add code
Sep 06, 2024
Figure 1 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 2 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 3 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 4 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Viaarxiv icon

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Add code
Aug 21, 2024
Figure 1 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 2 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 3 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 4 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Viaarxiv icon