Picture for Tao Lei

Tao Lei

IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining

Add code
Mar 07, 2025
Viaarxiv icon

Instruction-Following Pruning for Large Language Models

Add code
Jan 07, 2025
Figure 1 for Instruction-Following Pruning for Large Language Models
Figure 2 for Instruction-Following Pruning for Large Language Models
Figure 3 for Instruction-Following Pruning for Large Language Models
Figure 4 for Instruction-Following Pruning for Large Language Models
Viaarxiv icon

Distribution alignment based transfer fusion frameworks on quantum devices for seeking quantum advantages

Add code
Nov 04, 2024
Figure 1 for Distribution alignment based transfer fusion frameworks on quantum devices for seeking quantum advantages
Figure 2 for Distribution alignment based transfer fusion frameworks on quantum devices for seeking quantum advantages
Figure 3 for Distribution alignment based transfer fusion frameworks on quantum devices for seeking quantum advantages
Figure 4 for Distribution alignment based transfer fusion frameworks on quantum devices for seeking quantum advantages
Viaarxiv icon

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing

Add code
Oct 02, 2024
Figure 1 for EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
Figure 2 for EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
Figure 3 for EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
Figure 4 for EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
Viaarxiv icon

Apple Intelligence Foundation Language Models

Add code
Jul 29, 2024
Figure 1 for Apple Intelligence Foundation Language Models
Figure 2 for Apple Intelligence Foundation Language Models
Figure 3 for Apple Intelligence Foundation Language Models
Figure 4 for Apple Intelligence Foundation Language Models
Viaarxiv icon

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Add code
Mar 22, 2024
Figure 1 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 2 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 3 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 4 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Viaarxiv icon

Learning to Skip for Language Modeling

Add code
Nov 26, 2023
Figure 1 for Learning to Skip for Language Modeling
Figure 2 for Learning to Skip for Language Modeling
Figure 3 for Learning to Skip for Language Modeling
Figure 4 for Learning to Skip for Language Modeling
Viaarxiv icon

TEC-Net: Vision Transformer Embrace Convolutional Neural Networks for Medical Image Segmentation

Add code
Jun 07, 2023
Viaarxiv icon

CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation

Add code
Jun 06, 2023
Figure 1 for CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation
Figure 2 for CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation
Figure 3 for CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation
Figure 4 for CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation
Viaarxiv icon

Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection

Add code
Jun 03, 2023
Figure 1 for Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection
Figure 2 for Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection
Figure 3 for Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection
Figure 4 for Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection
Viaarxiv icon