Picture for Nan Du

Nan Du

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing

Add code
Oct 02, 2024
Viaarxiv icon

Apple Intelligence Foundation Language Models

Add code
Jul 29, 2024
Figure 1 for Apple Intelligence Foundation Language Models
Figure 2 for Apple Intelligence Foundation Language Models
Figure 3 for Apple Intelligence Foundation Language Models
Figure 4 for Apple Intelligence Foundation Language Models
Viaarxiv icon

Deep State-Space Generative Model For Correlated Time-to-Event Predictions

Add code
Jul 28, 2024
Figure 1 for Deep State-Space Generative Model For Correlated Time-to-Event Predictions
Figure 2 for Deep State-Space Generative Model For Correlated Time-to-Event Predictions
Figure 3 for Deep State-Space Generative Model For Correlated Time-to-Event Predictions
Figure 4 for Deep State-Space Generative Model For Correlated Time-to-Event Predictions
Viaarxiv icon

Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction

Add code
Jul 28, 2024
Figure 1 for Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction
Figure 2 for Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction
Figure 3 for Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction
Figure 4 for Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction
Viaarxiv icon

Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

Add code
May 23, 2024
Viaarxiv icon

Knowledge Graph Reasoning with Self-supervised Reinforcement Learning

Add code
May 22, 2024
Viaarxiv icon

Self-playing Adversarial Language Game Enhances LLM Reasoning

Add code
Apr 16, 2024
Viaarxiv icon

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Add code
Mar 22, 2024
Figure 1 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 2 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 3 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 4 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Viaarxiv icon

Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models

Add code
Feb 28, 2024
Figure 1 for Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models
Figure 2 for Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models
Figure 3 for Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models
Figure 4 for Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models
Viaarxiv icon

Are Large Language Models Good Prompt Optimizers?

Add code
Feb 03, 2024
Figure 1 for Are Large Language Models Good Prompt Optimizers?
Figure 2 for Are Large Language Models Good Prompt Optimizers?
Figure 3 for Are Large Language Models Good Prompt Optimizers?
Figure 4 for Are Large Language Models Good Prompt Optimizers?
Viaarxiv icon