Picture for Yushi Bai

Yushi Bai

Pre-training Distillation for Large Language Models: A Design Space Exploration

Add code
Oct 21, 2024
Viaarxiv icon

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Add code
Sep 04, 2024
Figure 1 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Figure 2 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Figure 3 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Figure 4 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Viaarxiv icon

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Add code
Aug 13, 2024
Figure 1 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 2 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 3 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 4 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Viaarxiv icon

Finding Safety Neurons in Large Language Models

Add code
Jun 20, 2024
Viaarxiv icon

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Add code
Jun 18, 2024
Viaarxiv icon

DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning

Add code
Jun 06, 2024
Viaarxiv icon

Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation

Add code
Apr 07, 2024
Viaarxiv icon

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Add code
Feb 06, 2024
Figure 1 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 2 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 3 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 4 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Viaarxiv icon

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Add code
Jan 31, 2024
Viaarxiv icon

Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation

Add code
Dec 19, 2023
Viaarxiv icon