Picture for Yushi Bai

Yushi Bai

AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos

Add code
Nov 29, 2024
Viaarxiv icon

Pre-training Distillation for Large Language Models: A Design Space Exploration

Add code
Oct 21, 2024
Viaarxiv icon

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Add code
Sep 04, 2024
Figure 1 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Figure 2 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Figure 3 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Figure 4 for LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Viaarxiv icon

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Add code
Aug 13, 2024
Figure 1 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 2 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 3 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 4 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Viaarxiv icon

Finding Safety Neurons in Large Language Models

Add code
Jun 20, 2024
Viaarxiv icon

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Add code
Jun 18, 2024
Figure 1 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 2 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 3 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 4 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Viaarxiv icon

DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning

Add code
Jun 06, 2024
Viaarxiv icon

Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation

Add code
Apr 07, 2024
Figure 1 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Figure 2 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Figure 3 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Figure 4 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Viaarxiv icon

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Add code
Feb 06, 2024
Figure 1 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 2 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 3 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 4 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Viaarxiv icon

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Add code
Jan 31, 2024
Figure 1 for LongAlign: A Recipe for Long Context Alignment of Large Language Models
Figure 2 for LongAlign: A Recipe for Long Context Alignment of Large Language Models
Figure 3 for LongAlign: A Recipe for Long Context Alignment of Large Language Models
Figure 4 for LongAlign: A Recipe for Long Context Alignment of Large Language Models
Viaarxiv icon