Aojun Zhou

Chimera: Improving Generalist Model with Domain-Specific Experts

Dec 08, 2024

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Nov 16, 2024

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Oct 10, 2024

ThinK: Thinner Key Cache by Query-Driven Pruning

Jul 30, 2024

MAVIS: Mathematical Visual Instruction Tuning

Jul 11, 2024

Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning

Jul 02, 2024

ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation

May 27, 2024

SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

May 25, 2024

TerDiT: Ternary Diffusion Models with Transformers

May 23, 2024

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Mar 21, 2024