Zhen Qin

MiniMax-01: Scaling Foundation Models with Lightning Attention

Jan 14, 2025
Via arXiv

Tensor Product Attention Is All You Need

Jan 11, 2025
Via arXiv

SeMi: When Imbalanced Semi-Supervised Learning Meets Mining Hard Examples

Jan 10, 2025
Via arXiv

Optimal Error Analysis of Channel Estimation for IRS-assisted MIMO Systems

Dec 22, 2024
Via arXiv

Scaling Image Tokenizers with Grouped Spherical Quantization

Dec 03, 2024
Via arXiv

Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures

Nov 28, 2024
Via arXiv

Optimal Allocation of Pauli Measurements for Low-rank Quantum State Tomography

Nov 07, 2024
Via arXiv

Robust Low-rank Tensor Train Recovery

Oct 19, 2024
Via arXiv

Federated Data-Efficient Instruction Tuning for Large Language Models

Oct 14, 2024
Via arXiv

Integrating Planning into Single-Turn Long-Form Text Generation

Oct 08, 2024
Via arXiv