Picture for Liqun Ma

Liqun Ma

National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China, School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch

Add code
Jan 16, 2025
Viaarxiv icon

Bi-Mamba: Towards Accurate 1-Bit State Space Models

Add code
Nov 18, 2024
Viaarxiv icon

FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation

Add code
Jul 09, 2024
Figure 1 for FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Figure 2 for FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Figure 3 for FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Figure 4 for FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Viaarxiv icon

Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models

Add code
Nov 08, 2023
Viaarxiv icon

SlimPajama-DC: Understanding Data Combinations for LLM Training

Add code
Sep 19, 2023
Viaarxiv icon

CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization

Add code
Sep 06, 2021
Figure 1 for CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization
Figure 2 for CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization
Figure 3 for CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization
Figure 4 for CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization
Viaarxiv icon

An Empirical Evaluation of Multi-task Learning in Deep Neural Networks for Natural Language Processing

Add code
Aug 16, 2019
Figure 1 for An Empirical Evaluation of Multi-task Learning in Deep Neural Networks for Natural Language Processing
Figure 2 for An Empirical Evaluation of Multi-task Learning in Deep Neural Networks for Natural Language Processing
Figure 3 for An Empirical Evaluation of Multi-task Learning in Deep Neural Networks for Natural Language Processing
Figure 4 for An Empirical Evaluation of Multi-task Learning in Deep Neural Networks for Natural Language Processing
Viaarxiv icon