Zhiqiang Zhang

MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models

Mar 19, 2025 (via arXiv)

Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering

Mar 14, 2025 (via arXiv)

Stick to Facts: Towards Fidelity-oriented Product Description Generation

Mar 12, 2025 (via arXiv)

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Mar 07, 2025 (via arXiv)

Bi'an: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation

Feb 26, 2025 (via arXiv)

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

Feb 13, 2025 (via arXiv)

K-ON: Stacking Knowledge On the Head Layer of Large Language Model

Feb 10, 2025 (via arXiv)

IceBerg: Debiased Self-Training for Class-Imbalanced Node Classification

Feb 10, 2025 (via arXiv)

Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis

Feb 06, 2025 (via arXiv)

MAQInstruct: Instruction-based Unified Event Relation Extraction

Feb 06, 2025 (via arXiv)