Picture for Shirong Ma

Shirong Ma

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Add code
Jun 17, 2024
Figure 1 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 2 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 3 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 4 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Viaarxiv icon

Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

Add code
Feb 18, 2024
Viaarxiv icon

Mitigating Catastrophic Forgetting in Multi-domain Chinese Spelling Correction by Multi-stage Knowledge Transfer Framework

Add code
Feb 18, 2024
Viaarxiv icon

When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models

Add code
Feb 16, 2024
Figure 1 for When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models
Figure 2 for When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models
Figure 3 for When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models
Figure 4 for When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models
Viaarxiv icon

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Add code
Jan 05, 2024
Figure 1 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 2 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 3 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 4 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Viaarxiv icon

EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data

Add code
Dec 25, 2023
Viaarxiv icon

EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce

Add code
Aug 28, 2023
Figure 1 for EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce
Figure 2 for EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce
Figure 3 for EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce
Figure 4 for EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce
Viaarxiv icon

LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles

Add code
Aug 21, 2023
Viaarxiv icon

On the Effectiveness of Large Language Models for Chinese Text Correction

Add code
Jul 18, 2023
Viaarxiv icon

Progressive Multi-task Learning Framework for Chinese Text Error Correction

Add code
Jul 03, 2023
Figure 1 for Progressive Multi-task Learning Framework for Chinese Text Error Correction
Figure 2 for Progressive Multi-task Learning Framework for Chinese Text Error Correction
Figure 3 for Progressive Multi-task Learning Framework for Chinese Text Error Correction
Figure 4 for Progressive Multi-task Learning Framework for Chinese Text Error Correction
Viaarxiv icon