Dakuan Lu

AURORA: Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

Feb 17, 2025

SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain

Jan 26, 2025

MINDECHO: Role-Playing Language Agents for Key Opinion Leaders

Jul 07, 2024

ConcEPT: Concept-Enhanced Pre-Training for Language Models

Jan 11, 2024

Source Prompt: Coordinated Pre-training of Language Models on Diverse Corpora from Multiple Sources

Nov 16, 2023

BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark

Feb 26, 2023