Picture for Jiahui Peng

Jiahui Peng

Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining

Add code
Oct 10, 2024
Figure 1 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Figure 2 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Figure 3 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Figure 4 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Viaarxiv icon

DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI Data

Add code
May 28, 2024
Viaarxiv icon

FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models

Add code
Apr 29, 2024
Viaarxiv icon

Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced Analysis

Add code
Jan 04, 2024
Viaarxiv icon

VIGC: Visual Instruction Generation and Correction

Add code
Sep 11, 2023
Viaarxiv icon