Picture for Zengzhi Wang

Zengzhi Wang

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Add code
Sep 25, 2024
Figure 1 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Figure 2 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Figure 3 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Figure 4 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Viaarxiv icon

Data Contamination Report from the 2024 CONDA Shared Task

Add code
Jul 31, 2024
Figure 1 for Data Contamination Report from the 2024 CONDA Shared Task
Figure 2 for Data Contamination Report from the 2024 CONDA Shared Task
Figure 3 for Data Contamination Report from the 2024 CONDA Shared Task
Figure 4 for Data Contamination Report from the 2024 CONDA Shared Task
Viaarxiv icon

OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

Add code
Jun 26, 2024
Viaarxiv icon

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Add code
Jun 18, 2024
Figure 1 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 2 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 3 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 4 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Viaarxiv icon

Benchmarking Benchmark Leakage in Large Language Models

Add code
Apr 29, 2024
Viaarxiv icon

Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math

Add code
Dec 28, 2023
Viaarxiv icon

Ask Again, Then Fail: Large Language Models' Vacillations in Judgement

Add code
Oct 03, 2023
Viaarxiv icon

MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based Sentiment Analysis

Add code
Jun 29, 2023
Figure 1 for MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based Sentiment Analysis
Figure 2 for MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based Sentiment Analysis
Figure 3 for MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based Sentiment Analysis
Figure 4 for MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based Sentiment Analysis
Viaarxiv icon

Is ChatGPT a Good Sentiment Analyzer? A Preliminary Study

Add code
Apr 10, 2023
Viaarxiv icon

UnifiedABSA: A Unified ABSA Framework Based on Multi-task Instruction Tuning

Add code
Nov 20, 2022
Figure 1 for UnifiedABSA: A Unified ABSA Framework Based on Multi-task Instruction Tuning
Figure 2 for UnifiedABSA: A Unified ABSA Framework Based on Multi-task Instruction Tuning
Figure 3 for UnifiedABSA: A Unified ABSA Framework Based on Multi-task Instruction Tuning
Figure 4 for UnifiedABSA: A Unified ABSA Framework Based on Multi-task Instruction Tuning
Viaarxiv icon