Hongwei Feng

LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection

Sep 03, 2024

StrucText-Eval: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding

Jun 30, 2024

DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?

Jun 18, 2024

StructBench: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding

Jun 15, 2024

Agent Group Chat: An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior

Mar 20, 2024

The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing

Mar 12, 2024

Beyond the Obvious: Evaluating the Reasoning Ability In Real-life Scenarios of Language Models on Life Scapes Reasoning Benchmark (LSR-Benchmark)

Jul 11, 2023

Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation

Jun 15, 2023

Domain Mastery Benchmark: An Ever-Updating Benchmark for Evaluating Holistic Domain Knowledge of Large Language Model--A Preliminary Release

Apr 23, 2023

Sem4SAP: Synonymous Expression Mining From Open Knowledge Graph For Language Model Synonym-Aware Pretraining

Mar 25, 2023