Picture for Derek F. Wong

Derek F. Wong

DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios

Add code
Oct 31, 2024
Viaarxiv icon

VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

Add code
Oct 30, 2024
Viaarxiv icon

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

Add code
Oct 17, 2024
Viaarxiv icon

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Add code
Oct 10, 2024
Figure 1 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 2 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 3 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 4 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Viaarxiv icon

Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning

Add code
Oct 03, 2024
Figure 1 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Figure 2 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Figure 3 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Figure 4 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Viaarxiv icon

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Add code
Jul 11, 2024
Figure 1 for Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Figure 2 for Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Figure 3 for Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Figure 4 for Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Viaarxiv icon

AnyTrans: Translate AnyText in the Image with Large Scale Models

Add code
Jun 17, 2024
Viaarxiv icon

CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation

Add code
Jun 11, 2024
Viaarxiv icon

FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models

Add code
Jun 02, 2024
Viaarxiv icon

CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

Add code
May 26, 2024
Viaarxiv icon