Xiaojun Wan

Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference

Dec 31, 2024

DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models

Dec 17, 2024

$B^4$: A Black-Box Scrubbing Attack on LLM Watermarks

Nov 02, 2024

Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation

Oct 22, 2024

Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models

Oct 17, 2024

Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles

Oct 17, 2024

Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement

Oct 06, 2024

SMART-RAG: Selection using Determinantal Matrices for Augmented Retrieval

Sep 21, 2024

PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models

Jun 26, 2024

Themis: Towards Flexible and Interpretable NLG Evaluation

Jun 26, 2024