Picture for Yulong Chen

Yulong Chen

SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates

Add code
Nov 26, 2024
Viaarxiv icon

Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels

Add code
Nov 21, 2024
Figure 1 for Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels
Figure 2 for Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels
Figure 3 for Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels
Figure 4 for Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels
Viaarxiv icon

The Automated Verification of Textual Claims (AVeriTeC) Shared Task

Add code
Oct 31, 2024
Figure 1 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Figure 2 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Figure 3 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Figure 4 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Viaarxiv icon

See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses

Add code
Aug 16, 2024
Figure 1 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Figure 2 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Figure 3 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Figure 4 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Viaarxiv icon

GPT-4 vs. Human Translators: A Comprehensive Evaluation of Translation Quality Across Languages, Domains, and Expertise Levels

Add code
Jul 04, 2024
Figure 1 for GPT-4 vs. Human Translators: A Comprehensive Evaluation of Translation Quality Across Languages, Domains, and Expertise Levels
Figure 2 for GPT-4 vs. Human Translators: A Comprehensive Evaluation of Translation Quality Across Languages, Domains, and Expertise Levels
Figure 3 for GPT-4 vs. Human Translators: A Comprehensive Evaluation of Translation Quality Across Languages, Domains, and Expertise Levels
Figure 4 for GPT-4 vs. Human Translators: A Comprehensive Evaluation of Translation Quality Across Languages, Domains, and Expertise Levels
Viaarxiv icon

When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain

Add code
Jun 07, 2024
Figure 1 for When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain
Figure 2 for When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain
Figure 3 for When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain
Figure 4 for When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain
Viaarxiv icon

Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data

Add code
Feb 23, 2024
Figure 1 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Figure 2 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Figure 3 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Figure 4 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Viaarxiv icon

Constituency Parsing using LLMs

Add code
Oct 31, 2023
Viaarxiv icon

HI-TOM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models

Add code
Oct 25, 2023
Viaarxiv icon

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Add code
Sep 03, 2023
Viaarxiv icon