Title: SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis

Mar 15, 2024

Hengxing Cai, Xiaochen Cai, Junhan Chang, Sihang Li, Lin Yao, Changxin Wang, Zhifeng Gao, Hongshuai Wang, Yongge Li, Mujie Lin (+6 more)

Abstract: Recent breakthroughs in Large Language Models (LLMs) have revolutionized natural language understanding and generation, igniting a surge of interest in applying these technologies to scientific literature analysis. Existing benchmarks, however, inadequately evaluate the proficiency of LLMs in this area, especially in scenarios involving complex comprehension and multimodal data. In response, we introduce SciAssess, a benchmark tailored for the in-depth analysis of scientific literature and designed to provide a thorough assessment of LLMs' efficacy. SciAssess evaluates LLMs' abilities in memorization, comprehension, and analysis within the context of scientific literature analysis. It includes representative tasks from diverse scientific fields, such as general chemistry, organic materials, and alloy materials. Rigorous quality control measures ensure its reliability in terms of correctness, anonymization, and copyright compliance. SciAssess evaluates leading LLMs, including GPT-4, GPT-3.5, and Gemini, identifying their strengths and areas for improvement and supporting the ongoing development of LLM applications in scientific literature analysis. SciAssess and its resources are available at https://sci-assess.github.io, offering a valuable tool for advancing LLM capabilities in scientific literature analysis.
