Picture for Shuyu Liu

Shuyu Liu

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases

Add code
Mar 06, 2025
Figure 1 for Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
Figure 2 for Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
Viaarxiv icon

PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models

Add code
Dec 09, 2024
Figure 1 for PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models
Figure 2 for PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models
Figure 3 for PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models
Figure 4 for PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models
Viaarxiv icon