Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Data-Effective Learning: A Comprehensive Medical Benchmark

Jan 31, 2024

Wenxuan Yang, Weimin Tan, Yuqi Sun, Bo Yan

Figure 1 for Data-Effective Learning: A Comprehensive Medical Benchmark

Figure 2 for Data-Effective Learning: A Comprehensive Medical Benchmark

Figure 3 for Data-Effective Learning: A Comprehensive Medical Benchmark

Figure 4 for Data-Effective Learning: A Comprehensive Medical Benchmark

Share this with someone who'll enjoy it:

Abstract:Data-effective learning aims to use data in the most impactful way to train AI models, which involves strategies that focus on data quality rather than quantity, ensuring the data used for training has high informational value. Data-effective learning plays a profound role in accelerating AI training, reducing computational costs, and saving data storage, which is very important as the volume of medical data in recent years has grown beyond many people's expectations. However, due to the lack of standards and comprehensive benchmark, research on medical data-effective learning is poorly studied. To address this gap, our paper introduces a comprehensive benchmark specifically for evaluating data-effective learning in the medical field. This benchmark includes a dataset with millions of data samples from 31 medical centers (DataDEL), a baseline method for comparison (MedDEL), and a new evaluation metric (NormDEL) to objectively measure data-effective learning performance. Our extensive experimental results show the baseline MedDEL can achieve performance comparable to the original large dataset with only 5% of the data. Establishing such an open data-effective learning benchmark is crucial for the medical AI research community because it facilitates efficient data use, promotes collaborative breakthroughs, and fosters the development of cost-effective, scalable, and impactful healthcare solutions. The project can be accessed at https://github.com/shadow2469/Data-Effective-Learning-A-Comprehensive-Medical-Benchmark.git.

View paper on

Share this with someone who'll enjoy it:

Title:Data-Effective Learning: A Comprehensive Medical Benchmark

Paper and Code