Title: MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant Generalization

Oct 07, 2024

Yunlong Zhao, Xiaoheng Deng, Xiu Su, Hongyan Xu, Xiuxing Li, Yijing Liu, Shan You

Abstract: Dataset distillation (DD) entails creating a refined, compact distilled dataset from a large-scale dataset to facilitate efficient training. A significant challenge in DD is the dependency between the distilled dataset and the neural network (NN) architecture used to produce it: a dataset distilled with one specific architecture often yields diminished training performance when it is used to train a different architecture. This paper introduces MetaDD, designed to enhance the generalizability of DD across various NN architectures. Specifically, MetaDD partitions distilled data into meta features (i.e., the data's common characteristics that remain consistent across different NN architectures) and heterogeneous features (i.e., the characteristics of the data that are unique to each NN architecture). MetaDD then employs an architecture-invariant loss function for multi-architecture feature alignment, which increases the meta features and reduces the heterogeneous features in the distilled data. As a component with low memory consumption, MetaDD can be seamlessly integrated into any DD methodology. Experimental results demonstrate that MetaDD significantly improves performance across various DD methods. On the Distilled Tiny-Imagenet with Sre2L (50 IPC), MetaDD achieves cross-architecture NN accuracy of up to 30.1%, surpassing the second-best method (GLaD) by 1.7%.
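
To make the idea of multi-architecture feature alignment concrete, here is a minimal sketch in plain Python. It is not the loss defined in the paper, whose exact form the abstract does not give. It assumes, purely for illustration, that each architecture's features for one distilled sample have already been projected to a shared dimension, and it penalizes how far each architecture's normalized feature vector lies from the average across architectures. The names `l2_normalize` and `alignment_loss` are hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a feature vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def alignment_loss(features):
    """Illustrative alignment penalty (an assumption, not the paper's loss).

    `features` holds one vector per architecture, all of the same length.
    The result is the mean squared distance between each normalized vector
    and the average of the normalized vectors; it is 0 when every
    architecture produces the same feature direction.
    """
    normalized = [l2_normalize(f) for f in features]
    count = len(normalized)
    dim = len(normalized[0])
    # Component-wise average across architectures: a stand-in for the shared part.
    mean = [sum(f[i] for f in normalized) / count for i in range(dim)]
    total = 0.0
    for f in normalized:
        # Deviation from the average: a stand-in for the architecture-specific part.
        total += sum((a - b) ** 2 for a, b in zip(f, mean))
    return total / count


if __name__ == "__main__":
    agreeing = [[1.0, 0.0], [2.0, 0.0]]     # same direction, different scale
    disagreeing = [[1.0, 0.0], [0.0, 1.0]]  # orthogonal directions
    print(alignment_loss(agreeing))
    print(alignment_loss(disagreeing))
```

In this sketch, a lower value means the architectures agree more on the sample's features. In a distillation loop, a term like this would be minimized with respect to the distilled data alongside the base DD objective, which is again an assumption about how such a component could be plugged in.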
