Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Feb 23, 2023

Shengnan An, Zeqi Lin, Bei Chen, Qiang Fu, Nanning Zheng, Jian-Guang Lou

Figure 1 for Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Figure 2 for Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Figure 3 for Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Figure 4 for Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Share this with someone who'll enjoy it:

Abstract:Abstraction is a desirable capability for deep learning models, which means to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context. At the same time, there is a lack of clear understanding about both the presence and further characteristics of this capability in deep learning models. In this paper, we introduce a systematic probing framework to explore the abstraction capability of deep learning models from a transferability perspective. A set of controlled experiments are conducted based on this framework, providing strong evidence that two probed pre-trained language models (PLMs), T5 and GPT2, have the abstraction capability. We also conduct in-depth analysis, thus shedding further light: (1) the whole training phase exhibits a "memorize-then-abstract" two-stage process; (2) the learned abstract concepts are gathered in a few middle-layer attention heads, rather than being evenly distributed throughout the model; (3) the probed abstraction capabilities exhibit robustness against concept mutations, and are more robust to low-level/source-side mutations than high-level/target-side ones; (4) generic pre-training is critical to the emergence of abstraction capability, and PLMs exhibit better abstraction with larger model sizes and data scales.

* ICLR 2023

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Paper and Code