Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Revisiting Differentiable Structure Learning: Inconsistency of $\ell_1$ Penalty and Beyond

Oct 24, 2024

Kaifeng Jin, Ignavier Ng, Kun Zhang, Biwei Huang

$Figure 1 for Revisiting Differentiable Structure Learning: Inconsistency of $\ell_1$ Penalty and Beyond$

$Figure 2 for Revisiting Differentiable Structure Learning: Inconsistency of $\ell_1$ Penalty and Beyond$

$Figure 3 for Revisiting Differentiable Structure Learning: Inconsistency of $\ell_1$ Penalty and Beyond$

$Figure 4 for Revisiting Differentiable Structure Learning: Inconsistency of $\ell_1$ Penalty and Beyond$

Share this with someone who'll enjoy it:

Abstract:Recent advances in differentiable structure learning have framed the combinatorial problem of learning directed acyclic graphs as a continuous optimization problem. Various aspects, including data standardization, have been studied to identify factors that influence the empirical performance of these methods. In this work, we investigate critical limitations in differentiable structure learning methods, focusing on settings where the true structure can be identified up to Markov equivalence classes, particularly in the linear Gaussian case. While Ng et al. (2024) highlighted potential non-convexity issues in this setting, we demonstrate and explain why the use of $\ell_1$-penalized likelihood in such cases is fundamentally inconsistent, even if the global optimum of the optimization problem can be found. To resolve this limitation, we develop a hybrid differentiable structure learning method based on $\ell_0$-penalized likelihood with hard acyclicity constraint, where the $\ell_0$ penalty can be approximated by different techniques including Gumbel-Softmax. Specifically, we first estimate the underlying moral graph, and use it to restrict the search space of the optimization problem, which helps alleviate the non-convexity issue. Experimental results show that the proposed method enhances empirical performance both before and after data standardization, providing a more reliable path for future advancements in differentiable structure learning, especially for learning Markov equivalence classes.

View paper on

Share this with someone who'll enjoy it:

Title:Revisiting Differentiable Structure Learning: Inconsistency of $\ell_1$ Penalty and Beyond

Paper and Code