Abstract
Learning a faithful Directed Acyclic Graph (DAG) from samples of a joint distribution is a challenging combinatorial problem due to the superexponential search space in the number of nodes. Recent breakthroughs reformulate this as a continuous optimization problem with a structural constraint ensuring acyclicity (Zheng et al., 2018). While effective for linear Structural Equation Models (SEMs), these methods are limited in capturing nonlinear relationships. This work proposes DAG-GNN, a deep generative model leveraging Graph Neural Networks (GNNs) to learn DAGs from complex data. The model uses a variational autoencoder parameterized by a novel GNN architecture, handling both discrete and vector-valued variables. Experiments show superior accuracy on synthetic nonlinear data and competitive performance on benchmark datasets.
Key Innovations
- Deep Generative Model: Combines variational autoencoders with GNNs to capture nonlinear dependencies.
- Generalized SEM: Extends the linear SEM to nonlinear mappings via GNNs; with identity transforms the model reduces to the linear case.
- Flexible Data Handling: Naturally supports discrete and vector-valued variables.
- Practical Acyclicity Constraint: Introduces a polynomial alternative to matrix exponential for easier implementation.
Methodology
1. Graph Neural Network Architecture
The proposed DAG-GNN architecture is inspired by linear SEM but incorporates nonlinear transforms:
- Decoder: \( X = f_2\big((I - A^T)^{-1} f_1(Z)\big) \), where \( f_1 \) and \( f_2 \) are parameterized functions.
- Encoder: \( Z = f_4\big((I - A^T) f_3(X)\big) \), with \( f_3 \) and \( f_4 \) as MLPs.
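The encoder and decoder above can be sketched in a few lines of numpy. This is a minimal illustration, not the paper's implementation: the parameterized functions \( f_1, \dots, f_4 \) are taken as identity maps here, which reduces the model to the linear SEM \( X = A^T X + Z \) and makes the encoder an exact inverse of the decoder.

```python
import numpy as np

def decoder(Z, A, f1=lambda t: t, f2=lambda t: t):
    """X = f2((I - A^T)^{-1} f1(Z)); f1, f2 are identity in this sketch."""
    d = A.shape[0]
    return f2(np.linalg.inv(np.eye(d) - A.T) @ f1(Z))

def encoder(X, A, f3=lambda t: t, f4=lambda t: t):
    """Z = f4((I - A^T) f3(X)); mirrors the decoder in reverse."""
    d = A.shape[0]
    return f4((np.eye(d) - A.T) @ f3(X))

# Hypothetical 2-node DAG with a single weighted edge 1 -> 2.
A = np.array([[0.0, 0.8],
              [0.0, 0.0]])
Z = np.array([[1.0], [2.0]])
X = decoder(Z, A)
# With identity transforms, the encoder recovers Z exactly.
assert np.allclose(encoder(X, A), Z)
```

In the actual model, \( f_1, \dots, f_4 \) would be MLPs with learned weights, so the inversion is only approximate and is trained through the variational objective below.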
2. Variational Autoencoder Framework
- ELBO Optimization: Maximizes the evidence lower bound:
\[
\mathcal{L}_{\text{ELBO}} = -D_{\text{KL}}\big(q(Z|X) \,\|\, p(Z)\big) + \mathbb{E}_{q(Z|X)}[\log p(X|Z)].
\]
- Continuous & Discrete Support:
  - For continuous variables: Gaussian likelihoods.
  - For discrete variables: Categorical likelihoods with softmax outputs.
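For the continuous case, both ELBO terms have closed forms. The sketch below assumes (as is standard for VAEs, though the names `mu`, `log_var`, `x_recon` are illustrative) a diagonal-Gaussian posterior \( q(Z|X) = \mathcal{N}(\mu, \operatorname{diag}(\sigma^2)) \), a standard-normal prior, and a unit-variance Gaussian decoder:

```python
import numpy as np

def gaussian_elbo(x, x_recon, mu, log_var):
    """ELBO for one sample: reconstruction log-likelihood minus KL term."""
    # Closed-form KL( N(mu, diag(exp(log_var))) || N(0, I) ).
    kl = 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var)
    # Log-density of x under N(x_recon, I), summed over dimensions.
    recon = -0.5 * np.sum((x - x_recon)**2 + np.log(2 * np.pi))
    return recon - kl

# When q matches the prior (mu = 0, log_var = 0), the KL term vanishes
# and only the reconstruction term remains.
x = np.zeros(2)
assert np.isclose(gaussian_elbo(x, x, np.zeros(2), np.zeros(2)),
                  -np.log(2 * np.pi))
```

For discrete variables, the reconstruction term would instead be a categorical log-likelihood over softmax outputs, with the rest of the objective unchanged.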
3. Acyclicity Constraint
A novel polynomial constraint ensures acyclicity:
\[
h(A) = \operatorname{tr}\big[(I + \alpha A \circ A)^m\big] - m = 0,
\]
where \( m \) is the number of nodes, \( \alpha > 0 \) is a hyperparameter, and \( \circ \) denotes the elementwise (Hadamard) product.
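The constraint is straightforward to evaluate with standard matrix operations. A minimal sketch (with \( \alpha = 1 \) chosen arbitrarily for illustration): the score is zero exactly when \( A \) contains no directed cycle, and strictly positive otherwise.

```python
import numpy as np

def acyclicity(A, alpha=1.0):
    """h(A) = tr[(I + alpha * A∘A)^m] - m; zero iff A is a DAG."""
    m = A.shape[0]
    M = np.eye(m) + alpha * (A * A)          # A∘A: elementwise square
    return np.trace(np.linalg.matrix_power(M, m)) - m

dag = np.array([[0.0, 1.0],
                [0.0, 0.0]])                 # single edge 1 -> 2: acyclic
cyc = np.array([[0.0, 1.0],
                [1.0, 0.0]])                 # 1 <-> 2: a directed 2-cycle
assert np.isclose(acyclicity(dag), 0.0)
assert acyclicity(cyc) > 0.0
```

Because \( h(A) \) is a polynomial in the entries of \( A \), its gradient is available to any autodiff framework, which is what makes it convenient as a differentiable penalty during training.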
Experiments
1. Synthetic Data
- Linear SEM: DAG-GNN matches DAG-NOTEARS in accuracy.
- Nonlinear SEM: Outperforms DAG-NOTEARS with lower SHD and FDR (e.g., \( x = 2\sin(A^T(x + 0.5)) + A^T x + z \)).
- Vector-Valued Data: Handles multi-dimensional variables (\( d = 5 \)) effectively.
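A sample from the nonlinear SEM above can be drawn as follows. This is a hedged sketch, not necessarily the paper's exact generator: since each variable depends only on its parents in the DAG, a fixed-point sweep of the structural equation converges exactly after \( d \) iterations.

```python
import numpy as np

def sample_nonlinear_sem(A, rng):
    """Draw one observation from x = 2*sin(A^T(x + 0.5)) + A^T x + z."""
    d = A.shape[0]
    z = rng.standard_normal(d)               # Gaussian noise term
    x = np.zeros(d)
    for _ in range(d):                       # exact after d sweeps on a DAG
        x = 2 * np.sin(A.T @ (x + 0.5)) + A.T @ x + z
    return x, z

rng = np.random.default_rng(0)
A = np.array([[0.0, 1.0],
              [0.0, 0.0]])                   # hypothetical chain 1 -> 2
x, z = sample_nonlinear_sem(A, rng)
# The sample satisfies the structural equation exactly.
assert np.allclose(x, 2 * np.sin(A.T @ (x + 0.5)) + A.T @ x + z)
```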
2. Benchmark Datasets
- Discrete Variables (Child, Alarm, Pigs): Achieves BIC scores close to global optima found by GOPNILP.
3. Applications
- Protein Signaling Networks: Identifies 8/20 ground-truth edges with minimal false positives.
- KB Relation Inference: Learns intuitive causal relations (e.g., person/Nationality ⇒ person/Languages).
Advantages
- Rich Capacity: Captures complex nonlinear relationships.
- Unified Framework: Handles diverse data types without ad-hoc adjustments.
- Scalability: Efficiently learns large graphs via continuous optimization.
FAQs
Q1: How does DAG-GNN handle discrete variables?
A1: The decoder outputs a categorical distribution via softmax, enabling direct modeling of discrete data.
Q2: Why use a polynomial constraint instead of matrix exponential?
A2: The matrix exponential is not readily available in every deep learning framework, whereas the polynomial form needs only matrix powers, which are universally supported; a suitable choice of \( \alpha \) also keeps the computation numerically stable.
Q3: Can DAG-GNN outperform combinatorial search methods?
A3: While not always globally optimal, it scales better and achieves competitive results on large graphs.
Q4: What’s the computational cost?
A4: Training uses gradient-based optimization under an augmented Lagrangian; each step involves dense \( m \times m \) matrix operations (including the inverse \( (I - A^T)^{-1} \)), so the per-step cost grows polynomially with the number of nodes.
Conclusion: DAG-GNN advances DAG learning by integrating deep generative models with GNNs, offering a flexible, scalable solution for nonlinear and discrete data. Future work includes extending to dynamic graphs and higher-order dependencies.