Fitting Multilinear Polynomials for Logic Gate Networks

About

We study learnable logic gate networks that stack layers of 2-input Boolean gates to build combinational circuits. Every 2-input gate has a unique multilinear polynomial with 4 coefficients, so the 16 Boolean gates form a codebook of prototypes in a 4-dimensional space, reducing training to a vector-quantization problem. The baseline method, Soft-Mix, learns a 16-dimensional softmax over gate identities, but the codebook has rank~4: 11 of 15 simplex directions carry nullspace gradient, and at uniform initialization the backward signal vanishes exactly. We prove that no affine product reparameterization fixes the resulting interaction-coefficient starvation under STE, and show that the covariance Jacobian of soft-VQ selection bypasses it by coupling the starved coefficient to the always-active constant channel. Working in the 4-dimensional polynomial space reduces each neuron from 16 to 4 parameters. On seven datasets, at least one 4-parameter method matches or exceeds Soft-Mix on every dataset; the CovJac advantage over STE grows monotonically with interaction demand across all seven datasets. At depth, Soft-Mix collapses ($-37.3$pp on CIFAR-10 at 12 layers) while CovJac holds ($-0.5$pp on CIFAR-10, stable on MNIST).

Youngsung Kim• 2026

Related benchmarks

Task	Dataset	Result
Classification	MNIST	Accuracy98.36	89
Classification	Adult	Accuracy85.13	86
Image Classification	SVHN	Accuracy68.91	47
Classification	Splice	Accuracy97.72	41
Classification	CIFAR-100	Accuracy28.91	24
Classification	SVHN	--	21
Classification	monk2	Accuracy87.15	15
Image Classification	CIFAR-100 (test)	Last-10 Accuracy32.72	8
Image Classification	MNIST binarized (test)	Last-10 Accuracy98.3	6
Image Classification	CIFAR-10 binarized (test)	Last-10 Accuracy58.97	5

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord