Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fitting Multilinear Polynomials for Logic Gate Networks

About

We study learnable logic gate networks that stack layers of 2-input Boolean gates to build combinational circuits. Every 2-input gate has a unique multilinear polynomial with 4 coefficients, so the 16 Boolean gates form a codebook of prototypes in a 4-dimensional space, reducing training to a vector-quantization problem. The baseline method, Soft-Mix, learns a 16-dimensional softmax over gate identities, but the codebook has rank~4: 11 of 15 simplex directions carry nullspace gradient, and at uniform initialization the backward signal vanishes exactly. We prove that no affine product reparameterization fixes the resulting interaction-coefficient starvation under STE, and show that the covariance Jacobian of soft-VQ selection bypasses it by coupling the starved coefficient to the always-active constant channel. Working in the 4-dimensional polynomial space reduces each neuron from 16 to 4 parameters. On seven datasets, at least one 4-parameter method matches or exceeds Soft-Mix on every dataset; the CovJac advantage over STE grows monotonically with interaction demand across all seven datasets. At depth, Soft-Mix collapses ($-37.3$pp on CIFAR-10 at 12 layers) while CovJac holds ($-0.5$pp on CIFAR-10, stable on MNIST).

Youngsung Kim• 2026

Related benchmarks

TaskDatasetResultRank
ClassificationMNIST
Accuracy98.36
89
ClassificationAdult
Accuracy85.13
86
Image ClassificationSVHN
Accuracy68.91
47
ClassificationSplice
Accuracy97.72
41
ClassificationSVHN--
21
ClassificationCIFAR-100
Accuracy28.91
16
Classificationmonk2
Accuracy87.15
15
Image ClassificationCIFAR-100 (test)
Last-10 Accuracy32.72
8
Image ClassificationMNIST binarized (test)
Last-10 Accuracy98.3
6
Image ClassificationCIFAR-10 binarized (test)
Last-10 Accuracy58.97
5
Showing 10 of 12 rows

Other info

Follow for update