Fitting Multilinear Polynomials for Logic Gate Networks
About
We study learnable logic gate networks that stack layers of 2-input Boolean gates to build combinational circuits. Every 2-input gate has a unique multilinear polynomial with 4 coefficients, so the 16 Boolean gates form a codebook of prototypes in a 4-dimensional space, reducing training to a vector-quantization problem. The baseline method, Soft-Mix, learns a 16-dimensional softmax over gate identities, but the codebook has rank~4: 11 of 15 simplex directions carry nullspace gradient, and at uniform initialization the backward signal vanishes exactly. We prove that no affine product reparameterization fixes the resulting interaction-coefficient starvation under STE, and show that the covariance Jacobian of soft-VQ selection bypasses it by coupling the starved coefficient to the always-active constant channel. Working in the 4-dimensional polynomial space reduces each neuron from 16 to 4 parameters. On seven datasets, at least one 4-parameter method matches or exceeds Soft-Mix on every dataset; the CovJac advantage over STE grows monotonically with interaction demand across all seven datasets. At depth, Soft-Mix collapses ($-37.3$pp on CIFAR-10 at 12 layers) while CovJac holds ($-0.5$pp on CIFAR-10, stable on MNIST).
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Classification | MNIST | Accuracy98.36 | 89 | |
| Classification | Adult | Accuracy85.13 | 86 | |
| Image Classification | SVHN | Accuracy68.91 | 47 | |
| Classification | Splice | Accuracy97.72 | 41 | |
| Classification | SVHN | -- | 21 | |
| Classification | CIFAR-100 | Accuracy28.91 | 16 | |
| Classification | monk2 | Accuracy87.15 | 15 | |
| Image Classification | CIFAR-100 (test) | Last-10 Accuracy32.72 | 8 | |
| Image Classification | MNIST binarized (test) | Last-10 Accuracy98.3 | 6 | |
| Image Classification | CIFAR-10 binarized (test) | Last-10 Accuracy58.97 | 5 |