Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Convolutional Kolmogorov-Arnold Networks

About

In this paper, we present Convolutional Kolmogorov-Arnold Networks, a novel architecture that integrates the learnable spline-based activation functions of Kolmogorov-Arnold Networks (KANs) into convolutional layers. By replacing traditional fixed-weight kernels with learnable non-linear functions, Convolutional KANs offer a significant improvement in parameter efficiency and expressive power over standard Convolutional Neural Networks (CNNs). We empirically evaluate Convolutional KANs on the Fashion-MNIST dataset, demonstrating competitive accuracy with up to 50% fewer parameters compared to baseline classic convolutions. This suggests that the KAN Convolution can effectively capture complex spatial relationships with fewer resources, offering a promising alternative for parameter-efficient deep learning models.

Alexander Dylan Bodner, Antonio Santiago Tepsich, Jack Natan Spolski, Santiago Pourteau• 2024

Related benchmarks

TaskDatasetResultRank
Intrusion DetectionUNSW-NB15
Precision70.95
22
Threat DetectionNSL-KDD
Accuracy93.7
11
Intrusion DetectionCICIDS 2017
Accuracy98.55
9
Threat DetectionTri-IDS (BOT-IOT + CICIDS2017 + NSL-KDD)
Accuracy93.45
8
Bone Erosion ClassificationRAM-W600 (test)
BACC49.26
8
Showing 5 of 5 rows

Other info

Follow for update