Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ZENITH: Automated Gradient Norm Informed Stochastic Optimization

About

Training deep computer vision models requires manual oversight or hyperparameter tuning of the learning rate (LR) schedule. While existing adaptive optimizers schedule the LR automatically, they suffer from computational and memory overhead, incompatibility with regularization, and suboptimal LR choices. In this work, we introduce the ZENITH (Zero-overhead Evolution using Norm-Informed Training History) optimizer, which adapts the LR using the temporal evolution of the gradient norm. Image classification experiments spanning 6 CNN architectures and 6 benchmarks demonstrate that ZENITH achieves higher test accuracy in lower wall-clock time than baselines. It also yielded superior mAP in object detection, keypoint detection, and instance segmentation on MS COCO using the R-CNN family of models. Furthermore, its compatibility with regularization enables even better generalization.

Dhrubo Saha• 2026

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy92.4
3381
Image ClassificationImageNet-100 (test)
Clean Accuracy78.2
109
Image ClassificationFood-101 (test)--
89
Image ClassificationCIFAR-10
Latency (ms/iter)17.47
13
Image ClassificationMNIST (test)
Accuracy99.57
12
Instance SegmentationMS-COCO 2017 (test)
Box mAP5059.3
6
Keypoint DetectionMS-COCO 2017 (test)
mAP50 (Box)81.3
6
Showing 7 of 7 rows

Other info

Follow for update