Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Wrapper-Aware Rate-Distortion Optimization in Feature Coding for Machines

About

Feature coding for machines (FCM) is a lossy compression paradigm for split-inference. The transmitter encodes the outputs of the first part of a neural network before sending them to the receiver for completing the inference. Practical FCM methods ``sandwich'' a traditional codec between pre- and post-processing neural networks, called wrappers, to make features easier to compress using video codecs. Since traditional codecs are non-differentiable, the wrappers are trained using a proxy codec, which is later replaced by a standard codec after training. These codecs perform rate-distortion optimization (RDO) based on the sum of squared errors (SSE). Because the RDO does not consider the post-processing wrapper, the inner codec can invest bits in preserving information that the post-processing later discards. In this paper, we modify the bit-allocation in the inner codec via a wrapper-aware weighted SSE metric. To make wrapper-aware RDO (WA-RDO) practical for FCM, we propose: 1) temporal reuse of weights across a group of pictures and 2) fixed, architecture- and task-dependent weights trained offline. Under MPEG test conditions, our methods implemented on HEVC match the VVC-based FCM state-of-the-art, effectively bridging a codec generation gap with minimal runtime overhead relative to SSE-RDO HEVC.

Samuel Fern\'andez-Mendui\~na, Hyomin Choi, Fabien Racap\'e, Eduardo Pavez, Antonio Ortega• 2026

Related benchmarks

TaskDatasetResultRank
Object DetectionSFU Class C
BD Accuracy0.75
9
Object DetectionSFU HW Class A B
BD-accuracy41
9
Object DetectionSFU HW Class D
BD Accuracy0.0131
9
Object TrackingTVD
BD Accuracy1.69
9
Object TrackingHiEve
BD Accuracy1.38
9
Image SegmentationOI v6 (test)
BD-Accuracy0.54
7
Object DetectionOI v6 (test)
BD-accuracy0.51
7
Showing 7 of 7 rows

Other info

Follow for update