Wrapper-Aware Rate-Distortion Optimization in Feature Coding for Machines

About

Feature coding for machines (FCM) is a lossy compression paradigm for split-inference. The transmitter encodes the outputs of the first part of a neural network before sending them to the receiver for completing the inference. Practical FCM methods ``sandwich'' a traditional codec between pre- and post-processing neural networks, called wrappers, to make features easier to compress using video codecs. Since traditional codecs are non-differentiable, the wrappers are trained using a proxy codec, which is later replaced by a standard codec after training. These codecs perform rate-distortion optimization (RDO) based on the sum of squared errors (SSE). Because the RDO does not consider the post-processing wrapper, the inner codec can invest bits in preserving information that the post-processing later discards. In this paper, we modify the bit-allocation in the inner codec via a wrapper-aware weighted SSE metric. To make wrapper-aware RDO (WA-RDO) practical for FCM, we propose: 1) temporal reuse of weights across a group of pictures and 2) fixed, architecture- and task-dependent weights trained offline. Under MPEG test conditions, our methods implemented on HEVC match the VVC-based FCM state-of-the-art, effectively bridging a codec generation gap with minimal runtime overhead relative to SSE-RDO HEVC.

Samuel Fern\'andez-Mendui\~na, Hyomin Choi, Fabien Racap\'e, Eduardo Pavez, Antonio Ortega• 2026

Related benchmarks

Task	Dataset	Result
Object Detection	SFU Class C	BD Accuracy0.75	9
Object Detection	SFU HW Class A B	BD-accuracy41	9
Object Detection	SFU HW Class D	BD Accuracy0.0131	9
Object Tracking	TVD	BD Accuracy1.69	9
Object Tracking	HiEve	BD Accuracy1.38	9
Image Segmentation	OI v6 (test)	BD-Accuracy0.54	7
Object Detection	OI v6 (test)	BD-accuracy0.51	7

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord