Wrapper-Aware Rate-Distortion Optimization in Feature Coding for Machines
About
Feature coding for machines (FCM) is a lossy compression paradigm for split-inference. The transmitter encodes the outputs of the first part of a neural network before sending them to the receiver for completing the inference. Practical FCM methods ``sandwich'' a traditional codec between pre- and post-processing neural networks, called wrappers, to make features easier to compress using video codecs. Since traditional codecs are non-differentiable, the wrappers are trained using a proxy codec, which is later replaced by a standard codec after training. These codecs perform rate-distortion optimization (RDO) based on the sum of squared errors (SSE). Because the RDO does not consider the post-processing wrapper, the inner codec can invest bits in preserving information that the post-processing later discards. In this paper, we modify the bit-allocation in the inner codec via a wrapper-aware weighted SSE metric. To make wrapper-aware RDO (WA-RDO) practical for FCM, we propose: 1) temporal reuse of weights across a group of pictures and 2) fixed, architecture- and task-dependent weights trained offline. Under MPEG test conditions, our methods implemented on HEVC match the VVC-based FCM state-of-the-art, effectively bridging a codec generation gap with minimal runtime overhead relative to SSE-RDO HEVC.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | SFU Class C | BD Accuracy0.75 | 9 | |
| Object Detection | SFU HW Class A B | BD-accuracy41 | 9 | |
| Object Detection | SFU HW Class D | BD Accuracy0.0131 | 9 | |
| Object Tracking | TVD | BD Accuracy1.69 | 9 | |
| Object Tracking | HiEve | BD Accuracy1.38 | 9 | |
| Image Segmentation | OI v6 (test) | BD-Accuracy0.54 | 7 | |
| Object Detection | OI v6 (test) | BD-accuracy0.51 | 7 |