Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Incremental Sequence Classification with Temporal Consistency

About

We address the problem of incremental sequence classification, where predictions are updated as new elements in the sequence are revealed. Drawing on temporal-difference learning from reinforcement learning, we identify a temporal-consistency condition that successive predictions should satisfy. We leverage this condition to develop a novel loss function for training incremental sequence classifiers. Through a concrete example, we demonstrate that optimizing this loss can offer substantial gains in data efficiency. We apply our method to text classification tasks and show that it improves predictive accuracy over competing approaches on several benchmark datasets. We further evaluate our approach on the task of verifying large language model generations for correctness in grade-school math problems. Our results show that models trained with our method are better able to distinguish promising generations from unpromising ones after observing only a few tokens.

Lucas Maystre, Gabriel Barello, Tudor Berariu, Aleix Cambray, Rares Dolga, Alvaro Ortega Gonzalez, Andrei Nica, David Barber• 2025

Related benchmarks

TaskDatasetResultRank
Reward ModelingRewardBench v2 (test)
Average Score72.6
67
Reward ModelingSkywork-Reward-Preference (test)
Final Token Accuracy92.5
27
Showing 2 of 2 rows

Other info

Follow for update