AITA

Benchmarks

Task Name	Dataset Name	Metric	SOTA Result	(unlabeled)
Annotator Modeling	AITA situation	Accuracy	68.1	19
Verdict Prediction	AITA 1.0 (author split)	Accuracy	85.6	14
Judgment Prediction	AITA verdict	Accuracy	85.9	14
Fine-grained Moral Steering	AITA	Rho	0.966	8
Moral Steering	AITA (test)	Deviation (alpha_U=100%)	-15.92	8
Sycophancy Evaluation	AITA	Sycophancy Score (S) PD-L	0.54	6
Causal Effect Estimation	AITA comments	ΔATE	3.43	6
Causal Estimation	AITA anger	ΔATE	154.61	6
