Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

User simulation on Synthetic Exposure Overall

55Accuracy (%)

Llama-3.2-3B-Instruct +SFT+DPO

21.40830.12938.8547.571Aug 25, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.08
55
2025.08
54.7
2025.08
53.1
2025.08
52.7
2025.08
47.7
2025.08
46.4
2025.08
46
2025.08
42.6
2025.08
42.5
2025.08
42.2
2025.08
40.7
2025.08
39.7
2025.08
36.8
2025.08
36
2025.08
35.6
2025.08
33.7
2025.08
29.6
2025.08
27.2
2025.08
22.7