Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Myopic choice evaluation on Complex generation settings
Loading...
86.43
Accuracy
LTV (Ours)
-3.062
20.1715
43.405
66.6385
Sep 29, 2025
Accuracy
Updated 6d ago
Evaluation Results
Method
Method
Links
Accuracy
LTV (Ours)
Setup=3) More layers,...
2025.09
86.43
LTV (Ours)
Setup=5) ICL prompts
2025.09
84.61
LTV (Ours)
Setup=Baseline, P={-1}...
2025.09
83.49
LTV (Ours)
Setup=2) More Pos., P=...
2025.09
82.44
LTV (Ours)
Setup=1) Diff. Pos., P...
2025.09
78.39
FV
Setup=5) ICL prompts
2025.09
74.78
Vanilla TV
Setup=5) ICL prompts
2025.09
56.12
LTV (Ours)
Setup=4) More layers &...
2025.09
51.39
Vanilla TV
Setup=Baseline, P={-1}...
2025.09
37.8
FV
Setup=Baseline, P={-1}...
2025.09
37.3
FV
Setup=3) More layers,...
2025.09
31.88
Vanilla TV
Setup=2) More Pos., P=...
2025.09
19.18
Vanilla TV
Setup=4) More layers &...
2025.09
18.15
Vanilla TV
Setup=3) More layers,...
2025.09
17.97
FV
Setup=2) More Pos., P=...
2025.09
6.05
FV
Setup=1) Diff. Pos., P...
2025.09
2.68
Vanilla TV
Setup=1) Diff. Pos., P...
2025.09
2.16
FV
Setup=4) More layers &...
2025.09
0.38
Feedback
Search any
task
Search any
task