User Preference Evaluation on User-study dataset CI tasks (Group 1)
[Chart: Choice Count by date. Labeled point: UAV-GPT, 14 (Dec 9, 2025).]
Evaluation Results

Method     Date       Choice Count
UAV-GPT    2025.12    14
GS         2025.12    12