Multi-document Question Answering on FRAMES (test)
[Chart: Hypervolume by date. Leading method: RADAR, 0.8762 (Sep 29, 2025).]
Updated 1mo ago
Evaluation Results
Method | Links | Hypervolume
RADAR (Evaluation Protocol=ID...) | 2025.09 | 0.8762
IRT-Router (Evaluation Protocol=ID...) | 2025.09 | 0.8501
RouterBench (Evaluation Protocol=ID...) | 2025.09 | 0.8325
Random-Pair (Evaluation Protocol=ID...) | 2025.09 | 0.6589