Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Relational Understanding on MultiChallenge
Loading...
20.35
IM Score
GraphIF
17.594
18.3095
19.025
19.7405
Nov 13, 2025
IM Score
SC Score
IR Score
RVE Score
Overall Score
Updated 23d ago
Evaluation Results
Method
Method
Links
IM Score
SC Score
IR Score
RVE Score
Overall Score
GraphIF
Backbone=Qwen2.5-14B-I...
2025.11
20.35
20
36.23
19.51
24.02
Qwen2.5-14B-Instruct
Evaluation Mode=LLM-Only
2025.11
17.7
10
18.84
21.95
17.12
Feedback
Search any
task
Search any
task