Question Answering on DesignQA
[Chart: Retrieval F1 (BoW) over time. Llama-11B-MCERF: 26 (Jan 31, 2026). Other available metrics: Compilation F1 (rules), Definition F1 (BoC), Presence Accuracy, Dimension Accuracy, Functional Performance Accuracy.]
Updated 5d ago
Evaluation Results
| Method | Configuration | Date | Retrieval F1 (BoW) | Compilation F1 (rules) | Definition F1 (BoC) | Presence Accuracy | Dimension Accuracy | Functional Performance Accuracy |
|---|---|---|---|---|---|---|---|---|
| Llama-11B-MCERF | Model=Llama-11B-MCERF,... | 2026.01 | 26 | 25 | 39 | 50 | 60 | 50 |
| LLaVA-1.5-RAG | Model=LLaVA-1.5-RAG | 2026.01 | 11 | 28 | 39 | 48 | 41 | 44 |