Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Graph Question Answering on KGQAGen-10k 1,079 unseen questions (test)
Loading...
84.9
LASM Accuracy
GPT-4o w/ SP
29.26
43.705
58.15
72.595
Mar 31, 2026
LASM Accuracy
Delta
Updated 17d ago
Evaluation Results
Method
Method
Links
LASM Accuracy
Delta
GPT-4o w/ SP
Type=Oracle subgraph,...
2026.03
84.9
-
APEX-EM (EG1: Entity graph)
Type=Rich feedback, it...
2026.03
73.7
31.7
APEX-EM (A3: Judge + iteration)
Type=Memory, binary, i...
2026.03
73.5
31.5
APEX-EM (A5: Opus judge)
Type=All + Opus judge,...
2026.03
72.7
30.7
APEX-EM (R1: Semantic only)
Type=Rich feedback, it...
2026.03
71.6
29.7
APEX-EM (EG2: Full memory)
Type=Rich feedback, it...
2026.03
71.2
29.2
APEX-EM (A5: GPT 4o as base)
Type=All, Base Model=G...
2026.03
66.2
12
PoG (GPT-4o)
Type=KG-RAG, plan-on-g...
2026.03
58.1
-
ToG (GPT-4o)
Type=KG-RAG, think-on-...
2026.03
56.3
-
GPT-4o
Type=Single-shot LLM,...
2026.03
54.2
-
GCR (GPT-4o)
Type=KG-RAG, constrain...
2026.03
54
-
APEX-EM (A1: Memory, no judge)
Type=Memory, binary si...
2026.03
43.1
1.1
APEX-EM (A0: No Memory)
Type=Sonnet 4.5, singl...
2026.03
42
-
APEX-EM (A2: Memory + judge)
Type=Memory, rich feed...
2026.03
41.8
0.2
RoG (LLaMA2-7B)
Type=KG-RAG, graph rea...
2026.03
31.4
-
Feedback
Search any
task
Search any
task