Summary-style Question Answering on LinkedIn Hiring Agent Summary-style Queries
[Chart: leaderboard plot. Top result: HLTM, 63.5 Token-F1, Apr 29, 2026. Metrics: Token-F1, BLEU-1, Correctness.]
Updated 1mo ago
Evaluation Results
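| Method       | Date    | Token-F1 | BLEU-1 | Correctness |
|--------------|---------|----------|--------|-------------|
| HLTM         | 2026.04 | 63.5     | 47.3   | 79.8        |
| RAG (avg.)   | 2026.04 | 45.9     | 35.1   | 59.4        |
| RAPTOR       | 2026.04 | 44.1     | 32     | 59.5        |
| A-Mem        | 2026.04 | 43       | 32.4   | 58.8        |
| HippoRAG     | 2026.04 | 41.5     | 29.7   | 68.4        |
| Full-context | 2026.04 | 41.4     | 31.5   | 63.3        |
| Mem0         | 2026.04 | 39.7     | 28.1   | 39.4        |
| ReadAgent    | 2026.04 | 39.5     | 30     | 61.4        |
| GraphRAG     | 2026.04 | 10.3     | 7.4    | 26.2        |
| SimpleMem    | 2026.04 | 9.7      | 0.5    | 25.7        |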
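
The page does not define how Token-F1 is computed for this benchmark. As a rough illustration only, the sketch below shows the token-overlap F1 commonly used in question-answering evaluation. The function name `token_f1`, the lowercasing, and the whitespace tokenization are assumptions for this example, not the benchmark's confirmed procedure.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted answer and a reference answer.

    Assumption: lowercase + whitespace tokenization. The benchmark's own
    normalization rules are not stated on the page and may differ.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()

    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical answer pair, not taken from the benchmark data.
    score = token_f1("the cat", "the cat sat")
    print(f"{score:.3f}")
```

The function returns a value between 0 and 1. If the scores in the table are on a 0 to 100 scale, which the page does not confirm, the result would be multiplied by 100 and averaged over all queries before comparison.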