Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Complex Task Solving on BrowseComp+ Naive Stream
Loading...
55
Accuracy (1st-Q)
DC-RS
46.68
48.84
51
53.16
Jun 1, 2026
Accuracy (1st-Q)
Plasticity Gain (PG)
Accuracy (2nd-Q)
Stability Gain (SG)
Updated 1d ago
Evaluation Results
Method
Method
Links
Accuracy (1st-Q)
Plasticity Gain (PG)
Accuracy (2nd-Q)
Stability Gain (SG)
DC-RS
2026.06
55
5
57
2
AWM
2026.06
54
4
53
-1
Mem0
2026.06
53
3
42
-11
ExpRAG
2026.06
51
1
56
5
ReMem
2026.06
51
1
55
4
MEMPROBE
2026.06
51
1
62
11
ReAct
2026.06
50
-
50
-
LangMem
2026.06
47
-3
49
2
Feedback
Search any
task
Search any
task