Multi-temporal motif detection on LLMTM
[Chart: Accuracy and Average Tokens over time. Top result: 99 Accuracy, tool-augmented LLM agent, Dec 24, 2025.]
Evaluation Results
| Method | Method label | Date | Accuracy | Average Tokens |
| --- | --- | --- | --- | --- |
| tool-augmented LLM agent | Agent | 2025.12 | 99 | 9,190.19 |
| tool-augmented LLM agent | Agent | 2025.12 | 98 | 12,298.63 |
| GPT-4o-mini | GPT-4o-mini | 2025.12 | 18.8 | 2,716.59 |
| GPT-4o-mini | GPT-4o-mini | 2025.12 | 13.07 | 2,790.1 |