Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Scenario Attribution on TSRBench
Loading...
87.5
Accuracy
VeriTime
47.8448
58.1399
68.435
78.7301
Feb 8, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
VeriTime
Base Model=Qwen3-4B-In...
2026.02
87.5
VeriTime
Base Model=Qwen2.5-3B-...
2026.02
83.52
Qwen3-4B-Instruct
Training Protocol=Base
2026.02
80.68
ChatTS
Base Model=Qwen3-4B-In...
2026.02
80.68
ChatTS
Base Model=Qwen2.5-3B-...
2026.02
78.98
Qwen2.5-7B-instruct
Model Type=General LLM
2026.02
69.89
Meta-Llama3-8B-Instruct
Model Type=General LLM
2026.02
64.77
Mistral-7B-v0.3
Model Type=General LLM
2026.02
61.36
GPT-4o-mini
Model Type=General LLM
2026.02
61.36
Qwen2.5-3B-Instruct
Training Protocol=Base
2026.02
61.36
DeepSeek-R1-Distill-Qwen-7B
Model Type=General LLM
2026.02
59.66
Time-MQA
Base Model=Qwen2.5-7B
2026.02
56.25
Time-R1
Base Model=Qwen2.5-7B
2026.02
53.14
Time-MQA
Base Model=Llama3-8B
2026.02
50.77
Time-MQA
Base Model=Mistral-7B
2026.02
49.37
Feedback
Search any
task
Search any
task