Inferential Calculation on TSRBench
[Chart: Accuracy by date. Plotted point: VeriTime, 77.14, Feb 8, 2026]
Updated 1mo ago
Evaluation Results
| Method | Details | Date | Accuracy |
| --- | --- | --- | --- |
| VeriTime | Base Model=Qwen3-4B-In... | 2026.02 | 77.14 |
| ChatTS | Base Model=Qwen3-4B-In... | 2026.02 | 72.38 |
| VeriTime | Base Model=Qwen2.5-3B-... | 2026.02 | 68.57 |
| GPT-4o-mini | Model Type=General LLM | 2026.02 | 65.71 |
| Qwen3-4B-Instruct | Training Protocol=Base | 2026.02 | 62.85 |
| Qwen2.5-7B-instruct | Model Type=General LLM | 2026.02 | 55.24 |
| ChatTS | Base Model=Qwen2.5-3B-... | 2026.02 | 54.28 |
| DeepSeek-R1-Distill-Qwen-7B | Model Type=General LLM | 2026.02 | 49.52 |
| Time-R1 | Base Model=Qwen2.5-7B | 2026.02 | 40.35 |
| Meta-Llama3-8B-Instruct | Model Type=General LLM | 2026.02 | 33.33 |
| Time-MQA | Base Model=Llama3-8B | 2026.02 | 32.56 |
| Qwen2.5-3B-Instruct | Training Protocol=Base | 2026.02 | 31.43 |
| Time-MQA | Base Model=Qwen2.5-7B | 2026.02 | 30.48 |
| Mistral-7B-v0.3 | Model Type=General LLM | 2026.02 | 28.57 |
| Time-MQA | Base Model=Mistral-7B | 2026.02 | 24.62 |