Share your thoughts, 1 month free Claude Pro on usSee more

Inferential Calculation on TSRBench

77.14Accuracy

VeriTime

Updated 5mo ago

Evaluation Results

Method	Links
VeriTime 2026.02		77.14
ChatTS 2026.02		72.38
VeriTime 2026.02		68.57
GPT-4o-mini 2026.02		65.71
Qwen3-4B-Instruct 2026.02		62.85
Qwen2.5-7B-instruct 2026.02		55.24
ChatTS 2026.02		54.28
DeepSeek-R1-Distill-Qwen-7B 2026.02		49.52
Time-R1 2026.02		40.35
Meta-Llama3-8B-Instruct 2026.02		33.33
Time-MQA 2026.02		32.56
Qwen2.5-3B-Instruct 2026.02		31.43
Time-MQA 2026.02		30.48
Mistral-7B-v0.3 2026.02		28.57
Time-MQA 2026.02		24.62