
Tool Use Evaluation on GTM

Average Score: 89.4

GTM-1.5B

Updated 5mo ago

Evaluation Results

Method	Date	Score
GTM-1.5B	2025.12	89.4
Qwen2.5-14B-Instruct	2025.12	85.8
Qwen2.5-7B-Instruct	2025.12	83
InternLM2.5-20B	2025.12	69.4
Llama-3.2-3B-Instruct	2025.12	68.3
Qwen2.5-3B-Instruct	2025.12	65.3
Qwen2.5-1.5B-Instruct	2025.12	61.2
Qwen2.5-0.5B-Instruct	2025.12	45.1
Llama-3.2-1B-Instruct	2025.12	39.8
InternLM2.5-7B	2025.12	39.2
InternLM2.5-1.8B	2025.12	10.8