Share your thoughts, 1 month free Claude Pro on usSee more

Agent interaction on Agent

100Clean Success (Eager)

Llama-3.2-1B-Instruct

Updated 2mo ago

Evaluation Results

Method	Links
Llama-3.2-1B-Instruct 2026.05		100	100	37.5	68.8
Llama-3.2-3B-Instruct 2026.05		100	100	41.2	56.2
Qwen2.5-1.5B-Instruct 2026.05		100	100	90	66.2
Qwen2.5-3B-Instruct 2026.05		100	100	65	60