Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Task Execution on 200 TE-labeled queries (test)
Loading...
62
TE-Success@1
MeetMaster-XL
51.6
54.3
57
59.7
Feb 3, 2026
TE-Success@1
Chain Length
Execution Time (s)
Updated 3mo ago
Evaluation Results
Method
Method
Links
TE-Success@1
Chain Length
Execution Time (s)
MeetMaster-XL
Architecture=Dual-proc...
2026.02
62
1.6
19.7
Closed (avg)
Type=Closed API
2026.02
58
1.5
18.1
Qwen2.5-7B
2026.02
52
1.2
16.8
Feedback
Search any
task
Search any
task