Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Action Routing on Stress Tests (N=1k)
Loading...
0
Failure Rate: Conversational Text
BloClaw
-0.728
4.186
9.1
14.014
Apr 1, 2026
Failure Rate: Conversational Text
Failure Rate: Unescaped Quotes
Failure Rate: Multi-line Code Strings
Failure Rate: Missing End Tags
Average Failure Rate (N=1k)
Updated 17d ago
Evaluation Results
Method
Method
Links
Failure Rate: Conversational Text
Failure Rate: Unescaped Quotes
Failure Rate: Multi-line Code Strings
Failure Rate: Missing End Tags
Average Failure Rate (N=1k)
BloClaw
Routing Protocol=XML +...
2026.04
0
0.2
0.5
3.1
0.95
JSON Routing
Routing Protocol=JSON
2026.04
18.2
45.5
72
12.4
37
Feedback
Search any
task
Search any
task