Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Benign Infection Control on Compliance
Loading...
100
Metric M Score
AutoGen
-4
23
50
77
Mar 4, 2026
Metric M Score
Metric Q Score
Metric R Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Metric M Score
Metric Q Score
Metric R Score
AutoGen
Defense=Strict
2026.03
100
100
100
AutoGen
Defense=Speed
2026.03
100
90
100
AutoGen
Defense=Balanced
2026.03
95
90
100
MetaGPT
Defense=Strict
2026.03
85
100
100
MetaGPT
Defense=Balanced
2026.03
80
100
85
MetaGPT
Defense=Speed
2026.03
77.5
95
82.5
LangGraph
Defense=Strict
2026.03
77.5
97.5
80
LangGraph
Defense=Speed
2026.03
77.5
20
77.5
LangGraph
Defense=Balanced
2026.03
60
100
97.5
AutoGen
Defense=Reflection
2026.03
12.5
0
5
MetaGPT
Defense=Reflection
2026.03
2.5
7.5
5
LangGraph
Defense=Reflection
2026.03
0
0
2.5
Feedback
Search any
task
Search any
task