Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Signal-component payoff optimization on ProudCoop+AllD discriminative cell c=5
Loading...
1.2
Payoff (Per Interaction)
Reciprocity Gradient
0.6384
0.7842
0.93
1.0758
May 8, 2026
Payoff (Per Interaction)
Reference Percentage
Updated 22d ago
Evaluation Results
Method
Method
Links
Payoff (Per Interaction)
Reference Percentage
Reciprocity Gradient
Touter=125, Seeds=5
2026.05
1.2
96
DPG
Touter=125, Seeds=10
2026.05
0.76
61
TD3
Touter=125, Seeds=10
2026.05
0.74
59
DDPG
Touter=125, Seeds=10
2026.05
0.66
53
Feedback
Search any
task
Search any
task