Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Joint action and signal payoff optimization on ProudCoop+AllD (off-diagonal)
Loading...
2.39
Payoff (Per Interaction)
DDPG
1.8596
1.9973
2.135
2.2727
May 8, 2026
Payoff (Per Interaction)
% of Reference
Updated 22d ago
Evaluation Results
Method
Method
Links
Payoff (Per Interaction)
% of Reference
DDPG
Touter=15, Seeds=3
2026.05
2.39
106
TD3
Touter=15, Seeds=3
2026.05
2.37
105
Reciprocity Gradient
2026.05
2.34
104
DPG
Touter=15, Seeds=3
2026.05
1.88
83
Feedback
Search any
task
Search any
task