
L6

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Joint action and signal payoff optimization | L6 warmup (off-diagonal) | Payoff (Per Interaction): 3.65 | 4 |
| Action-component payoff optimization | L6 warmup discriminative cell | Per-interaction Payoff: 3.94 | 4 |