Offline Reinforcement Learning for Dialogue Management on Reddit Casual
[Chart: Return by date. Top entry: SAIQL, Return 4.65 (Feb 21, 2023).]
Evaluation Results
| Method  | Tag                       | Date    | Return |
|---------|---------------------------|---------|--------|
| SAIQL   | Evaluation Approach=Mo... | 2023.02 | 4.65   |
| FtLE    | Evaluation Approach=Mo... | 2023.02 | 4.59   |
| MoE-VRL | Evaluation Approach=Mo... | 2023.02 | 4.46   |
| EXP 1*  | Evaluation Approach=Mo... | 2023.02 | 4.25   |
| FtLE    | Evaluation Approach=Mo... | 2023.02 | 1.14   |
| EXP 1*  | Evaluation Approach=Mo... | 2023.02 | 0.97   |
| SAIQL   | Evaluation Approach=Mo... | 2023.02 | 0.81   |
| MoE-VRL | Evaluation Approach=Mo... | 2023.02 | 0.72   |