Offline Reinforcement Learning for Dialogue Management on Reddit Casual
[Chart: Return (y-axis) by date. Plotted point: SAIQL, Return 4.65, Feb 21, 2023.]
Evaluation Results
Method   Details                    Date     Return
SAIQL    Evaluation Approach=Mo...  2023.02  4.65
FtLE     Evaluation Approach=Mo...  2023.02  4.59
MoE-VRL  Evaluation Approach=Mo...  2023.02  4.46
EXP 1*   Evaluation Approach=Mo...  2023.02  4.25
FtLE     Evaluation Approach=Mo...  2023.02  1.14
EXP 1*   Evaluation Approach=Mo...  2023.02  0.97
SAIQL    Evaluation Approach=Mo...  2023.02  0.81
MoE-VRL  Evaluation Approach=Mo...  2023.02  0.72