Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Closed-ended Task Evaluation on L-Eval closed-ended tasks

41.86Coursera Score

Resonance YaRN

28.714432.127235.5438.9528Feb 29, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.02
41.863542.5765.85.5648.4439.87
2024.02
41.723441.0966.912.2248.4439.06
2024.02
40.262138.1265.431.1146.8835.47
2024.02
38.663943.5665.061.1162.541.65
2024.02
38.083943.0765.43063.2841.48
2024.02
36.77326.7334.21.1150.7825.43
2024.02
36.482234.1655.76057.0334.24
2024.02
36.342740.5956.513.3361.7237.58
2024.02
36.051933.1750.564.4456.2533.24
2024.02
35.032437.6257.624.4460.9436.61
2024.02
31.983234.6559.111.1136.7232.59
2024.02
29.223940.5963.941.1139.8435.62