Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-Paper Question Answering on ScholarQABench Multi

4.32Mean Score

Llama 3.1 8B Instruct

3.6963.8584.024.182Apr 2, 2026
Updated 16d ago

Evaluation Results

MethodLinks
2026.04
4.324.014.194.76475000
2026.04
4.193.964.064.54450000
2026.04
4.123.924.444.02578.6--42.8
2026.04
4.043.833.94.3845323.526.4524.89
2026.04
43.893.884.2346621.219.6620.4
2026.04
3.983.823.844.2648423.8123.9623.89
2026.04
3.963.853.834.1946824.5222.6923.57
2026.04
3.943.593.924.311,29725.6422.7724.12
2026.04
3.943.813.84.253224.5123.3323.91
2026.04
3.913.743.784.278423.4121.0622.18
2026.04
3.883.653.814.1884124.7423.7224.22
2026.04
3.873.553.84.271,02420.118.7519.4
2026.04
3.863.553.824.2188724.0725.0224.53
2026.04
3.853.63.874.081,01723.3521.6322.46
2026.04
3.79------0
2026.04
3.723.673.513.9936316.4919.0517.68