Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multiple-Choice Question Answering on MM-Telco

84.5CT WG1 Accuracy

GPT-4o

63.1868.71574.2579.785Nov 17, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.11
84.588.785.984.779.387.388.370.976.489.187.685.588.790.189.385.6
82.888.783.186.686.883.687.975.179.787.187.284.886.788.388.284.9
2025.11
8187.683.683.176.783.986.766.778.286.485.783.687.788.289.183.7
80.884.881.179.377.482.583.169.171.986.684.682.383.68886.281.8
80.685.783.680.575.48283.570.37386.884.681.583.486.285.481.9
79.986.481.879.5788485.368.974.885.785.182.884.288.285.682.4
75.781.280.776.570.379.377.96669.483.881.87980.484.282.778.4
6978.971.169.766.471.275.857.864.878.377.173.975.875.379.372.6
6470.568.864.561.963.868.452.658.875.570.965.869.872.473.766.8