Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context understanding on LongBench (test)

58.7Avg Score

Llama3.1-8B

34.05240.45146.8553.249Oct 10, 2023Mar 9, 2024Aug 8, 2024Jan 6, 2025Jun 7, 2025Nov 5, 2025Apr 6, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2026.03
58.7-45.5-54.747.1-34.925.3--------53.87391.6--99.563.4-----56.7-------
2026.03
58.4-45.4-55.447-3424.8--------53.972.591.7--99.563.2-----55.3-------
2026.03
58.2-44.8-55.746-33.124.7--------54.672.592.1--99.563.2-----54.4-------
2026.03
56.9-46.5-6564.3-31.923.6--------548189--10033.1-----38.1-------
2026.03
56.5-44.9-5546.9-31.224--------51.67190.6--95.560.6-----50.5-------
2026.03
55.9-46-63.762.4-30.522.6--------53.580.588.5--97.532.6-----37.4-------
2026.03
55.7-45.8-6363.1-30.522--------53.580.588.5--96.432.3-----37.4-------
2026.03
54.8-43.8-54.445.9-31.724--------51.767.591.9--9750.5-----44.7-------
2026.03
54.1-44.7-6162.1-30.321.9--------53.17989--94.127.5-----31.8-------
2025.12
54-45.53-----25.3627.19-------56.3772.591.6543.62--65.1-----58.65-------
2026.03
53.9-45.7-62.662.3-28.821.3--------5270.588.1--94.531.7-----35.2-------
2025.12
53.84-41.58-----25.7527.82-------55.237688.5947.35--60.53-----61.68-------
2025.05
53.8-45.5-54.747.1-34.9-27.5-------53.87391.643.87.556.763.4-------------
2025.12
53.71-45.32-----25.2927.2-------55.472.591.7843.68--64.96-----57.26-------
2025.12
53.68-41.19-----26.2327.66-------54.527688.5946.89--60.44-----61.6-------
2025.05
52.9-44.8-54.846.2-31.9-26.8-------53.970.592.343.27.754.262.1-------------
2025.12
52.89-44.32-----2527.03-------53.9271.590.8443.12--63.88-----56.39-------
2025.12
52.75-40.87-----23.926.94-------53.687488.3447.54--58.64-----60.81-------
2025.12
52.68-44.42-----25.2826.86-------52.172.591.4244.07--62.45-----55.04-------
2025.05
52.6-43.4-55.346.5-31.3-27.2-------537091.642.57.954.861.8-------------
2025.12
52.54-44.21-----25.3426.79-------53.9672.591.1442.38--62.44-----54.13-------
2025.12
52.43-40.56-----24.7226.79-------53.417588.2146.97--58.08-----58.13-------
2025.12
52.37-40.95-----22.0426.57-------53.367689.3247.65--58.85-----56.58-------
2025.12
52.3-44.03-----25.2526.87-------51.677191.0343.61--62.15-----55.13-------
2025.05
51.7-45.6-53.845.3-29-25.9-------51.269.591.341.28.254.158.5-------------
2025.12
51.1-38.56-----22.4826.65-------50.477287.1347.18--57.15-----58.29-------
2025.12
50.72-41.02-----24.5726.79-------45.8671.591.1443.41--60.55-----51.64-------
2025.12
49.95-32.64-----22.226.46-------46.577687.5246.09--57.9-----54.16-------
2025.12
49.95-35.28-----23.6126.47-------46.2772.591.2142.96--60.63-----50.63-------
2025.12
49.94-37.88-----22.1625.89-------49.277086.5345.38--56.42-----55.94-------
2026.03
49.9-33.9-54.444.7-21.822.6--------46.44890.5--7858-----50.1-------
2025.12
49.73-40.64-----22.925.09-------53.597688.1644.08--49.22-----47.92-------
2026.03
49.46-47.76-59.2343.3736.0533.6624.0524.79-------53.3371.590.2144.43--69.39-----65.57----26.042100
2026.03
49.33-48.01-59.4343.236.2132.6423.9324.93-------52.927190.2144.74--69.23-----65.01----26.761100
2026.03
49.17-47-61.4343.7434.9431.3823.9824.57-------53.870.591.1144.46--69.12-----65.87----24.860100
2026.03
49.15-47.22-57.7243.4537.431.2424.0324.51-------53.7172.590.1344.54--68.29-----64.68----24.942100
2026.03
49.04-47.23-59.343.2536.39-23.7224.61-------52.567190.2144.69--68.93-----65.22----25.360.5100
2026.03
48.98-46.56-59.0443.3734.1229.3224.1823.68-------54.077190.1144.56--69.09-----66.53----24.983100
2026.03
48.96-47.52-58.7342.736.0830.6423.7824.4-------53.271.590.2143.27--69.33-----65.36----25.551.1100
2026.03
48.93-46.49-59.3243.0936.9229.5623.824.01-------52.0471.590.2144.2--68.88-----65.58----25.332100
2026.03
48.84-47.27-57.0143.5237.2629.523.8823.47-------53.4571.589.6344--67.94-----64.83----26.112100
2026.03
48.79-45.65-60.5343.8535.2627.2124.0422.53-------54.3470.590.243.71--69.25-----65.85----24.223.5100
2026.03
48.78-46.69-58.4142.936.6129.4123.6124.18-------53.2171.590.0542.9--69.28-----65.21----25.471.1100
2026.03
48.77-46.98-57.3143.3537.6426.9323.6722.19-------53.772.588.9643.81--68.23-----64.71----27.343100
2026.03
48.45-46.13-58.5242.6636.8928.3923.6123.33-------52.486989.5543.13--69.05-----66.27----24.262100
2026.03
48.31-45.4-58.4744.0536.1327.7723.7122.88-------52.686989.0543.32--67.83-----64.71----25.882100
2026.04
48.128.14351.460.244.936.932.923.824.3--------6990.339.9-9165-----61.3-----7-
2026.02
4830.6944.44-57.4947.1133.1632.3925.4725.55-------55.7370.591.443.398.759954.94-----47.7850.6------
2026.03
48-42.89-58.8642.3235.4727.3223.0722.72-------53.017189.9542.56--68.81-----64.25----23.772100
2026.02
47.9530.0544.67-55.445.6229.4134.7725.1426.9-------55.977391.1643.24109955.13-----47.7950.48------
2026.03
47.95-45.44-5743.5336.6224.2223.3820.38-------53.847089.0542.47--68.17-----64.03----263100
2025.12
47.9-36.51-----22.9425.82-------41.856586.2741.94--59.88-----50.87-------
2026.02
47.8430.0943.84-56.0745.8131.234.7225.0526.89-------56.27391.2943.1489954.2-----46.9650.49------
2026.03
47.64-43.72-58.3742.3634.0425.0322.5521.66-------51.966989.5342.06--69.37-----64.96----24.633100
2026.03
47.46-44.3-58.7842.7935.8925.2922.9521.13-------53.2466.588.9541.64--65.95-----62.88----26.063.599.5
2026.03
47.42-41.09-59.8542.4234.524.5323.6421.25-------53.896888.1343.12--68.39-----64.4----22.583100
2026.04
47.228.843.855.362.848.935.533.524.724.7--------40.590.540.3-91.864.9-----60-----9-
2025.05
47-38.4-52.736.7-31-26.7-------34.9779014.92.567.471.9-------------
2026.02
46.9228.7942.21-55.9346.9530.8934.824.1927.32-------55.986989.942.648.039949.96-----45.1249.51------
2025.02
46.9-43.2-43.735.6-30.7---------50.3----90.523.262.40.34155----------
2026.03
46.52-42.15-57.8942.8436.7421.3322.2518.34-------53.5564.589.5540.93--66.74-----61.7----22.743100
2026.03
46.44-40.52-57.5740.8932.8523.8521.9219.7-------51.4368.589.5540.89--67.47-----61.73----23.123100
2025.05
46.2-35.9-52.841-17.7-23.8-------44.26386368.443.652.3-------------
2026.03
45.62-37.14-56.7742.2431.8221.3322.8619.04-------53.586088.3141.5--66.82-----61.96----23.033.5100
2026.04
45.62736.84559.647.534.131.82323.6--------4690.140.9-90.865.6-----59.6-----8-
2026.03
45.45-38.32-57.3640.6732.8221.5121.8918.97-------51.0459.589.4641.06--67.62-----61.88----23.032100
2025.05
45.2-25.7-52.643.7-20-20.5-------44.7418939.6848.757-------------
2026.04
45.226.93745.559.246.333.132.22323.4--------40.589.941-91.166.4-----59.9-----7.5-
2024.04
44.73350.752.768.564.349.133.925.424.9---------------------------
2025.05
44.7-37.7-52.835.8-29.8-24.9-------38.27689.515.23.564.368.4-------------
2026.03
44.3-29.4-60.154-19.319.8--------47.654.586--6124.9-----31.1-------
2025.05
44-37.6-52.634.4-29.4-25.3-------33.67689.315363.568.5-------------
43.7-40.9-3530.5-32.4---------51.2----8316.362.30.29456----------
2026.03
43.53-43.13-59.0142.534.3422.6621.6118.49-------51.8564.588.7539.48--62.78-----56.07----22.111.567.75
2026.03
43.05-34.87-55.6837.8926.6720.4320.9217.43-------47.4458.585.238.98--65.51-----57.32----18.473.5100
2026.04
42.725.930.439.152.239.929.729.922.221.9--------33.59040.7-93.465.7-----60.2-----8-
2025.02
42.3-39.7-32.128.8-31.6---------50.2----8115.660.50.29154.8----------
2026.03
42.13-32.65-54.8138.9526.5917.6620.8316.04-------45.9949.587.138.9--64.62-----58.29----19.143.599.5
2025.05
42-36.9-47.635.5-19.3-19.2-------27.97489.612.24.66266.8-------------
2026.03
41.47-38.54-47.9636.7825.5328.6921.4724.11-------38.026590.342.85--68.37-----64.63----21.73148.5
2025.05
40.6-21.4-46.538.9-17.9-18-------31.34075.734.4845.750.7-------------
2024.04
39.926.942.25662.1473933.825.126.9---------------------------
2026.04
39.424.130.531.246.541.621.630.821.823.9--------4385.438.2-55.264.7-----61.2-----10-
2025.02
39.2-36.5-29.524.5-29.3---------49.5----79.213.556.10.25747.8----------
2026.03
39.19-32.82-45.9434.3823.3425.7320.2523.5-------31.446288.7141.18--68.39-----63.65----21.250.544
2025.05
39.1-32.4-44.231.5-14.3-16.5-------267386.213.43.955.663.7-------------
2026.03
38.59-30.86-43.8933.2622.5122.2419.6221.16-------30.2158.587.4841.11--67.06-----61.59----18.94257
2026.03
38.48-30.19-46.0135.7319.5716.5119.6714.86-------41.844783.5135.56--62.14-----53.07----15.572.592
2025.02
38.3-35.6-30.625-27.7---------47.4----76.614.351.70.25748.6----------
2026.03
37.32-28.53-42.8133.5821.3418.6319.217.76-------28.524885.5840.08--65.5-----59.41----18.18169
2026.03
37.31-32.11-42.3537.5321.0715.7119.5613.47-------45.024476.6435.15--59.06-----52.04----16.223.583.5
2024.04
37.129.939.650.253.742.332.13325.527.8---------------------------
2024.04
36.69.243.150.955.343.738.93624.727.4---------------------------
2024.04
36.524.435.450.252.448.230.533.625.329---------------------------
2026.03
35.82-26.69-41.0533.4620.8215.7219.1515.14-------28.44382.5738.44--62.86-----56.69----17.651.570
2023.10
35.5---------31.334.624.646.127.848.81,822--------------------
2026.03
35.26-27.42-42.793419.6914.6718.9512.89-------36.674380.6233.89--58.34-----52.26----15.45271.5
2026.04
35.117.624.440.329.226.414.928.821.620.9--------5081.641.2-79.131.7-----47.5-----7.1-
2026.03
35.05-28.01-41.9733.9321.2314.2419.0112.2-------35.934280.8532.86--58.23-----51.93----15.873.569
2024.04
3523.643.352.351.637.726.929.523.426.7---------------------------
Showing 100 of 197 rows