Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-context Information Retrieval on Needle-In-a-Haystack Verbatim prompt (test)

0.996Accuracy (Depth 0%)

Vanilla

0.045440.292220.5390.78578Jun 4, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.06
0.996110.998110.9981110.9980.9991
2024.06
0.9620.9760.9620.9660.9840.980.9720.9880.990.9940.9620.976
2024.06
0.9560.9940.990.9940.9920.9920.990.9960.9960.990.9560.986
2024.06
0.9520.9940.9880.9880.9880.9880.990.9940.9920.9740.9960.9858
2024.06
0.9260.9840.9940.9880.9960.9960.9880.9980.9980.9920.980.9855
2024.06
0.9020.9940.9960.9920.9860.9960.9960.9980.9980.9920.9920.9856
2024.06
0.0820.0140.030.0680.0780.1260.4540.6580.6340.8460.9940.3622