Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Single needle-in-a-haystack retrieval on RULER S-NIAH

100S-NIAH PK Success Rate (2K)

Transformer++

89.39292.14694.997.654Oct 30, 2025
Updated 9d ago

Evaluation Results

MethodLinks
2025.10
10010062.610010059.410010098.691.2
2025.10
10010010010010099.698.897.895.499.1
2025.10
99.699.69999.899.898.89997.494.898.6
2025.10
99.297.897.49897.896.29897.496.897.6
2025.10
98.89897.498.898.696.297.496.89096.9
2025.10
98.661.43198.455.814.262.242.24.252
2025.10
98.498.89860.236.610.285.878.82866.1
2025.10
96.898.898.647.215.412.885.246.22057.9
2025.10
89.891.49099.291.826.486.482.624.475.8