
LLM data comprehension on Real-world data

74.3 JSON Validity Rate

GPT-5.1-codex

[Chart: JSON Validity Rate; Apr 7, 2026]
Updated 11d ago

Evaluation Results

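Method | Links
2026.04 | 74.3 | 71.4 | 2.9 | 35
2026.04 | 71.4 | 62.9 | 8.6 | 35
2026.04 | 71.4 | 71.4 | 0 | 35
2026.04 | 68.6 | 68.6 | 0 | 35
2026.04 | 62.9 | 68.6 | 5.7 | 35
2026.04 | 60 | 57.1 | 2.9 | 35
59.1 | 59.4 | 0.3 | 350
2026.04 | 54.3 | 54.3 | 0 | 35
2026.04 | 45.7 | 48.6 | 2.9 | 35
2026.04 | 42.9 | 45.7 | 2.9 | 35
2026.04 | 40 | 45.7 | 5.7 | 35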
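
The headline score of 74.3 is consistent with 26 valid responses out of 35 (26/35 ≈ 74.3%). The sketch below shows one way such a rate could be computed, assuming the metric is the share of model outputs that parse as JSON and that each model was evaluated on 35 items. Neither assumption is stated on this page, and the function name `json_validity_rate` and the sample strings are hypothetical.

```python
import json


def json_validity_rate(outputs):
    """Return the percentage of strings that parse as JSON, to one decimal.

    Assumption: "valid" means json.loads accepts the whole string.
    The benchmark's real criterion may be stricter (e.g. schema checks).
    """
    if not outputs:
        return 0.0
    valid = 0
    for text in outputs:
        try:
            json.loads(text)
            valid += 1
        except (json.JSONDecodeError, TypeError):
            # Malformed JSON or a non-string response counts as invalid.
            pass
    return round(100 * valid / len(outputs), 1)


# Hypothetical batch: 26 parseable responses and 9 truncated ones (35 total).
sample = ['{"ok": true}'] * 26 + ['{"ok": tru'] * 9
print(json_validity_rate(sample))  # 74.3
```

Under the same assumptions, the other listed scores also correspond to whole counts out of 35, for example 25/35 ≈ 71.4 and 14/35 = 40.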