Share your thoughts, 1 month free Claude Pro on usSee more

Legal Reasoning on LegalBench (accuracy)

90Accuracy

evaluation-instructed prompt optimization

Updated 1mo ago

Evaluation Results

Method	Links
evaluation-instructed prompt optimization 2025.11		90
Pro-Refine 2025.11		86
TextGrad 2025.11		84
APE 2025.11		84
LLM only 2025.11		83
Self-Refine 2025.11		81
evaluation-instructed prompt optimization 2025.11		70
evaluation-instructed prompt optimization 2025.11		69
Self-Refine 2025.11		63
Pro-Refine 2025.11		63
Self-Refine 2025.11		63
Pro-Refine 2025.11		63
APE 2025.11		61
TextGrad 2025.11		58
TextGrad 2025.11		58
LLM only 2025.11		56
LLM only 2025.11		55
APE 2025.11		55