Role-playing agent evaluation on LLM Court 5 legal scenarios 1.0 (test)

92.5 QS d BRF Score

Llama-3.1-8B

[Chart: score axis 72.22 to 88.015; Apr 13, 2026]
Updated 1mo ago

Evaluation Results

Method  Links
2026.04  92.53
2026.04  89.30
2026.04  89.31
2026.04  82.30
2026.04  813
2026.04  760
2026.04  730