Legal Statute Identification on ILSI random subset of 1000 cases (test)
[Chart: Micro-avg Precision over time. Best result: 46.8 (AoS model), Dec 26, 2025. Selectable metrics: Micro-avg Precision, Micro-avg Recall, Micro-avg F1, Macro-avg Precision, Macro-avg Recall, Macro-avg F1, Avg Jaccard Similarity.]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Micro-avg Precision | Micro-avg Recall | Micro-avg F1 | Macro-avg Precision | Macro-avg Recall | Macro-avg F1 | Avg Jaccard Similarity |
|---|---|---|---|---|---|---|---|---|---|
| AoS model | Evaluation subset=Rand... | 2025.12 | 46.8 | 47.2 | 47 | 37.5 | 32.8 | 33.1 | 31.9 |
| LLMPrompt | LLM Model=GPT-4o mini,... | 2025.12 | 44.8 | 36.6 | 40.3 | 34.7 | 24.6 | 26.3 | 24.7 |
| LLMPrompt | LLM Model=GPT-4o mini,... | 2025.12 | 44 | 37.5 | 40.5 | 34.7 | 25.3 | 26.9 | 25 |
| LLMPrompt | LLM Model=Mistral-7B-I... | 2025.12 | 43.6 | 25.3 | 32.1 | 32 | 16.3 | 19.5 | 18.5 |
| LLMPrompt | LLM Model=Mistral-7B-I... | 2025.12 | 42.7 | 30.9 | 35.8 | 34.7 | 20.9 | 23.7 | 21.7 |
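For readers unfamiliar with the reported metrics, the following is a minimal sketch of how micro-averaged, macro-averaged, and Jaccard scores are commonly computed for a multi-label task such as statute identification. It is not the benchmark's evaluation script. The function names `prf` and `evaluate`, the toy labels, and the choice to define Macro-avg F1 as the mean of per-label F1 scores are all assumptions made for illustration.

```python
from typing import Dict, List, Set


def prf(tp: int, fp: int, fn: int) -> Dict[str, float]:
    """Precision, recall and F1 from raw counts (0.0 where undefined)."""
    p = tp / (tp + fp) if tp + fp else 0.0
    r = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * p * r / (p + r) if p + r else 0.0
    return {"precision": p, "recall": r, "f1": f1}


def evaluate(gold: List[Set[str]], pred: List[Set[str]]) -> Dict[str, float]:
    """Hypothetical helper: multi-label scores over per-case label sets."""
    labels = sorted(set().union(*gold, *pred))
    counts = {lab: [0, 0, 0] for lab in labels}  # per label: [tp, fp, fn]
    jaccards = []
    for g, p in zip(gold, pred):
        for lab in g & p:
            counts[lab][0] += 1  # predicted and correct
        for lab in p - g:
            counts[lab][1] += 1  # predicted but not in gold
        for lab in g - p:
            counts[lab][2] += 1  # in gold but missed
        union = g | p
        # Assumption: two empty label sets count as a perfect match.
        jaccards.append(len(g & p) / len(union) if union else 1.0)

    # Micro-average: pool the counts over all labels, then compute once.
    micro = prf(*(sum(c[i] for c in counts.values()) for i in range(3)))
    # Macro-average: compute per label, then take the unweighted mean.
    per_label = [prf(*c) for c in counts.values()]
    n = len(per_label) or 1
    macro = {k: sum(m[k] for m in per_label) / n
             for k in ("precision", "recall", "f1")}

    result = {f"micro_{k}": v for k, v in micro.items()}
    result.update({f"macro_{k}": v for k, v in macro.items()})
    result["avg_jaccard"] = sum(jaccards) / len(jaccards) if jaccards else 0.0
    return result


if __name__ == "__main__":
    # Toy data only; the labels A, B, C are placeholders, not real statutes.
    gold = [{"A", "B"}, {"B"}, {"C"}]
    pred = [{"A"}, {"B", "C"}, {"A"}]
    for name, value in evaluate(gold, pred).items():
        print(f"{name}: {value:.3f}")
```

Because micro-averaging pools counts across labels, frequent statutes dominate the micro scores, while macro-averaging weights every statute equally. That is one plausible reason the macro figures in the table sit below the micro figures. The reported Micro-avg F1 values are consistent with the harmonic mean of the corresponding precision and recall: for the AoS model, 46.8 and 47.2 give roughly 47.0.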