Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Legal Reasoning on CaseHOLD (test)
Loading...
88.14
Test Accuracy
LLM2LLM
9.2456
29.7278
50.21
70.6922
Mar 22, 2024
Test Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Test Accuracy
LLM2LLM
% Data=100, # Seed Exa...
2024.03
88.14
Baseline
% Data=100, # Seed Exa...
2024.03
87.94
LLM2LLM
% Data=50, # Seed Exam...
2024.03
82.92
Baseline
% Data=50, # Seed Exam...
2024.03
80.39
LLM2LLM
% Data=20, # Seed Exam...
2024.03
78.97
LLM2LLM
% Data=10, # Seed Exam...
2024.03
78.21
Baseline
% Data=20, # Seed Exam...
2024.03
78
Baseline
% Data=10, # Seed Exam...
2024.03
77.03
LLM2LLM
% Data=5, # Seed Examp...
2024.03
76.83
LLM2LLM
% Data=2, # Seed Examp...
2024.03
74.97
Baseline
% Data=5, # Seed Examp...
2024.03
74.14
LLM2LLM
% Data=1, # Seed Examp...
2024.03
70.97
Baseline
% Data=2, # Seed Examp...
2024.03
69.44
LLM2LLM
% Data=0.5, # Seed Exa...
2024.03
66.5
Baseline
% Data=1, # Seed Examp...
2024.03
46.25
Baseline
% Data=0.5, # Seed Exa...
2024.03
33.94
Baseline
% Data=0, # Seed Examp...
2024.03
12.28
Feedback
Search any
task
Search any
task