Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Relevance Classification on Chinese Conversational AI Financial Services (test)
Loading...
95
Accuracy
Latent Model
77.32
81.91
86.5
91.09
Jan 6, 2026
Accuracy
Recall
F1-Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Recall
F1-Score
Latent Model
aggregation=Query-Adap...
2026.01
95
94
95
BGE@0.85
threshold=0.85
2026.01
83
80
82
Majority Vote
2026.01
80
78
75
Single LLM
2026.01
78
75
72
Feedback
Search any
task
Search any
task