Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Semantic Similarity Evaluation on Insurance Tasks N=1500 (test)

0.869Mean BERT Cosine Similarity

DeepSeek-R1 + Fine-tune

0.7130.75350.7940.8345Feb 18, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
0.8690.1590.929179.1
0.7990.160.8570.97768.5
0.7870.1630.8440.97465.6
0.7570.170.8030.97457.5
2026.02
0.7490.1630.7930.97756.1
2026.02
0.7190.1670.762147.2