Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Answer Sentence Selection on PrivacyQA 1.0 (test)

0.611SAE

GPT-4o-mini Multi-agent

0.385320.443910.50250.56109Jun 3, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.06
0.6110.5950.5960.6020.5920.5940.5980.0050.019
2025.06
0.6050.5730.5620.5550.5470.5470.5650.0160.058
2025.06
0.6010.5880.5780.5870.5920.5760.5870.0070.025
2025.06
0.5820.5790.5830.5790.5660.5730.5770.0050.017
2025.06
0.5810.5490.5470.5170.5560.5410.5490.0140.064
2025.06
0.5550.5250.5230.5290.5220.5280.530.0080.033
2025.06
0.5490.5270.520.5240.5230.5260.5280.0070.029
2025.06
0.5460.4630.4690.4480.4850.4460.4760.0260.1
2025.06
0.5330.6060.5850.5810.5570.5690.5720.0190.073
2025.06
0.5320.510.5470.5290.5320.5120.5270.0110.037
2025.06
0.4690.3490.370.3250.3560.3360.3680.0350.144
2025.06
0.3940.3440.3320.3290.3120.3010.3350.0220.093