Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OPeRA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Next-action predictionOPeRA (test)
Action Generation Acc52.92
18
Lung function regressionOPERA (test)
FVC MAE (Breath)0.848
7
Health condition inferenceOPERA Obstructive (Lung)
AUROC75.2
7
Health condition inferenceOPERA Smoker Cough
AUROC0.83
7
Reasoning and Persona ConsistencyOPeRA (test)
Pages per Session5.3
7
Autonomous LLM Agent VerificationOPERA
Mean Td (s)11.9
3
Human-likeness evaluationOPeRA
Metric-
0
Showing 7 of 7 rows