Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SCOpE-QA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Insight GenerationSCOpE-QA (Set-level)
Inference Optimization Score4.45
20
Insight-level EvaluationSCOpE-QA Graph ML collection
Insight Score4.18
20
Insight-level EvaluationSCOpE-QA Quantization collection
Insight Score4.42
20
Insight-level EvaluationSCOpE-QA Reinforcement Learning collection
Insight-level Score4.52
20
Insight-level EvaluationSCOpE-QA Dialogue Systems collection
Insight Score4.43
20
Insight-level EvaluationSCOpE-QA Legal NLP collection
Insight Score4.27
20
Insight-level EvaluationSCOpE-QA LLM for Healthcare collection
Insight-level Score4.32
20
Insight-level EvaluationSCOpE-QA
Insight-level Score4.42
20
Insight-level EvaluationSCOpE-QA Ethical Bias & Fairness collection
Insight-level Score4.5
20
Insight-level EvaluationSCOpE-QA Data Augmentation
Insight-level Score4.39
20
Insight-level EvaluationSCOpE-QA Low-Resource NLP collection
Insight-level Score4.43
20
Insight-level EvaluationSCOpE-QA Interpretability collection
Insight Score4.19
20
Insight-level EvaluationSCOpE-QA Hate Speech Detection
Insight-level Score4.33
20
Insight-level EvaluationSCOpE-QA Video Segmentation collection
Insight Score4.47
20
Insight-level EvaluationSCOpE-QA Social Computing collection
Insight-level Score4.52
20
Insight-level EvaluationSCOpE-QA Long Video Understanding collection
Insight-level Score4.37
20
Insight-level EvaluationSCOpE-QA Representation Learning collection
Insight Score4.33
20
Insight-level EvaluationSCOpE-QA Long-context RAG collection
Insight Score4.48
20
Insight-level EvaluationSCOpE-QA Preference Optimization collection
Insight-level Score4.47
20
Insight-level EvaluationSCOpE-QA LLM as Agents collection
Insight Score4.35
20
Insight-level EvaluationSCOpE-QA Inference Optimization collection
Insight-level Score4.49
20
Document-grounded related insight recommendationSCOpE-QA
Inference Optimization4.21
8
Showing 22 of 22 rows