Mixed Dataset

Benchmarks

Task Name	Dataset Name	SOTA Result
Uncertainty Quantification	Mixed Dataset (real and fake biographies)	ROC AUC0.9001	32
Idiomatic Translation	Mixed Dataset en-bn	LLM-eval Score2.25	18
Offline Reinforcement Learning	Mixed Dataset Aggregate	Normalized Reward62.2	12
Polyp Segmentation	Mixed Dataset	Dice84.78	11
Idiomatic Translation	Mixed Dataset en-te	LLM-eval Score1.83	10
Idiomatic Translation	Mixed Dataset en-ta	LLM-eval Score1.87	10
Idiomatic Translation	Mixed Dataset en-hi	LLM-eval Score2.39	10
Phishing Detection	Mixed dataset	Accuracy96.66	2

Showing 8 of 8 rows