
SIG

Benchmarks

Task Name                    Dataset Name   SOTA Result              Trend
Bivariate Causal Discovery   Sig            Accuracy: 90             23
Bivariate Causal Discovery   Sig            Accuracy: 90             10
RAG Robustness Evaluation    SIG Trivial    Style Robustness: 90.5   4