Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TSQueryBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Independent ScoringTSQueryBench
Robustness: Linear Spike0.96
3
Explanation GenerationTSQueryBench (test)
Linear Spike Score0.94
3
Relative RankingTSQueryBench
Linear Spike Robustness Score96
3
Showing 3 of 3 rows