Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Three-Level Classification on JD search logs 1.0 (test)
Loading...
82.58
Macro F1
K-CARE w/ SCA + APR (Proposed)
51.7544
59.7572
67.76
75.7628
Apr 28, 2026
Macro F1
Weighted F1
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro F1
Weighted F1
Accuracy
K-CARE w/ SCA + APR (Proposed)
Framework=SCA + Analog...
2026.04
82.58
87.99
88.05
K-CARE w/ SCA (QSAP+PSAQ+TGKI)
Framework=Symmetrical...
2026.04
82.39
87.85
87.94
K-CARE w/ SCA (QSAP+PSAQ)
Framework=Symmetrical...
2026.04
82.16
87.66
87.73
K-CARE w/ SCA (QSAP)
Framework=Symmetrical...
2026.04
82
87.53
87.61
LLM GRPO
Backbone=Qwen3-8B, Mod...
2026.04
81.18
87.04
87.17
LLM SFT
Backbone=Qwen3-8B, Mod...
2026.04
80.97
86.82
86.95
LLM Base
Backbone=Qwen3-8B, Mod...
2026.04
52.94
60.29
59.73
Feedback
Search any
task
Search any
task