| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MFA-Labeled Raw (test) | Qwen3-ForcedAligner-0.6B | AAS Latency (Avg)42.9 | 8 | 4d ago | |
| MFA-labeled Long-form (test) | LLM-ForcedAligner | Average Alignment Value52.9 | 4 | 4d ago | |
| Human-Labeled (test) | Avg. RTF0.0067 | 4 | 4d ago | ||
| MFA-Labeled Concat-300s (test) | Qwen3-ForcedAligner-0.6B | AAS (Avg) [ms]52.9 | 4 | 4d ago | |
| human-labeled Chinese datasets (Mixed-300s) | Monotonic-Aligner | AAS410.8 | 3 | 4d ago | |
| human-labeled Chinese datasets (Mixed-60s) | AAS86.7 | 3 | 4d ago | ||
| human-labeled Chinese datasets (Raw-Noisy) | AAS0.895 | 3 | 4d ago | ||
| human-labeled Chinese datasets (Raw) | AAS88.6 | 3 | 4d ago | ||
| Randomly selected audio files and transcriptions Manual Inspection | FASA | AU Count81 | 2 | 4d ago | |
| human-labeled Chinese and MFA-labeled multilingual speech Mixed-Crosslingual | LLM-ForcedAligner | AAS42.5 | 1 | 4d ago |