Dual-Margin Embedding for Fine-Grained Long-Tailed Plant Taxonomy
About
Taxonomic classification of ecological families, genera, and species underpins biodiversity monitoring and conservation. Existing computer vision methods typically address fine-grained recognition and long-tailed learning in isolation. However, additional challenges such as spatiotemporal domain shift, hierarchical taxonomic structure, and previously unseen taxa often co-occur in real-world deployment, leading to brittle performance under open-world conditions. We propose TaxoNet, an embedding learning framework with a theoretically grounded dual-margin objective that reshapes class decision boundaries under class imbalance to improve fine-grained discrimination while strengthening rare-class representation geometry. We evaluate TaxoNet in open-world settings that capture co-occurring recognition challenges. Leveraging diverse plant datasets, including Google Auto-Arborist (urban tree imagery), iNaturalist (Plantae observations across heterogeneous ecosystems), and NAFlora-Mini (herbarium collections), we demonstrate that TaxoNet consistently outperforms strong baselines, including multimodal foundation models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Species Classification | iNat-Plantae (test) | Macro Recall (Overall)83.21 | 13 | |
| Image Classification | AA-Central (test) | Top-1 Accuracy0.9212 | 10 | |
| Image Classification | AA-West (test) | Top-1 Recall85.94 | 10 | |
| Image Classification | AA-East (test) | R@184.53 | 10 | |
| Image Classification | NAFlora Mini (test) | R@191.52 | 7 | |
| Genus Classification | iNat-Plantae (test) | Macro Recall (Head 10)96.53 | 6 |