Dual-Margin Embedding for Fine-Grained Long-Tailed Plant Taxonomy

About

Taxonomic classification of ecological families, genera, and species underpins biodiversity monitoring and conservation. Existing computer vision methods typically address fine-grained recognition and long-tailed learning in isolation. However, additional challenges such as spatiotemporal domain shift, hierarchical taxonomic structure, and previously unseen taxa often co-occur in real-world deployment, leading to brittle performance under open-world conditions. We propose TaxoNet, an embedding learning framework with a theoretically grounded dual-margin objective that reshapes class decision boundaries under class imbalance to improve fine-grained discrimination while strengthening rare-class representation geometry. We evaluate TaxoNet in open-world settings that capture co-occurring recognition challenges. Leveraging diverse plant datasets, including Google Auto-Arborist (urban tree imagery), iNaturalist (Plantae observations across heterogeneous ecosystems), and NAFlora-Mini (herbarium collections), we demonstrate that TaxoNet consistently outperforms strong baselines, including multimodal foundation models.

Cheng Yaw Low, Heejoon Koo, Jaewoo Park, Meeyoung Cha• 2025

Related benchmarks

Task	Dataset	Result
Species Classification	iNat-Plantae (test)	Macro Recall (Overall)83.21	13
Image Classification	AA-Central (test)	Top-1 Accuracy0.9212	10
Image Classification	AA-West (test)	Top-1 Recall85.94	10
Image Classification	AA-East (test)	R@184.53	10
Image Classification	NAFlora Mini (test)	R@191.52	7
Genus Classification	iNat-Plantae (test)	Macro Recall (Head 10)96.53	6

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord