nnLandmark: A Self-Configuring Method for 3D Medical Landmark Detection

About

Landmark detection is central to many medical applications, such as identifying critical structures for treatment planning or defining control points for biometric measurements. However, manual annotation is labor-intensive and requires expert anatomical knowledge. While deep learning shows promise in automating this task, fair evaluation and interpretation of methods in a broader context are hindered by limited public benchmarking, inconsistent baseline implementations, and non-standardized experimentation. To overcome these pitfalls, we present nnLandmark, a self-configuring framework for 3D landmark detection that combines tailored heatmap generation, loss design, inference logic, and a robust set of hyperparameters for heatmap regression, while reusing components from nnU-Net's underlying self-configuration and training engine. nnLandmark achieves state-of-the-art performance across five public and one private dataset, benchmarked against three recently published methods. Its out-of-the-box usability enables training strong landmark detection models on new datasets without expert knowledge or dataset-specific hyperparameter tuning. Beyond accuracy, nnLandmark provides both a strong, common baseline and a flexible, standardized environment for developing and evaluating new methodological contributions. It further streamlines evaluation across multiple datasets by offering data conversion utilities for current public benchmarks. Together, these properties position nnLandmark as a central tool for advancing 3D medical landmark detection through systematic, transparent benchmarking, enabling to genuinely measure methodological progress. The code is available on GitHub: https://github.com/MIC-DKFZ/nnLandmark

Alexandra Ertl, Stefan Denner, Robin Peretzke, Shuhan Xiao, David Zimmerer, Maximilian Fischer, Markus Bujotzek, Xin Yang, Peter Neher, Fabian Isensee, Klaus H. Maier-Hein• 2025

Related benchmarks

Task	Dataset	Result
Biometry measurement	FeTA (test)	Biometry Error0.0201	7
Biometry measurement	LFC (test)	Biometry Error1.17	7
Landmark Localisation	MML (test)	MRE1.36	7
Landmark Localisation	AFIDs (test)	MRE1.46	7
Landmark Localisation	Fetal pose (test)	MRE3.05	7
Landmark Localisation	PDDCA (test)	MRE2.51	7
Landmark Localization	FeTA (test)	MRE2.71	7
Landmark Localization	LFC (test)	MRE3.72	7

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord