Climplicit: Climatic Implicit Embeddings for Global Ecological Tasks
About
Deep learning on climatic data holds potential for macroecological applications. However, its adoption remains limited among scientists outside the deep learning community due to storage, compute, and technical expertise barriers. To address this, we introduce Climplicit, a spatio-temporal geolocation encoder pretrained to generate implicit climatic representations anywhere on Earth. By bypassing the need to download raw climatic rasters and train feature extractors, our model uses x3500 less disk space and significantly reduces computational needs for downstream tasks. We evaluate our Climplicit embeddings on biomes classification, species distribution modeling, and plant trait regression. We find that single-layer probing our Climplicit embeddings consistently performs better or on par with training a model from scratch on downstream tasks and overall better than alternative geolocation encoding models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Regression | California Housing | -- | 71 | |
| Biomes Classification | Biomes | F1 Score78.2 | 9 | |
| Median Income Regression | US County-level Median Household Income USDA 2021 | R² (%)45 | 9 | |
| Plant Traits Regression | Plant traits | R²78.6 | 9 | |
| Plant Traits Regression | Plant traits single-layer probing | R² (%)70 | 9 | |
| Species Distribution Modeling | SDM | Accuracy3.7 | 9 | |
| Biomes Classification | Biomes single-layer probing | F1 Score78.4 | 9 | |
| Species Distribution Modeling | SDM single-layer probing | Accuracy (%)3.2 | 9 | |
| Population Density Regression | US Population Density | R-squared (%)67 | 9 |