**Fig. 1** Here are the areas on the Seward Peninsula in Alaska where we used field and remote sensing data to map the extent of near-surface permafrost. The photos show the type of vegetation we see at the sites. Those shrubs can be seen in the Lidar point cloud data by making models of the canopy. (You can open the image in a new tab to view a larger version)

Using machine learning models to make high-resolution maps of permafrost extent

First of all, what is permafrost? Strictly, permafrost is any ground that is frozen for at least two consecutive years. In our work, we only consider the permafrost within the upper 120 cm of the ground surface, which we call "near-surface permafrost." We're interested in making high-resolution maps (<5 m spatial resolution) because 1) when permafrost thaws, the ground can sink rapidly and cause damage to nearby or overlying infrastructure, and 2) permafrost soils hold a ton of carbon which can be released during thaw, and we want to know how much of the landscape has already thawed.

**Fig. 2** Top row shows the predicted permafrost extent for the at-a-site predictions, and the bottom shows the predictions for the transferred models. The patterns are pretty similar for both sets of models. (You can open the image in a new tab to view a larger version)

We wanted to test how well different machine learning architectures (models) can be trained to make maps of near-surface permafrost extent. For ground truth data, we used some geophysical measurements and other observations like soil pits and frost probes. We used lidar point clouds to make models of the vegetation height across the landscape, digital elevation models, and several topographic metrics for each of the sites. We also used high-resolution satellite imagery to map out the vegetation greenness using a metric called the normalized difference vegetation index (NDVI) and also used our Blue Snow Threshold to make an estimate of snow-cover duration for every pixel across the study sites. We used all of those remote sensing data and the ground truth observations to train a random forest model (ERTr), a support vector machine model (SVM), and an artificial neural network model. We also wanted to know how well a model trained at one site can predict permafrost extent at a site the model hasn't seen before (this is model transferability). Since we have three sites, we tested transferability by training the model on a combination of two sites and then testing that trained model on the third site.

**Fig. 3** Feature importances for the random forest (ERTr) and support vector machine (SVM) models for each of the three sites. The larger the bar, the more important the feature. Yep, NDVI is a pretty important feature to include in models of permafrost extent, at least for these sites. (You can open the image in a new tab to view a larger version)

How well did the models do?

The random forest models did the best job at predicting permafrost extent for sites that the model had seen (what we call at-a-site prediction), with accuracies up to 83%. But, those models didn't transfer that well to sites it hadn't seen. The support vector machines were the better performing architecture for the transferred models, getting accuracies up to 78%. Turns out, with the right set of features and enough ground truth training data, machine learning does pretty darn well at mapping near-surface permafrost.

Which features are the most important to have for prediction?

A very cool aspect of machine learning is learning which of the features (in this case remote sensing variables) are really driving the prediction. We call these the important features. At these sites, NDVI is consistently one of the most important features, probably because plants are sensitive to the soil conditions and are a good indicator of frozen versus unfrozen soil. If you want to dig deeper into how we built and interpret these models, check out the paper published in Earth and Space Science (Thaler et al., 2023).