A sabbatical of eagles (part 1)
What happens when you combine the increasing need for carbon-free energy with an increasing eagle population? The potential for some not-so-pretty collisions! For my sabbatical research I have been collaborating with a great group of colleagues from USGS and Conservation Science Global on some fascinating data they’ve collected from GPS devices attached to hundreds of bald eagles in the Midwest. (If you prefer listening instead of reading, I have a video presentation essentially narrating this post that I presented at the Wind Wildlife Research Meeting in December 2020.)
I was super fortunate to have my sabbatical coincide with the 2020-2021 academic year, which was right in the throes of the COVID pandemic. So I got to skip out on all the virtual teaching, whew. Major props to my colleagues here at Winona State and universities everywhere who battled through this year in the trenches; they seriously deserve two sabbaticals after the year they endured. Fortunately the pre-COVID plan already was to collaborate remotely, so the research could proceed in spite of a global pandemic.
For my sabbatical I restricted attention to GPS measurements in Iowa, of which there are over 1.7 million for a few dozen eagles over 5 or so years.
The devices record data at 1-11 second intervals while birds are in flight, and from these observations we have the following variables about each point in time:
- Velocity (Kilometers per hour)
- Angle (in radians); is the bird flying straight or in a “tortuous” path?
- Meters above ground level (AGL)
- Vertical rate (m/s): is the bird increasing or decreasing its AGL?
Our goals with these data are essentially two-pronged:
- Using eagle GPS measurements, can we come up with a way to characterize consistent flight behaviors using the GPS data?
- Given the flight behaviors we come up with, can we model what underlying land features are related to certain types of behaviors?
For the first question, I used k-means clustering to come up with classifications for every single one of the GPS points into one of k behaviors. So what is k-means clustering? Maybe the animation below will help.
In this animation we are trying to classify 2-dimensional data (with ‘X’ and ‘Y’ coordinates) into one of k=2 clusters. The algorithm iterates between assigning each data point to the cluster with the nearest centroid then recomputing the centroid. This idea extends directly to p-dimensional data.
Choosing k is a bit of an art, which I won’t go into here (we have a forthcoming manuscript you can read for more details), but for our analysis it appeared the k=5 was the appropriate choice. Showing my colleagues boxplots of the GPS variables with cluster indications got them really excited! Here’s the fun thing about working with people who know animals well: what, to me, are “clusters numbers 1, 2, 3, 4, and 5” are, to them, “ah! perching!” or, “gliding from a thermal!” So it was they who identified the following actually relevant eagle behaviors from my clusters.
What’s really cool is watching an eagle “fly” and seeing these behaviors represented in action. Take a look.
Here you can really see the different behavioral modes as defined by the k-means clustering. The blue “gliding” points tend to be straight and descending; the pink “at altitude” points occur when the bird levels out. The green and yellow flights are much more angular (“tortuous” as my eagle expert colleagues like to say), with the yellow “gaining altitude” points more obviously doing just that: going “up.” All of this with a little unsupervised learning!
You might notice the wind turbines at the bottom of the animation, which represent 250 meters AGL. This is the turbine rotor-swept zone (RSZ), within which bald eagles are potentially imperiled. What’s more, my biologist colleagues suspect that the green and yellow behaviors (“flapping” and “ascending”) are riskier to the eagle than the straighter pink (“soaring/flapping at altitude”) and blue (“gliding from thermal”) behaviors, as they are potentially more distracted in the former behaviors. These can be defined to come up with three levels of risk:
- “Low risk”: any GPS point outside the RSZ (>250m AGL)
- “Moderate risk”: any GPS point within the RSZ ($\leq$ 250m) that is also clustered into the “pink” or “blue” cluster
- “High risk”: any GPS point within the RSZ that is also clustered in the “green” or “yellow” cluster
So there we have it! A framework for classifying every eagle GPS point into one of three risk categories, based on the topological flight characteristics of that point. Next, we need to figure out if and how underlying land features can be used to predict which risk category overhead eagle points are most likely to be engaged in. But that’s for a future post!