Available at: https://digitalcommons.calpoly.edu/theses/3311
Date of Award
6-2026
Degree Name
MS in Statistics
Department/Program
Statistics
College
College of Science and Mathematics
Advisor
Kelly Bodwin
Advisor Department
Statistics
Advisor College
College of Science and Mathematics
Abstract
Humpback whale songs are notoriously complex. Identification of humpback whale song units requires bioacousticians to tediously listen, analyze, and annotate collected sound data. Even sparse data requires listening to the entirety of the collected acoustic data. In this study, three hours of audio containing over one-thousand humpback whale song units was collected in Monterey Bay, California.
Prior studies have seen success using convolutional neural networks by performing image classification on hundreds of hours worth of spectrograms. Our study uses traditional machine learning models, as they are less computationally demanding, and require less data.
We use time splitting and Mel-frequency cepstrum to transform complex, continuous sound data into a set of trainable, numerical coefficients. We capture the complexity of humpback songs by creating observations based on the evolution of a song, called context windows. We apply two non-parametric machine learning models, k-nearest neighbors, and random forests, to our created observations. Results are tested in two manners, per-context-window and per-vocalization.
Context windows with more time steps avoid overfitting, produces better per-window metrics, and is more successful at identifying individual calls in a high presence dataset. Model selection, parameter tuning, and context window adjusting yield observation-based F1 scores up to 0.824, and identify over 98% of vocalizations.
Included in
Applied Statistics Commons, Biostatistics Commons, Data Science Commons, Statistical Methodology Commons