DOI: https://doi.org/10.15368/theses.2017.95
Available at: https://digitalcommons.calpoly.edu/theses/1803
Date of Award
10-2017
Degree Name
MS in Computer Science
Department/Program
Computer Science
Advisor
John Clements
Abstract
This thesis explores the challenge of automatic music transcription using a combination of digital signal processing and machine learning methods. Automatic music transcription is valuable to musicians who cannot transcribe music themselves or who find the task tedious. We start with an existing model, designed by Sigtia, Benetos, and Dixon, and extend it in a number of original ways. We find that convolutional neural networks with filter shapes better tailored to spectrogram data produce better and faster transcription results when the new model is evaluated on a dataset of classical piano music. We also find that employing better practices improves results. Finally, we open-source our test bed for pre-processing, training, and testing the models to assist future research.