DOI: https://doi.org/10.15368/theses.2012.1
Available at: https://digitalcommons.calpoly.edu/theses/682
Date of Award
1-2012
Degree Name
MS in Computer Science
Department/Program
Computer Science
Advisor
Franz Kurfess
Abstract
Document classification is used to sort and label documents. This gives users quicker access to relevant data. Users that work with large inflow of documents spend time filing and categorizing them to allow for easier procurement. The Automatic Classification and Document Filing (ACDF) system proposed here is designed to allow users working with files or documents to rely on the system to classify and store them with little manual attention. By using a system built on Hidden Markov Models, the documents in a smaller desktop environment are categorized with better results than the traditional Naive Bayes implementation of classification.