Master's Theses

A Strategy Oriented, Machine Learning Approach to Automatic Quality Assessment of Wikipedia Articles

Gabriel De La Calzada, California Polytechnic State University, San Luis ObispoFollow

DOI: https://doi.org/10.15368/theses.2009.32
Available at: https://digitalcommons.calpoly.edu/theses/306

Date of Award

4-2009

Degree Name

MS in Computer Science

Department/Program

Computer Science

Advisor

Alexander Dekhtyar

Abstract

This work discusses an approach to modeling and measuring information quality of Wikipedia articles. The approach is based on the idea that the quality of Wikipedia articles with distinctly diﬀerent proﬁles needs to be measured using diﬀerent information quality models. To implement this approach, a software framework written in the Java language was developed to collect and analyze information of Wikipedia articles.

We report on our initial study, which involved two categories of Wikipedia articles: ”stabilized” (those, whose content has not undergone major changes for a signiﬁcant period of time) and ”controversial” (articles that have undergone vandalism, revert wars, or whose content is subject to internal discussions between Wikipedia editors). In addition, we present simple information quality models and compare their performance on a subset of Wikipedia articles with the information quality evaluations provided by human users. Our experiment shows that using special-purpose models for information quality captures user sentiment about Wikipedia articles better than using a single model for both categories of articles.

Download

COinS

Master's Theses

A Strategy Oriented, Machine Learning Approach to Automatic Quality Assessment of Wikipedia Articles

Date of Award

Degree Name

Department/Program

Advisor

Abstract

Search

Browse

Author Corner

LINKS

Master's Theses

A Strategy Oriented, Machine Learning Approach to Automatic Quality Assessment of Wikipedia Articles

Author

Date of Award

Degree Name

Department/Program

Advisor

Abstract

Share

Search

Browse

Author Corner

LINKS