Preprint. Published in Proceedings MSR'2005: Second International Workshop on Mining Software Repositories at ISCE'05: St. Louis, Missouri, May 1, 2005, pages 58-62.
Copyright © ACM 2005. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Proceedings MSR'2005.
NOTE: At the time of publication, the author Alexander Dekhtyar was not yet affiliated with Cal Poly.
The mining of textual artifacts is requisite for many important activities in software engineering: tracing of requirements; retrieval of components from a repository; location of manpage text for an area of question, etc. Many such activities leave the “final word” to the analyst – have the relevant items been retrieved? are there other items that should have been retrieved? When analysts become a part of the text mining process, their decisions on the relevance of retrieved elements impact the final outcome of the activity. In this paper, we undertook a pilot study to examine the impact of analyst decisions on the final outcome of a task.