Facets and measures of Gene Ontology annotation quality in model organism databases

W. John MacMullen

ASIS&T Annual Meeting - 2006 (ASIS&T 2006)
Austin, Texas, November 3-9, 2006


Model organism databases are important repositories of data and information for biomedical research, but they are useful to scientists only if the information they contain meets certain levels of quality. This methodology paper describes five facets of information quality applicable to model organism databases and defines corresponding metrics for measuring the quality of annotations made by one or more database curators. The facets and their measures are consistency, reliability, specificity, completeness, and accuracy. Qualitative factors, and factors affecting internal and external validity, are also discussed.

