
Toward Machine Understanding of Information Quality

Rong Tang, Kwong Bor Ng, Tomek Strzalkowski, & Paul B. Kantor

Presented at ASIST 2003 Annual Meeting -- Humanizing Information Technology: From Ideas to Bits and Back (ASIST AM 03 2003), Westin Long Beach, Long Beach, California, October 20 - 23, 2003


In this paper we report preliminary results of a study to develop, and subsequently automate, new metrics for assessing information quality in text documents, particularly news. Through focus group studies, quality judgment experiments, and textual feature extraction and analysis, we generated nine quality aspects and applied them in human assessments. Experts and students participated in quality experiments, during which 1000 TREC documents were evaluated by participants at two sites, Albany and Rutgers. The data showed good inter-judge agreement between judges from the two sites. Principal component analysis revealed that the nine aspects form two clusters, "content" and "presentation." Automatic quality prediction was derived from a statistical analysis of the association between textual features and human quality judgments.
