Genre as Web Descriptor

Mark A. Rosso

Sparking Synergies: Bringing Research and Practice Together @ ASIST '05 (ASIS&T 2005)
Westin Charlotte, Charlotte, North Carolina, October 28 - November 2, 2005


With the advent of large digital collections of heterogeneous documents (namely the World Wide Web and large sub-collections thereof), interest in using document genre to constrain document search has increased. A major issue to be resolved in this endeavor is identifying which document categories should be used as the genres. Because genre is a “folk typology,” document categories must enjoy widespread recognition by their intended user groups in order to qualify as genres. This paper briefly describes a series of three user studies aimed at developing a set of web page categories that are widely recognized as genres. The results, from the first study of its kind, showed that, on average, over 70% of the final study’s 257 participants agreed on the genre of the 55 web pages shown to them, choosing from a typology of 18 genres.

