|
|
TCDL Bulletin What Type of Page is This? Genre as Web DescriptorMark A. Rosso With the advent of large, digital collections of heterogeneous documents (namely the World Wide Web, and large sub-collections thereof), attention to the prospect of using document genre to constrain document search has increased. A major issue to be resolved in this endeavor is the identification of what document categories should be used as the genres. As genre is a "folk typology", document categories must enjoy widespread recognition by their intended user groups, in order to qualify as genres. This poster briefly describes a series of three user studies aimed at developing a set of web page categories that are widely recognized as genre. Results, in the first study of this scale, showed that on average, over 70% of the final study's 257 participants agreed on the genre of the 55 webpages shown to them, having chosen from a typology of 18 genres. Further details on this study, and a subsequent study exploring the usefulness of annotating web search engine results with webpage genre, are reported in: Rosso, M. (2005) Using Genre to Improve Web Search. Ph.D. dissertation, University of North Carolina at Chapel Hill. <http://ils.unc.edu/~rossm/Rosso_dissertation.pdf>.
© Copyright 2005 Mark A. Rosso Top | Contents |