Approximate Phrase Searching: Movie Scripts and Song Lyrics

Kathryn Patterson and Carolyn Watters

ASIS&T 2008 Annual Meeting (AM08 2008)
Columbus, Ohio, October 24-29, 2008


Search engines provide an effective means of retrieving a document in which a piece of text occurs when the query contains infrequently occurring terms or is known to be an exact phrase. However, phrase queries usually contain common terms, including determiners, and users may not remember phrases exactly. Search engines discard common terms or assign them little importance, which may lead to poor retrieval results. In this paper, we examine the use of proximity-based phrase searching to search for quotes from song lyrics and movie scripts and compare the results against those of search engines. An improvement of over 25% on search engine results shows that an additional search method to complement the common search engine methods would be beneficial for this task.
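
To make the idea of proximity-based matching concrete, the following is a minimal illustrative sketch and not the method evaluated in the paper. It assumes a simple scoring rule: slide a window slightly wider than the query over the document and report the largest fraction of query terms, common words included, found in any one window. The function names `tokenize` and `best_window_score` and the `slack` parameter are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep every word, including determiners and other
    # common terms that a search engine might discard.
    return re.findall(r"[a-z0-9']+", text.lower())


def best_window_score(query, document, slack=2):
    # Assumed scoring rule (illustrative only): the best fraction of query
    # terms found together in a window of len(query) + slack tokens.
    q = tokenize(query)
    d = tokenize(document)
    if not q or not d:
        return 0.0
    q_counts = Counter(q)
    width = len(q) + slack
    best = 0
    # If the document is shorter than the window, examine it once as a whole.
    for start in range(max(1, len(d) - width + 1)):
        window = Counter(d[start:start + width])
        overlap = sum(min(count, window[term]) for term, count in q_counts.items())
        best = max(best, overlap)
    return best / len(q)
```

Under this assumed rule, an exactly remembered quote scores 1.0, a quote with one misremembered word still scores highly, and query terms that are scattered far apart in the document score low because no single window contains them all.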

