ASIST AM 03 2003 START ConferenceManager    

Haystacks and Hypotheses

Jeffrey Demaine, Joel Martin, Berry De Bruijn

Presented at ASIST 2003 Annual Meeting -- Humanizing Information Technology: From Ideas to Bits and Back (ASIST AM 03 2003), Westin Long Beach, Long Beach, California, October 20 - 23, 2003


This paper describes the EurekaSeek bibliometric technique for automated linked-literature analysis. The MEDLINE database of biomedical literature is iteratively searched in order to identify research opportunities in the form of conceptual linkages between terms.

As a tool for identifying undiscovered public knowledge, EurekaSeek is a variation on the techniques of Swanson and Smalheiser. EurekaSeek uses medical subject headings instead of text analysis in an automated search process, eliminating the reliance on expert input during the process of linking literatures.

The EurekaSeek process is tested by retroactively examining the co-occurrence of terms in the published literature. The hypothesis tested in this paper is whether this tool, had it existed in the past, could have identified conceptual linkages that occurred only later in the literature. EurekaSeek is compared against a process that considers all potential term-to-term relationships. The list of terms that EurekaSeek produces is a subset of all potential linked literature terms. The experiment shows that EurekaSeek produces a higher percentage of likely hypotheses than when all terms are considered. While the proportion of identified linkages generated is still too small for the process to be a practical aid to research, statistically significant results were achieved.

Server START Conference Manager (V2.45.2)
Update Time 29 May 2003 at 07:31:04
Start Conference Manager
Conference Manager