|
| |
Condorcet Query Engine: A Query Engine for Coordinated Index Terms Paul E. van der Vet and Nicolaas J. I. Mars Van der Vet and Mars revive the attempts to
incorporate predicate relationship between assigned index terms during the searching process, to control for the difference between, for example, "aspirin as a cause of, and aspirin as a cure for, headache." Such
relationships, and syntaxes for their use, are defined for each indexing language and are applied to terms from specified hierarchies of indexing concepts to create coordinated concepts. This is used with the
possibility of specifying all narrower terms than the chosen concept with an ANY operator. Thus for a query one may choose a single or ANY predicate relationship, a single or ANY term as the first element of syntax, and
a single or ANY term as the second. Such indexing concepts can then be combined with Boolean operators. The system will take considerable time or space resources for concept expansion. The problem of syntax assignment
at the time of indexing is not addressed. |
485 |
| |
Derivative Bibliographic Relationships: The Work Relationship in a Global Bibliographic Database Richard P. Smiraglia and Gregory H. Leazer Using OCLC Online Computer Library
Center's WorldCat, the proportions of works in families whose members consist of multiple editions, translations, amplifications, extractions, adaptions, accompanying material, and performances, were investigated by
Smiraglia and Leazer. From a random sample of 1,000 records a final sample of 477 progenitor records was culled, and then WorldCat was searched for derivative records. Derivative works, and thus families greater than
one, existed for one-third of the sample. Family size ranged from 2 to 45 with a mean of 3.54, or 1.77 if single-member families are included. Two-thirds of the observed derivations are controlled with collocating
headings. Discipline, form, and genre do not affect derivation. Families seem to reach full size soon after publication of the progenitor, although older ancestors have large families. |
493 |
| |
Cyberbrowsing: Information Customization on the Web Hal Berghel, Daniel Berleant, Thomas Foy, and Marcus McGuire Customization for Berghel et al. is the
personalization of information bearing items by extraction, interaction, and nonprescriptive nonlinear traversal on a client's machine. Cyberbrowser is a browser add-on which allows the analysis of retrieved items by
preforming frequency count-based keyword selection, displaying the keywords with frequencies, and selecting sentences with chosen keywords present. |
505 |
| |
Hierarchical Concept Indexing of Full-Text Documents in the Unified Medical Language System[register mark] Information Sources Map Lawrence W. Wright, Holly K. Grossetta
Nardini, Alar R. Aronson, and Thomas C. Rindflesch Using Health Services/Technology Assessment Text (HSTAT) as a database, Wright et al. extracted four HSTAT files with material on breast cancer and containing 66
distinct documents. By using the available SGML tags, chapter and section headings were located and used to divide the documents into parts while retaining its hierarchical structure. Using MetaMap, which translates
medical text to UMLS Metathesaurus terms and ranks these by occurrence, specificity, and position, terms which are less accurate than human indexing but superior to purely extracted terms are chosen using the document
fragments. Since both the whole document and its sections are represented, the resulting index is hierarchical in nature. Of the MetaMap-generated MeSH terms, 60% were not in the current indexing of HSTAT, and MMI
produced results similar to that of the HSTAT search facility--except that MMI could bring in larger sections or whole documents, rather than fine sections alone. |
512 |
| |
Stemming Methodologies Over Individual Query Words for an Arabic Information Retrieval System Hani Abu-Salem, Mahmoud Al-Omari, and Martha W. Evens A stem in Arabic
is a root verb form combined with derivational morphemes but with affixes removed. Abu-Salem et al. choose to use a word, a stem, or a root for a query term based upon which form has the highest average inverse document
frequency value, a method necessitating the creation of a three-field term record. Using 120 documents and 32 queries provided by users who also provided relevance judgements, this mixed stemming method was compared to
the individual forms alone using binary weighting, and inverse document frequency weighting. The root with weighting method was the superior. The mixed stemming improved binary weighting search results in all cases but
did not increase performance over weighted stems or roots. |
524 |
| |
An Experiment on Node Size in a Hypermedia System Su Hee Kim and Caroline M. Eastman For Kim and Eastman nodes are that material that can be viewed by scrolling in a
hypermedia system without using a link. Node size can be viewed as required storage, window size for viewing, or logical size, i.e., the number of characters, words, lines, or other items presented to the viewer. To
determine if window size and text length affect retrieval time, groups of 10 students searched 20 queries on each of the four possible versions of a file prepared in two card sizes and two text lengths. ANOVA does not
support interaction between card size and text length, or between the two card sizes. A significant difference occurs between text lengths, where longer text provides quicker results. |
530 |
| |
Faculty Perceptions of Electronic Journals as Scholarly Communication: A Question of Prestige and Legitimacy Cheri Speier, Jonathan Palmer, Daniel Wren, and Susan Hahn
Speier et al. surveyed a random sample of the business school faculty at 47 ARL universities for demographics, perceptions of promotion and tenure, and familiarity with electronic publishing. A 22% return rate
yielded 300 usable surveys. Only 16% had read electronic journals and only 7% had submitted a paper to one. Youth and a high level of publishing are associated with awareness of electronic journals. Finance and MIS
faculty are significantly more aware of electronic journals than management, marketing, or operations management. The view toward the value of e-journals appears to be negative, or at best neutral, when compared to
paper journals. |
537 |
| |
Activity of Understanding a Problem during Interaction with an "Enabling" Information Retrieval System: Modeling Information Flow Charles Cole Cole derives a new
model of communication by combining Shannon's model with the three world model of Popper. He stresses the two-way feedback operations that reoccur as conjectures and refutations continue toward a conclusion governed by
the cognitive state of the user who serves repeatedly as both destination and source in Shannon's sense. |
544 |
|