JASIS HomeJASIS TOC's

Journal of the Association for Information Science

IN THIS ISSUE

 

Bert R. Boyce

729

RESEARCH

 

Source-Item Production Laws for the Case That Items Have Multiple Sources with Fractional Counting of Credits
L. Egghe

Egghe describes a model incorporating the fractional counting of credits for an item to its multiple sources. It appears sound and quite general. However, despite the fact that some of the more complex mathematics is in the appendices, the average reader will find that considerable concentration and above average mathematical awareness is required to complete the main text with a satisfactory level of understanding.

                       

730

 

Filtered Document Retrieval with Frequency-Sorted Indexes
Michael Persin, Justin Zobel, and Ron Sacks-Davis

Persin, Zobel, and Sacks-Davis provide some important insight into data structure for retrieval. By sorting the lists of document numbers and word counts associated with any term in an index file by the count rather than by the document number, the ability to compress the list by using run lengths from the previous document number is lost. However the need for an accumulator of similarity values for each document with other than a zero similarity value is avoided. Using thresholds based on the similarity of the currently most similar document, and computed before the processing of the list for each term, document accumulators are created, ignored, or augmented, or not, if already in existence. Small partial similarities are unlikely to change the final ranking and documents yielding such values are ignored at considerable memory saving. The sort by occurrence count means considerable reduction in processing time since the tails of the long count in document lists need not be processed. If the maximum in document frequency in the list is stored with the term, it is possible to avoid reading the list for some terms. One can regain some compression by sorting documents in the list with the same frequency by document number. Tests show no degradation in retrieval effectiveness and would permit ranked retrieval on considerably smaller machines.

                                       

749

 

Inter-Record Linkage Structure in a Hypertext Bibliographic Retrieval System
Dietmar Wolfram

Wolfram's HyperLynx system is tested on nearly 3000 NTIS document records on library and information science from 1989 to '91. Initial entry is through author, title, and descriptor indices. Each search term with multiple hits forms a circular list of those records which can be traversed. A click on a hot word (index term or author) will open a new list on that term and return to the original indices is possible at any time. The distribution of the number of times a term occurs in the file is studied, as well as the distribution of co-occurrences of different breadths, and the distribution of the number of selectable terms in a record. Terms of the largest size and of the smallest size co-occur with the greatest frequency. Terms of mid-range co-occur least frequently. A simple model of expected number of term co-occurrences, a simulation based on exhaustivity and term size, and a third model where the frequency of terms of a given size was assigned based upon terms per record, were implemented. The observed distribution is much more variable than those produced by the models, although matching behavior is apparent.                                                       

 

765

 

Journal Production and Journal Impact Factors
Ronald Rousseau and Guido Van Hooydonk

Rousseau and Van Hooydonk find that while review and translation journals work quite differently, normal journals exhibit a linear relationship between production and global impact. The fields of Mathematics and Chemistry do not appear to follow this general rule.

775

BRIEF COMMUNICATIONS

 

Linguistic Laws and Computer Programs
Peter Kokol and Tatjana Kokol

The Kokol's find that counting the occurrences of reserved words and operators in multiple programs written in Fortran, C, and C++, results in overall rank frequency distributions that follow Zipf's law. While all programming language curves fit the Zipf model significantly, C++ correlates less strongly with the predicted distribution. Thus linguistic laws may well be candidates for the design of new software metrics. 

 

781

 

Expertise and the Perception of Shape in Information
Andrew Dillon and Dille Schaap

Dillon and Schaap are concerned with readers' ability to recognize the structural portion of a paper being read. Forty-eight subjects viewed paragraphs of text from published papers and allocated them to one of four classes: introduction, method, results, or discussion. Experienced readers are able to locate themselves more quickly and correctly.           

786

BOOK REVIEWS

 

At the Crossroads: Librarians on the Information Superhighway
by Herbert S. White
Reviewed by Charles H. Davis

 

789

 

Fril--Fuzzy and Evidential Reasoning in Artificial Intelligence
by J. F. Baldwin, T. P. Martin, and B. W. Pilsworth
Reviewed by Nikola Kasabov

 

790

 

Electric Words: Dictionaries, Computers, and Meanings
by Yorick A. Wilks, Brian M. Slator, and Louise M. Guthrie
Reviewed by Julian Warner

 

791

 

Finding Government Information on the Internet
edited by John Maxymuk
Reviewed by Deborah Hunt

 

792

 

Measuring Information: An Information Services Perspective
by Jean Tague-Sutcliffe
Reviewed by Robert Losee

 

794

 

Information Management for the Intelligent Organization: The Art of Scanning the Environment
by Chun Wei Choo
Reviewed by Kenneth G. Madden

 

795

 

Contextual Media: Multimedia and Interpretation
edited by Edward Barrett and Marie Redmond
Reviewed by Julia Gelfand

 

796

 

Learning Networks: A Field Guide to Teaching and Learning Online
by Linda Harasim, Starr Roxanne Hiltz, Lucio Teles, and Murray Turoff
Reviewed by Robert Wittorf

797

ASIS HomeSearch ASISMake A Comment

© 1998 , Association for Information Science
Last update: November 06, 1998