The results are compared with studies of public library user behavior in unassisted OPAC searches. Users' initial query formulations are very similar in the two situations. In the mediated searches, ambiguities are resolved and users' information needs are determined not primarily through extensive, direct elicitation by the intermediary, but hile interacting with the material on the shelf. Replication of this functionality in the OPAC's interaction with the user would solve many problems in unassisted end-user searches.
Few studies have been conducted in Public libraries, those that exist report even higher counts of failed searches than those who study Academic library catalog use. The author's own transaction log analysis of 350 searches in a Public library OPAC (unpublished) shows an overall failure rate of 30%, with 44% of topical searches failing, whereas of the 1000 separate queries which make up the 350 searches, 60% of the total and 70% of the topic-related queries failed.
In analyzing the causes of these failures it has been pointed out (see e.g. Norgard et al., 1993) that most OPACs in current use in libraries are still, both in content and functionality, rooted in the card catalogs they replaced. These catalogs were essentially designed for mediation; the rules for their production and ordering were designed for the librarian rather than the library user. At the same time technical development, particularly in the field of communication, has rapidly changed the catalogs' environment, so that for many users of electronic catalogs mediation will no longer be available, they will be accessing the catalog from their home or their desk at work. The current generation of OPACs provides poor support for this kind of unmediated search.
It is interesting to note that with the rapid growth of the Internet and the ubiquity of presentation media like the World Wide Web and access mechanisms like Netscape, a long tradition of technology-driven OPAC development continues; numerous libraries now seem to rush to make their catalogs available via the WWW based on the technological possibilities, rather than on an assessment of user needs and capabilities.
One approach to generating the background knowledge necessary for a user-centered process of OPAC improvement is to study the interaction between users and human intermediaries in the information search situation, to see whether there are patterns of interaction which can be replicated in the electronic catalog's interaction with the user. A number of such studies have been conducted, almost exclusively, however, in an Academic library setting, and on searches in bibliographic databases rather than OPACs. Belkin, Brooks and Daniels (e.g. in Belkin et al., 1987) arrive at a categorization of the elements of user-intermediary interaction through discourse analysis of a limited number of database searches by trained intermediaries. Saracevic et al. (1990) analyze 40 DIALOG searches by four professional searchers, data from this study have been further analyzed by other researches, most recently by Spink et al. (1995), who studied the elicitations made by intermediaries in the course of the search interaction.
In the Public library setting, a much-quoted study by Lynch (1977) categorize the librarians' questions through content analysis of some 300 unobtrusively recorded reference interviews. A similar technique was used by Dewdney (1986), but in this case to study the effect of a particular kind of training on librarians' question behavior. In Denmark, studies by Ingwersen and Kaae (e.g. Ingwersen and Kaae, 1980) investigated various aspects of librarians' problem-solving behavior, including analysis of a few recorded user-librarian interactions. Mark Pejtersen (1986) used recorded interactions between librarians and users searching for fiction to develop a classification scheme for fiction, adapted to users' search strategies, and to design the "Book- house" interface for a catalog based on this classification.
Existing studies directed towards the particular search situation which one encounters in the public library are few and largely old. There is need for further research, particularly in the light of what we now know about users' problems in the unmediated search situation. The main purpose of the study which is in part reported here has been to attempt to identify patterns of user-intermediary interaction, in particular to see how and to what extent the intermediary establishes a model of the user and the user's information need, and to investigate whether elements of this modeling process may be expected to have an impact on problems identified in unmediated OPAC search sessions.
A total of 170 interactions, involving six different librarians, were recorded. They do not represent all the transactions of the days in question, as at times more than one librarian manned the information desk, and as the recorder was turned off in quiet periods and was not always turned on for a question which obviously would not involve negotiation. As the distribution of types of users and types of questions varies somewhat over the year, for instance with heavier use by students/schoolchildren at midterm than towards the end of term (when this data sample was collected) the data is not a true random sample. The goal of the project, however, was to investigate a representative, not a random, set of user- intermediary interactions. As a control of this, the sample was compared with an inventory of user questions collected during a week at a different time of the year. The themes and types of questions in this inventory match the recorded data quite well. The age and sex distribution of the users in the sample also approximately match the distribution of registered borrowers of the library, with a slant towards the younger age groups.
The 170 interactions were categorized in the following groups:
| Help to find particular titles | 52 |
| Help to find books by particular authors | 8 |
| Concrete problems/directional inquiries | 10 |
| Help to reserve books | 9 |
| Topical (subject) inquiries | 86 |
| Incomplete or technically faulty recording | 5 |
This study focuses on the topical inquiries, which cause the majority of problems for users when they perform unmediated online searches. 40 of the topical inquiries were selected for transcription and analysis. The criterion for transcription was that some interaction was taking place between user and intermediary, other than the initial question and the final negotiation of the retrieved item(s). This interaction might for instance take the form of one or more question/answer sessions, or of unsolicited additional information from the user and response from the intermediary. In addition the transcribed transactions were selected to give an approximately balanced representation to all the librarians involved in the study. A somewhat simplified version of the standard conversation analysis transcription technique developed by Gail Jefferson (Atkinson & Heritage, 1984) was used for the transcripts.
Through content analysis, several aspects of the transcribed interactions were examined:
Since the data consist only of the recorded interactions, with no follow-up questions posed by the investigator to either user or intermediary during or after the sessions, both the determination of success and the categorization of the elements in the interaction are open to misinterpretation. The results are so consistent both between the intermediaries and between different types of users, however, that it seems justifyable to draw conclusions from the material.
The interactions follow a remarkably consistent pattern. Three phases can be identified: an initial problem presentation and clarification phase, a catalog consultation phase (during which problem clarification may continue) and a problem solution phase which in all but one of the 40 cases consists of the librarian going with the user to the shelves to negotiate a search result. The pre-search problem presentation phase is brief, lasting 15 seconds on the average and in only one case lasting longer than 30 seconds. In about 1/3 of the cases the interaction moves directly from the problem presentation phase to the shelf. In the remaining 2/3, where the problem presentation is followed by a catalog search, the duration of this search varies with the librarian's knowledge of the collection and the catalog, but it is rarely a question of entering more than one search term. In any case, the shelf negotiation phase is by far the longest lasting. On the average, it occupies approx. 75% of each interaction, and only in 4 cases does the time spent at the shelves make up less than 50% of the total interaction time.
In the light of the characterization above, it is remarkable that in only 18 out of 40 cases did the intermediary elicit additional information after the user's first problem statement before turning to the shelf or to the catalog, and in half of these cases again only a single question was asked. As has been shown, this does not mean that the users' initial problem statements present a clear and complete representation of the problem at hand, on the contrary, by the end of the interaction the problem as presented in the initial statement had been changed or modified in 60% of the cases. It is rather a reflection of the role of the catalog search and of the search at the shelf in the total interaction. This will be further discussed below.
When all three interaction phases are considered, the intermediary asked 2 questions on the average during an interaction, or 2.5 if follow-up questions which just echo or reiterate the user's statement are counted. The questions were distributed as follows:
| Subject definition (specification, fact gathering or instanciation through particular titles): | 50% |
| User objective or motivation: | 5% |
| Personal characteristics of users (language restriction, level of difficulty, urgency, amount of material needed etc.): | 12.5% |
| User status in relation to system (sources checked, catalog use, previous browsing): | 10% |
| Verification of acceptable answer: | 22.5% |
The main part of the subject definition elicitations serve a disambiguation purpose ("Is it something in connection with genealogy you are working on?" is the question triggered by the Telemark problem statement quoted above); questions pertaining to the librarian's factual knowledge (of the type "What are martial arts?") are less frequent. More than 80 % of the subject definition elicitations have the form of closed questions.
Less than 20% of the intermediaries' elicitation are directed towards establishing a model of the user, leaving aside the "verification of acceptable answer" category of elicitation, which might in a difficultly definable way contribute to such a model ("is this what you are thinking of?"), but which in most cases takes place at the very end of the interaction.
In so far as the terms precision and recall are meaningful in this connection, the users without exception ask for precision rather than recall - they are interested in something about their subject of interest rather than everything about it. Most frquently the user terminates the transaction, with comments like "then I think this will do, thank you". On the other hand they generally show quite a lot of patience during the browsing phase and accept being taken from shelf to shelf to judge titles as the librarian's chain of associations develop.
It was assumed at the outset of this study that a process of model-building on the part of the intermediary would be identified, and that the elements of this model-building might be isolated. Such model-building is obviously taking place, it is impossible for the intermediary to engage in problem- solving interaction without having and developing a model both of the user and the user's problem. The nature and frequency of intermediary elicitations show that the main emphasis in the interaction is on the modeling and understanding of the user's problem.
The study indicates, however, that the intermediary's model of the system, represented by the organization of the catalog and of the documents on the shelf, and the intermediary's model of the world, represented by general knowledge and in particular by knowledge generated through queries encountered in the past, play a large role in the i nteraction. A relatively modest amount of information elicitation takes place, both pertaining to the subject of the query and the situation of the user. The catalog search seems to serve the purpose of matching the query to the intermediary's world and collection model and provide a focus for the further search rather than to develop a detailed understanding of the query. The overwhelming predominance of closed questions from the intermediary seem to point in the same direction. The greater part of the interaction takes place at the shelf, where the user is exposed to the system model. It seems to be as much through participating in this exposure and observing the user's reactions to it as through direct elicitation that the intermediary establishes the user and problem model and is able to assist the user to a successful completion of the interaction.
More research is needed to establish if this is indeed the case. The next sep might be to interview intermediaries to gain insight in their perception of the process and the models they posess and establish. It may be that the nature of the queries and the collection in question is conductive to the interaction process the intermediaries are observed to chose - the degree of specificity and constraints on the queries ("must have illustrations","must not be too difficult" etc.) in many cases demand that the documents are examined to determine their value to the user. An interesting follow-up study would be to examine in more detail how the user and the intermediaries work with the documents on the shelf.
The findings indicate some fruitful directions for online catalog improvement. The author's and other OPAC user studies show that OPAC users have difficulty in applying any but the most simple search functions. Their query formulations are often ambiguous, do not convey the real information need and do not match the language of the system. OPAC users also seem to have difficulties in applying information from the document descriptions they do retrieve in a search to improve their search strategy, for instance by choosing new subject headings or keywords. These problems might be alleviated if the OPAC could:
Belkin, N. J. Et al. (1987). Knowledge elicitation using discourse analysis. International Journal of Man-Machine Studies, 27, 127-144.
Dewdney, P. (1992). Recording the reference interview: a field experiment. In Glazier, J. D. And Powell, R. R., eds. Qualitative research in information management, Englewood, Co., Libraries Unlimited, 122-150.
Hunter, R. N. (1991). Successes and failures of patrons searching the online catalog at a large academic library: A transaction log analysis. RQ, 30,395-402.
Ingwersen, P. And Kaae, S. (1980). User-librarian negotiations and information search procedures in public libraries. Analysis of verbal protocols. Final report. Copenhagen, Royal School of Librarianship.
Lynch, M. J. (1977). Reference interviews in public libraries. Ph. D. Dissertation, Rutgers, The State University of New Jersey.
Mark Pejtersen, A. (1986). Design and test of a database for fiction based on an analysis of children's search behvior. IN Ingwersen, P. Et al., eds. Information technology and information use: toward a unified view of information technology. London, Taylor Graham, 125-145.
Norgard, B. A. Et al. (1993). The online catalog: From technical services to access service. Advances in librarianship,17, 111-148.
Peters, T. A. (1989). When smart people fail: An analysis of the transaction log of an online public access catalog. Journal of Academic Librarianship, 15(5), 267-273.
Peters, T. A. (1993). The history and development of transaction log analysis. Library Hi-Tech, 11(2), 41-66.
Saracevic, T. Et al. (1990). Nature of interaction between users and intermediaries in online searching: A qualitative analysis. Proceedings of the 53rd Annual Meeting of the American Society for Information Science, 27, 47-54.
Spink, A. Et al. (1995). Search intermediary elicitations during mediated online searching. Proceedings of the 58th Annual Meeting of the American Society for Information Science, 32, 97-102.
Taylor, R. (1968). Question negotiation and information seeking in libraries. College and Research Libraries, 29, 178-194.
Tonta, Y. (1992). Analysis of search failures in document retrieval systems: A review. Public-Access Computer Systems Review, 3(1), 4-53.
© 1996, American Society for Information Science. Permission to copy and distribute this document is hereby granted provided that this copyright notice is retained on all copies and that copies are not altered.