help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH

First published April 24, 2008 as JAMIA PrePrint; doi:10.1197/jamia.M2663
Journal of the American Medical Informatics Association 2008;15(4):542-545
© 2008 American Medical Informatics Association


A more recent version of this article appeared on July 1, 2008
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M2663v1
15/4/542    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Coiera, E. W.
Right arrow Articles by Vickland, V.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Coiera, E. W.
Right arrow Articles by Vickland, V.

Submitted on November 12, 2007
Accepted on February 21, 2008

Is relevance relevant? User relevance ratings may not predict the impact of Internet search on decision outcomes

Enrico W. Coiera PhD1* and Victor Vickland1

Affiliation of the authors: 1 Centre for Health Informatics, University of New South Wales, New South Wales, Australia

* To whom correspondence should be addressed.

Objective A common measure of Internet search engine effectiveness is its ability to find documents that a user perceives as 'relevant'. This study sought to test whether user provided relevance ratings for documents retrieved by an Internet search engine correlate with the decision outcome after use of a search engine.

Design 227 university students were asked to answer four randomly assigned consumer health questions, then to conduct an Internet search on one of two randomly assigned search engines of different performance, and to again answer the question.

Measurements Participants were asked to provide a relevance score for each document retrieved as well as a pre and post search answer to each question.

Results User relevance rankings had little or no predictive power. Relevance rankings were unable to predict whether the user of a search engine could correctly answer a question after search and could not differentiate between two search engines with statistically different performance in the hands of users. Only when users had strong prior knowledge of the questions, and the decision task was of low complexity, did relevance appear to have modest predictive power.

Conclusion User provided relevance rankings taken in isolation seem to be of limited to no value when designing a search engine that will be used in a general-purpose setting. Relevance rankings may have a place in situations in which experts provide rankings, and decision tasks are of complexity commensurate with the abilities of the raters. A more natural metric of search engine performance may be a user's ability to accurately complete a task, as this removes the inherent subjectivity of relevance rankings, and provides a direct and repeatable outcome measure which directly correlates with the performance of the search technology in the hands of users.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Copyright © 1994 by the American Medical Informatics Association.