help button home button JAMIA Bigger figures
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

First published April 24, 2008 as JAMIA PrePrint; doi:10.1197/jamia.M2663
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M2663v1
15/4/542    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Google Scholar
Right arrow Articles by Coiera, E. W.
Right arrow Articles by Vickland, V.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Coiera, E. W.
Right arrow Articles by Vickland, V.
J Am Med Inform Assoc. 2008;15:542-545. DOI 10.1197/jamia.M2663.
© 2008 American Medical Informatics Association


Research Paper

Is Relevance Relevant? User Relevance Ratings May Not Predict the Impact of Internet Search on Decision Outcomes

Enrico W. Coiera* and Victor Vickland

Centre for Health Informatics, University of New South Wales, New South Wales, Australia.

* Correspondence: Enrico W. Coiera, University of New South Wales, Centre for Health Informatics, UNSW 2055, Australia (Email: e.coiera{at}unsw.edu.au).

Received for publication: 11/12/07; accepted for publication: 02/21/08.

Objective: A common measure of Internet search engine effectiveness is its ability to find documents that a user perceives as ‘relevant’. This study sought to test whether user provided relevance ratings for documents retrieved by an Internet search engine correlate with the decision outcome after use of a search engine.

Design: 227 university students were asked to answer four randomly assigned consumer health questions, then to conduct an Internet search on one of two randomly assigned search engines of different performance, and to again answer the question.

Measurements: Participants were asked to provide a relevance score for each document retrieved as well as a pre and post search answer to each question.

Results: User relevance rankings had little or no predictive power. Relevance rankings were unable to predict whether the user of a search engine could correctly answer a question after search and could not differentiate between two search engines with statistically different performance in the hands of users. Only when users had strong prior knowledge of the questions, and the decision task was of low complexity, did relevance appear to have modest predictive power.

Conclusions: User provided relevance rankings taken in isolation seem to be of limited to no value when designing a search engine that will be used in a general-purpose setting. Relevance rankings may have a place in situations in which experts provide rankings, and decision tasks are of complexity commensurate with the abilities of the raters. A more natural metric of search engine performance may be a user's ability to accurately complete a task, as this removes the inherent subjectivity of relevance rankings, and provides a direct and repeatable outcome measure which directly correlates with the performance of the search technology in the hands of users.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2008 by the American Medical Informatics Association.