Skip to main content


eCommons@Cornell

eCommons@Cornell >
College of Engineering >
Computer Science >
Computer Science Technical Reports >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1813/5959
Title: A New Comparison Between Conventional Indexing (MEDLARS) andAutomatic Text Processing (SMART)
Authors: Salton, Gerard
Keywords: computer science
technical report
Issue Date: Dec-1971
Publisher: Cornell University
Citation: http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR71-115
Abstract: A new testing process is described designed to compare conventional retrieval (MEDLARS) and automatic text analysis methods (SMART). The results obtained with a collection of documents chosen independently of either SMART or MEDLARS indicate that a simple automatic extraction of keywords from document abstracts produces a 30 to 40 percent loss compared with MEDLARS indexing. A replacement of the unranked Boolean searches used in MEDLARS by the standard ranked output normally provided by SMART reduces the loss to between 15 and 20 percent. When an automatically generated word control list or a thesaurus is used as part of the SMART analysis, the results are comparable in effectiveness to those obtained by the intellectual MEDLARS indexing. Finally, the incorporation of user feedback procedures into SMART furnishes an improvement over the normal MEDLARS output of 15 to 30 percent. One concludes again that no technical justification exists for maintaining controlled, manual indexing in operational retrieval environments.
URI: http://hdl.handle.net/1813/5959
Appears in Collections:Computer Science Technical Reports

Files in This Item:

File Description SizeFormat
71-115.pdf2.14 MBAdobe PDFView/Open
71-115.ps753 kBPostscriptView/Open

Refworks Export

Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.

 

© 2014 Cornell University Library Contact Us