College of Engineering >
Computer Science >
Computer Science Technical Reports >
Please use this identifier to cite or link to this item:
|Title: ||A Theory of Term Importance in Automatic Text Analysis|
|Authors: ||Salton, Gerard|
Yang, C. S.
Yu, C. T.
|Keywords: ||computer science|
|Issue Date: ||Jul-1974|
|Publisher: ||Cornell University|
|Abstract: ||Most existing automatic content analysis and indexing techniques are based on word frequency characteristics applied largely in an ad hoc manner. Contradictory requirements arise in this connection, in that terms exhibiting high occurence frequencies in individual documents are often useful for high recall performance (to retrieve many relevant items), whereas terms with low frequency in the whole collection are useful for high precision (to reject nonrelevant items).|
|Appears in Collections:||Computer Science Technical Reports|
Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.