Skip to main content


eCommons@Cornell

eCommons@Cornell >
College of Engineering >
Computer Science >
Computer Science Technical Reports >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1813/7061
Title: Generation and Search of Clustered Files
Authors: Bergmark, D.
Salton, Gerard
Wong, A.
Keywords: computer science
technical report
Issue Date: Jan-1977
Publisher: Cornell University
Citation: http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR77-299
Abstract: A classified, or clustered file is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable. Clustered files exhibit substantial advantages in many retrieval environments over the more conventional inverted list or multilist technologies. An inexpensive file clustering method applicable to large files is given together with appropriate file search methods. An abstract model is used to predict the retrieval effectiveness of various search methods in a clustered file environment, and experimental evidence is introduced to confirm the usefulness of the model. As an example, a collection of research papers in computer science is clustered automatically, and the resulting research clusters are compared with exissting, manually constructed taxonomies for the computer field.
URI: http://hdl.handle.net/1813/7061
Appears in Collections:Computer Science Technical Reports

Files in This Item:

File Description SizeFormat
77-299.pdf2.51 MBAdobe PDFView/Open
77-299.ps1.25 MBPostscriptView/Open

Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.

 

© 2010 Cornell University Library Contact Us