Skip to main content


eCommons@Cornell

eCommons@Cornell >
College of Engineering >
Computer Science >
Computer Science Technical Reports >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1813/7366
Title: Clustered File Generation and Its Application to Computer Science Taxonomies
Authors: Bergmark, D.
Salton, Gerard
Keywords: computer science
technical report
Issue Date: Dec-1976
Publisher: Cornell University
Citation: http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR76-295
Abstract: A clustered file organization is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable. Such a file organization is advantageous for interactive searching where tentative query formulations may be used and the records may be specified incompletely or approximately. An inexpensive file clustering method applicable to large files is given together with an appropriate file search method. The method is used to cluster a file of research articles in computer science based on citation similarities between the papers; this leads to the identification of groups of active computer science research topics and of productive computer scientists.
URI: http://hdl.handle.net/1813/7366
Appears in Collections:Computer Science Technical Reports

Files in This Item:

File Description SizeFormat
76-295.pdf878.73 kBAdobe PDFView/Open
76-295.ps510.13 kBPostscriptView/Open

Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.

 

© Copyright 2003-2009 by the Cornell University Library Contact Us