Skip to main content


eCommons@Cornell >
College of Engineering >
Computer Science >
Computer Science Technical Reports >

Please use this identifier to cite or link to this item:
Title: Automatic Structuring of Text Files
Authors: Salton, Gerard
Buckley, Chris
Allan, James
Keywords: computer science
technical report
Issue Date: Oct-1991
Publisher: Cornell University
Abstract: In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage, and deal with many different subjects. In such an environment it is important to provide flexible access to individual text pieces, and to structure the collection so that related text elements are identified and appropriately linked. Methods are described in this study for the automatic structuring of heterogeneous text collections, and the construction of browsing tools and access procedures that facilitate collection use. The proposed methods are illustrated by performing searches with a large automated encyclopedia.
Appears in Collections:Computer Science Technical Reports

Files in This Item:

File Description SizeFormat
91-1241.pdf1.84 MBAdobe PDFView/Open
91-1241.ps437.87 kBPostscriptView/Open

Refworks Export

Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.


© 2014 Cornell University Library Contact Us