Skip to main content


eCommons@Cornell >
College of Engineering >
Computer Science >
Computer Science Technical Reports >

Please use this identifier to cite or link to this item:
Title: Automatic Text Decomposition Using Text Segments and Text Themes
Authors: Salton, Gerard
Singhal, Amit
Buckley, Chris
Mitra, Mandar
Keywords: computer science
technical report
Issue Date: Nov-1995
Publisher: Cornell University
Abstract: With the widespread use of full-text information retrieval, passage-retrieval techniques are becoming increasingly popular. Larger texts can then be replaced by important text excerpts, thereby simplifying the retrieval task and improving retrieval effectiveness. Passage-level evidence about the use of words in local contexts is also useful for resolving language ambiguities and improving retrieval output. Two main text decomposition strategies are introduced in this study, including a chronological decomposition into {\em text segments}, and semantic decomposition into {\em text themes}. The interaction between text segments and text themes is then used to characterize text structure, and to formulate specifications for information retrieval, text traversal, and text summarization.
Appears in Collections:Computer Science Technical Reports

Files in This Item:

File Description SizeFormat
95-1555.pdf231.01 kBAdobe PDFView/Open
95-1555.ps374.22 kBPostscriptView/Open

Refworks Export

Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.


© 2014 Cornell University Library Contact Us