|
eCommons@Cornell >
Cornell University Graduate School >
Theses and Dissertations (OPEN) >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1813/11166
| Title: | EMPIRICAL METHODS FOR FINE-GRAINED OPINION EXTRACTION FROM TEXT |
| Authors: | Breck, Eric |
| Keywords: | computer science natural language processing machine learning subjectivity sentiment opinion computational workflow computational linguistics |
| Issue Date: | 29-Jul-2008 |
| Abstract: | Opinions are everywhere. The op/ed pages of newspapers, political blogs, and
consumer websites like epinions.com are just some examples of the textual opinions available to readers. And there are many consumers who are interested in
following these opinions - intelligence analysts who track the opinions of foreign countries, public relation firms who want to ensure positive opinions for
their clients, pollsters who want to know the public's opinions about politicians,
and companies who want to know customers' opinions about their products.
The problem faced by all of these consumers of opinion is that there is such a
wealth of text to process that it is hard to read it all. Central to processing the
opinions in these text will be solving two specific problems - identifying expressions of opinion, and identifying their hierarchical structure. We demonstrate
solutions involving empirical natural language processing techniques.
Although empirical, data-driven methods such as these have become the
norm in natural language processing, little work has been done in analyzing
their impact on the reproducibility, efficiency, and effectiveness of research. We
address two specific problems in this area. We introduce a lightweight computational workflow system to improve the reproducibility and efficiency of
machine learning and natural language processing experiments. And we investigate the process of feature generation, setting out desiderata for an ideal
process and exploring the effectiveness of several alternatives. Both are investigated in the context of the natural language learning tasks set out earlier. |
| URI: | http://hdl.handle.net/1813/11166 |
| Appears in Collections: | Theses and Dissertations (OPEN)
|
Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.
|