Weichselbraun, Albert and Gindl, Stefan and Scharl, Arno (2011) Using Games with a Purpose and Bootstrapping to Create Domain-Specific Sentiment Lexicons. In: Proceedings of the 19th ACM Conference on Information and Knowledge Management (CIKM 2011), 24.-28. Oktober 2011, Glasgow, Scotland, UK.
Preview |
PDF (Using Games with a Purpose and Bootstrapping to Create Domain-Specific Sentiment Lexicons)
- Accepted Version
437kB |
Abstract
Sentiment detection analyzes the positive or negative polarity of text. The field has received considerable attention in recent years, since it plays an important role in providing means to assess user opinions regarding an organisation's products, services, or actions. Approaches towards sentiment detection include machine learning techniques as well as computationally less expensive methods. The latter rely on the use of language-specific sentiment lexicons, which are lists of sentiment terms with their corresponding sentiment value. The effort involved in creating, customizing, and extending sentiment lexicons is considerable, particularly if less common languages and domains are targeted without access to appropriate language resources. This paper proposes a semi-automatic approach for the creation of sentiment lexicons which assigns sentiment values to sentiment terms via crowdsourcing. Furthermore, it introduces a bootstrapping process operating on unlabeled domain documents to extend the created lexicons, and to customize them according to the particular use case. This process considers sentiment terms as well as sentiment indicators occurring in the discourse surrounding a particular topic. Such indicators are associated with a positive or negative context in a particular domain, but might have a neutral connotation in other domains. A formal evaluation shows that bootstrapping considerably improves the method's recall. Automatically created lexicons yield a performance comparable to professionally created language resources such as the General Inquirer.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | Sentiment Detection, Sentiment Analysis, Bootstrapping, Language Resources, Sentiment Lexicon, Crowd-Sourcing |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science |
ID Code: | 33 |
Deposited By: | Dr Albert Weichselbraun |
Deposited On: | 24 Mar 2012 18:03 |
Last Modified: | 24 Mar 2012 18:03 |
Repository Staff Only: item control page