webLyzard Publications

Building Tagged Linguistic Unit Databases for Sentiment Detection

Scharl, Arno and Weichselbraun, Albert and Gindl, Stefan (2008) Building Tagged Linguistic Unit Databases for Sentiment Detection. In: Proceedings of the 8th International Conference on Knowledge Management (I-Know '08), sentiment detection, semantic orientation, tagged linguistic unit, sentiment analysis, Graz, Austria.

[thumbnail of Building Tagged Linguistic Unit Databases for Sentiment Detection]
Preview
PDF (Building Tagged Linguistic Unit Databases for Sentiment Detection) - Accepted Version
206kB

Abstract

Despite the obvious business value of visualizing similarities between elements of evolving information spaces and mapping these similarities e.g. onto geospatial reference systems, analysts are often more interested in how the semantic orientation (sentiment) towards an organization, a product or a particular technology is changing over time. Unfortunately, popular methods that process unstructured textual material to detect semantic orientation automatically based on tagged dictionaries are not capable of fulfilling this task, even when coupled with part-of-speech tagging, a standard component of most text processing toolkits that distinguishes grammatical categories such as article (AT), noun (NN), verb (VB), and adverb (RB). Small corpus size, ambiguity and subtle incremental change of tonal expressions between different versions of a document complicate the detection of semantic orientation and often prevent promising algorithms from being incorporated into commercial applications. Parsing grammatical structures, by contrast, outperforms dictionary-based approaches in terms of reliability, but usually suffers from poor scalability due to their computational complexity. This paper addresses this predicament by presenting an alternative approach based on automatically building Tagged Linguistic Unit (TLU) databases to overcome the restrictions of dictionaries with a limited set of tagged tokens.

Item Type:Conference or Workshop Item (Paper)
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
ID Code:62
Deposited By: Dr Albert Weichselbraun
Deposited On:29 Nov 2012 07:36
Last Modified:29 Nov 2012 07:36

Repository Staff Only: item control page