Académique Documents
Professionnel Documents
Culture Documents
Sentiment analysis (sometimes known as opinion min- in most statistical classication methods, the neutral class
ing or emotion AI) refers to the use of natural lan- is ignored under the assumption that neutral texts lie near
guage processing, text analysis, computational linguistics, the boundary of the binary classier, several researchers
and biometrics to systematically identify, extract, quan- suggest that, as in every polarity problem, three cate-
tify, and study aective states and subjective informa- gories must be identied. Moreover, it can be proven
tion. Sentiment analysis is widely applied to voice of the that specic classiers such as the Max Entropy[6] and
customer materials such as reviews and survey responses, the SVMs[7] can benet from the introduction of a neu-
online and social media, and healthcare materials for ap- tral class and improve the overall accuracy of the classi-
plications that range from marketing to customer service cation. There are in principle two ways for operating
to clinical medicine. with a neutral class. Either, the algorithm proceeds by
Generally speaking, sentiment analysis aims to determine rst identifying the neutral language, ltering it out and
the attitude of a speaker, writer, or other subject with then assessing the rest in terms of positive and negative
respect to some topic or the overall contextual polar- sentiments, or it builds a three-way classication in one
ity or emotional reaction to a document, interaction, or step.[8] This second approach often involves estimating
event. The attitude may be a judgment or evaluation (see a probability distribution over all categories (e.g. naive
appraisal theory), aective state (that is to say, the emo- Bayes classiers as implemented by Pythons NLTK kit).
tional state of the author or speaker), or the intended emo- Whether and how to use a neutral class depends on the
tional communication (that is to say, the emotional eect nature of the data: if the data is clearly clustered into
intended by the author or interlocutor). neutral, negative and positive language, it makes sense to
lter the neutral language out and focus on the polarity
between positive and negative sentiments. If, in contrast,
the data are mostly neutral with small deviations towards
1 Types positive and negative aect, this strategy would make it
harder to clearly distinguish between the two poles.
A basic task in sentiment analysis is classifying the po- A dierent method for determining sentiment is the use
larity of a given text at the document, sentence, or fea- of a scaling system whereby words commonly associated
ture/aspect levelwhether the expressed opinion in a with having a negative, neutral, or positive sentiment with
document, a sentence or an entity feature/aspect is pos- them are given an associated number on a 10 to +10
itive, negative, or neutral. Advanced, beyond polarity scale (most negative up to most positive) or simply from
sentiment classication looks, for instance, at emotional 0 to a positive upper limit such as +4. This makes it pos-
states such as angry, sad, and happy. sible to adjust the sentiment of a given term relative to its
environment (usually on the level of the sentence). When
One method that holds historical priority is the method of a piece of unstructured text is analyzed using natural lan-
Volcani and Fogel[1] , which remains in practice currently. guage processing, each concept in the specied environ-
The method examines individual words and phrases with ment is given a score based on the way sentiment words
respect to dierent emotional scales and presents syn- relate to the concept and its associated score.[9] This al-
onyms that can be used to increase or decrease the level lows movement to a more sophisticated understanding of
of evoked emotion in each scale. sentiment, because it is now possible to adjust the sen-
Other early work included Turney[2] , and Pang[3] who ap- timent value of a concept relative to modications that
plied dierent methods for detecting the polarity of prod- may surround it. Words, for example, that intensify, re-
uct reviews and movie reviews respectively. This work is lax or negate the sentiment expressed by the concept can
at the document level. One can also classify a documents aect its score. Alternatively, texts can be given a posi-
polarity on a multi-way scale, which was attempted by tive and negative sentiment strength score if the goal is to
Pang[4] and Snyder[5] among others: Pang and Lee[4] ex- determine the sentiment in a text rather than the overall
panded the basic task of classifying a movie review as polarity and strength of the text.[10]
either positive or negative to predict star ratings on ei-
ther a 3- or a 4-star scale, while Snyder[5] performed an
in-depth analysis of restaurant reviews, predicting ratings
for various aspects of the given restaurant, such as the
food and atmosphere (on a ve-star scale). Even though
1
2 3 EVALUATION
helpful for recommender system. Besides, a review can [9] Taboada, Maite; Brooke, Julian (2011). Lexicon-based
be designed to hinder sales of a target product, thus be methods for sentiment analysis. Computational Linguis-
harmful to the recommender system even it is well writ- tics. 37 (2): 272274. doi:10.1162/coli_a_00049.
ten.
[10] Thelwall, Mike; Buckley, Kevan; Paltoglou, Georgios;
Researchers also found that long and short form of user- Cai, Di; Kappas, Arvid (2010). Sentiment strength de-
generated text should be treated dierently. An interest- tection in short informal text. Journal of the American
ing result shows that short form reviews are sometimes Society for Information Science and Technology. 61 (12):
[46]
more helpful than long form, because it is easier to l- 25442558. doi:10.1002/asi.21416.
ter out the noise in a short form text. For the long form [11] Pang, Bo; Lee, Lillian (2008). 4.1.2 Subjectivity De-
text, the growing length of the text does not always bring tection and Opinion Identication. Opinion Mining and
a proportionate increase of the number of features or sen- Sentiment Analysis. Now Publishers Inc.
timents in the text.
[12] Mihalcea, Rada; Banea, Carmen; Wiebe, Janyce (2007).
Learning Multilingual Subjective Language via Cross-
Lingual Projections (PDF). Proceedings of the Associa-
6 See also tion for Computational Linguistics (ACL). pp. 976983.
Johan Bollen [13] Su, Fangzhong; Markert, Katja (2008). From Words to
Senses: a Case Study in Subjectivity Recognition (PDF).
Market sentiment Proceedings of Coling 2008, Manchester, UK.
[23] Stevenson, Ryan; Mikels, Joseph; James, Thomas [38] Kirkpatrick, Marshall. ", ReadWriteWeb, 2009-04-15.
(2007). Characterization of the Aective Norms Retrieved on 2009-10-01.
for English Words by Discrete Emotional Categories
(PDF). Behavior Research Methods. 39 (4): 10201024. [39] CORDIS. Collective emotions in cyberspace (CYBER-
doi:10.3758/bf03192999. PMID 18183921. EMOTIONS)", European Commission, 2009-02-03. Re-
trieved on 2010-12-13.
[24] Kim, S. M.; Hovy, E. H. (2006). Identifying and
Analyzing Judgment Opinions. (PDF). Proceedings of [40] Condlie, Jamie. Flaming drives online social networks
the Human Language Technology / North American As- ", NewScientist, 2010-12-07. Retrieved on 2010-12-13.
sociation of Computational Linguistics conference (HLT-
[41] Tumasjan, Andranik; O.Sprenger, Timm; G.Sandner,
NAACL 2006). New York, NY.
Philipp; M.Welpe, Isabell (2010). Predicting Elections
[25] Dey, Lipika; Haque, S. K. Mirajul (2008). Opinion Min- with Twitter: What 140 Characters Reveal about Politi-
ing from Noisy Text Data. Proceedings of the second cal Sentiment. Proceedings of the Fourth International
workshop on Analytics for noisy unstructured text data, AAAI Conference on Weblogs and Social Media
p.83-90. [42] Tang, Huifeng, Songbo Tan, and Xueqi Cheng. A survey
[26] Cambria, E; Hussain, A (2012). Sentic Computing: Tech- on sentiment detection of reviews. Expert Systems with
niques, Tools, and Applications. Springer. Applications 36.7 (2009): 10760-10773.
[27] Akcora, Cuneyt Gurcan; Bayir, Murat Ali; Demirbas, [43] Jakob, Niklas, et al. Beyond the stars: exploiting free-
Murat; Ferhatosmanoglu, Hakan (2010). Identifying text user reviews to improve the accuracy of movie rec-
breakpoints in public opinion. SigKDD, Proceedings of ommendations. Proceedings of the 1st international CIKM
the First Workshop on Social Media Analytics. workshop on Topic-sentiment analysis for mass opinion.
ACM, 2009.
[28] Borth, Damian; Ji, Rongrong; Chen, Tao; Breuel,
Thomas; Chang, Shih-Fu (2013). Large-scale Visual [44] Hu, Minqing, and Bing Liu. Mining opinion features in
Sentiment Ontology and Detectors Using Adjective Noun customer reviews. AAAI. Vol. 4. No. 4. 2004.
Pairs. Proceedings of ACM Int. Conference on Multime- [45] Liu, Yang, et al. Modeling and predicting the helpfulness
dia. pp. 223232. of online reviews. Data mining, 2008. ICDM'08. Eighth
IEEE international conference on. IEEE, 2008.
[29] Case Study: Advanced Sentiment Analysis. Retrieved
18 October 2013. [46] Bermingham, Adam, and Alan F. Smeaton. Classifying
sentiment in microblogs: is brevity an advantage?. Pro-
[30] Mozeti, Igor; Grar, Miha; Smailovi, Jasmina (2016-
ceedings of the 19th ACM international conference on In-
05-05). Multilingual Twitter Sentiment Classication:
formation and knowledge management. ACM, 2010.
The Role of Human Annotators. PLOS ONE. 11 (5):
e0155036. doi:10.1371/journal.pone.0155036. ISSN
1932-6203. PMC 4858191 . PMID 27149621.
[36] Wright, Alex. Mining the Web for Feelings, Not Facts,
New York Times, 2009-08-23. Retrieved on 2009-10-01.
8.2 Images
File:Lock-green.svg Source: https://upload.wikimedia.org/wikipedia/commons/6/65/Lock-green.svg License: CC0 Contributors: en:File:
Free-to-read_lock_75.svg Original artist: User:Trappist the monk