Mostra i principali dati dell'item

dc.contributor.authorBasarslan, Muhammet Sinan
dc.contributor.authorKayaalp, Fatih
dc.date.accessioned2021-05-21T10:59:17Z
dc.date.available2021-05-21T10:59:17Z
dc.date.issued2020-09-17
dc.identifier.citationADCAIJ: Advances in Distributed Computing and Artificial Intelligence Journal, 9 (2020)
dc.identifier.issn2255-2863
dc.identifier.urihttp://hdl.handle.net/10366/146100
dc.description.abstractSocial media has become an important part of our everyday life due to the widespread use of the Internet. Of the social media services, Twitter is among the most used ones around the world. People share their opinions by writing tweets about numerous subjects, such as politics, sports, economy, etc. Millions of tweets per day create a huge dataset, which drew attention of the data scientists to focus on these data for sentiment analysis. The sentiment analysis focuses to identify the social media posts of users about a specific topic and categorize them as positive, negative or neutral. Thus, the study aims to investigate the effect of types of text representation on the performance of sentiment analysis. In this study, two datasets were used in the experiments. The first one is the user reviews about movies from the IMDB, which has been labeled by Kotzias, and the second one is the Twitter tweets, including the tweets of users about health topic in English in 2019, collected using the Twitter API. The Python programming language was used in the study both for implementing the classification models using the Naïve Bayes (NB), Support Vector Machines (SVM) and Artificial Neural Networks (ANN) algorithms, and for categorizing the sentiments as positive, negative and neutral. The feature extraction from the dataset was performed using Term Frequency-Inverse Document Frequency (TF-IDF) and Word2Vec (W2V) modeling techniques. The success percentages of the classification algorithms were compared at the end. According to the experimental results, Artificial Neural Network had the best accuracy performance in both datasets compared to the others.
dc.format.mimetypeapplication/pdf
dc.language.isoeng
dc.publisherEdiciones Universidad de Salamanca (España)
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectSentiment analysis
dc.subjectsocial media
dc.subjectpython
dc.subjectnatural language
dc.subjectNatural Language Processing
dc.titleSentiment Analysis with Machine Learning Methods on Social Media
dc.typeinfo:eu-repo/semantics/article
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess


Files in questo item

Thumbnail

Questo item appare nelle seguenti collezioni

Mostra i principali dati dell'item