Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir
Sentiment classification is technique to analyze the subjective information in the text then mine the opinion. Mostly people are using blog or twitter to collect the sentiment data but not frequently used newspaper because not so many researchers are using newspaper to classify sentiment data as the...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://ir.uitm.edu.my/id/eprint/35333/1/35333.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-uitm-ir.35333 |
---|---|
record_format |
uketd_dc |
spelling |
my-uitm-ir.353332020-10-20T06:45:47Z Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir 2013-01 Mohamad Nasir, Nur Fitri Nabila Analysis Electronic Computers. Computer Science Algorithms Sentiment classification is technique to analyze the subjective information in the text then mine the opinion. Mostly people are using blog or twitter to collect the sentiment data but not frequently used newspaper because not so many researchers are using newspaper to classify sentiment data as the main source. In this study, sentiment classifier using clonal algorithm selection was developed to categorize sentiment in Malay newspaper (Berita Harian). Another objective was to evaluate the proposed model effectiveness in classifying Malay newspaper’s data. In our method, the training of clonal selection algorithm (CSA) is first used to teach algorithm which is intelligent to categorize the sentiment in newspaper’s sentences into the polarity (positive, negative and neutraljfrom the data are collected and the testing was implemented after did the training to test whether a word should be taught correctly or not. Firstly, the data was dividing by ratio 80:20 from 1000 sentences. Therefore, 80% from 1000 sentences will use for training and 20% from 1000 sentences use for testing. Secondly, the data was dividing by ratio 70:30 which are 700 newspaper’s sentences as the training data and 300 newspaper’s sentences as the testing data. The experimental results show that our method can achieve better performance in clonal selection algorithm sentiment classification and the data collected cannot be used at once in this model because training data is very time-consuming if using all the data. The experiment achieves the best accuracy at 89.0%for ratio 70:30.This model was built with capability to help user in classifying newspaper sentence in easy way. 2013-01 Thesis https://ir.uitm.edu.my/id/eprint/35333/ https://ir.uitm.edu.my/id/eprint/35333/1/35333.pdf text en public degree Universiti Teknologi MARA Terengganu Faculty of Computer & Mathematical Sciences Isa, Norulhidayah Jantan, Hamidah |
institution |
Universiti Teknologi MARA |
collection |
UiTM Institutional Repository |
language |
English |
advisor |
Isa, Norulhidayah Jantan, Hamidah |
topic |
Analysis Analysis Algorithms |
spellingShingle |
Analysis Analysis Algorithms Mohamad Nasir, Nur Fitri Nabila Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir |
description |
Sentiment classification is technique to analyze the subjective information in the text then mine the opinion. Mostly people are using blog or twitter to collect the sentiment data but not frequently used newspaper because not so many researchers are using newspaper to classify sentiment data as the main source. In this study, sentiment classifier using clonal algorithm selection was developed to categorize sentiment in Malay newspaper (Berita Harian). Another objective was to evaluate the proposed model effectiveness in classifying Malay newspaper’s data. In our method, the training of clonal selection algorithm (CSA) is first used to teach algorithm which is intelligent to categorize the sentiment in newspaper’s sentences into the polarity (positive, negative and neutraljfrom the data are collected and the testing was implemented after did the training to test whether a word should be taught correctly or not. Firstly, the data was dividing by ratio 80:20 from 1000 sentences. Therefore, 80% from 1000 sentences will use for training and 20% from 1000 sentences use for testing. Secondly, the data was dividing by ratio 70:30 which are 700 newspaper’s sentences as the training data and 300 newspaper’s sentences as the testing data. The experimental results show that our method can achieve better performance in clonal selection algorithm sentiment classification and the data collected cannot be used at once in this model because training data is very time-consuming if using all the data. The experiment achieves the best accuracy at 89.0%for ratio 70:30.This model was built with capability to help user in classifying newspaper sentence in easy way. |
format |
Thesis |
qualification_level |
Bachelor degree |
author |
Mohamad Nasir, Nur Fitri Nabila |
author_facet |
Mohamad Nasir, Nur Fitri Nabila |
author_sort |
Mohamad Nasir, Nur Fitri Nabila |
title |
Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir |
title_short |
Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir |
title_full |
Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir |
title_fullStr |
Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir |
title_full_unstemmed |
Sentiment classification for malay newspaper using clonal selection algorithm / Nur Fitri Nabila Mohamad Nasir |
title_sort |
sentiment classification for malay newspaper using clonal selection algorithm / nur fitri nabila mohamad nasir |
granting_institution |
Universiti Teknologi MARA Terengganu |
granting_department |
Faculty of Computer & Mathematical Sciences |
publishDate |
2013 |
url |
https://ir.uitm.edu.my/id/eprint/35333/1/35333.pdf |
_version_ |
1783734299407679488 |