Comparison of different automatic text summarization systems using standard performance evaluations
There are many automatic summarization systems can be used to produce a summary from a single text documents. From the different automatic summarization system, it can be found that the system will produce a different content of summary results although the percentage of sentences out of whole singl...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/18202/1/NurhafizahAbdMunirMFSKSM2009.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-utm-ep.18202 |
---|---|
record_format |
uketd_dc |
spelling |
my-utm-ep.182022018-06-25T09:00:08Z Comparison of different automatic text summarization systems using standard performance evaluations 2009-04 Abd Munir, Nur Hafizah QA75 Electronic computers. Computer science There are many automatic summarization systems can be used to produce a summary from a single text documents. From the different automatic summarization system, it can be found that the system will produce a different content of summary results although the percentage of sentences out of whole single text document is setting to the same value. Therefore, in this study, three automatic summarization systems are used to produce the summary results; Microsoft Word Automatic Summarization, Shvoong Summarization and Simple Text Summarization in PHP. The performance of those results are investigated and measured using standard performance evaluation such recall, precision and f-measure. The dataset collection used in this study is collected from The New Straits Time and The Stars online and it is about Iskandar Region Development Authority (IRDA). Two automatic summarization system are already existed which is Microsoft Word Automatic Summarization and Shvoong Summarization and only one summarization system is coded in PHP language, there is Simple Text Summarization in PHP. Many operations have been applied in this coded system such as removing stop word, stemming, normalizing, creating weighted term-frequency and applying the technique. The results from those systems are stored into the database. In this study, about 50 articles are used. The comparison between different automatic summarization systems was made using standard performance evaluation. The performance evaluation is fully analyzed without depending on human evaluator. One program of analyzing the performance is coded in PERL language to produce a statistic of all summary results from those three automatic summarization systems. From the experimental results, it can be concluded that the Shvoong Summarization is the most effective automatic summarization system for single text document. 2009-04 Thesis http://eprints.utm.my/id/eprint/18202/ http://eprints.utm.my/id/eprint/18202/1/NurhafizahAbdMunirMFSKSM2009.pdf application/pdf en public masters Universiti Teknologi Malaysia, Faculty of Computer Science and Information System Faculty of Computer Science and Information System |
institution |
Universiti Teknologi Malaysia |
collection |
UTM Institutional Repository |
language |
English |
topic |
QA75 Electronic computers Computer science |
spellingShingle |
QA75 Electronic computers Computer science Abd Munir, Nur Hafizah Comparison of different automatic text summarization systems using standard performance evaluations |
description |
There are many automatic summarization systems can be used to produce a summary from a single text documents. From the different automatic summarization system, it can be found that the system will produce a different content of summary results although the percentage of sentences out of whole single text document is setting to the same value. Therefore, in this study, three automatic summarization systems are used to produce the summary results; Microsoft Word Automatic Summarization, Shvoong Summarization and Simple Text Summarization in PHP. The performance of those results are investigated and measured using standard performance evaluation such recall, precision and f-measure. The dataset collection used in this study is collected from The New Straits Time and The Stars online and it is about Iskandar Region Development Authority (IRDA). Two automatic summarization system are already existed which is Microsoft Word Automatic Summarization and Shvoong Summarization and only one summarization system is coded in PHP language, there is Simple Text Summarization in PHP. Many operations have been applied in this coded system such as removing stop word, stemming, normalizing, creating weighted term-frequency and applying the technique. The results from those systems are stored into the database. In this study, about 50 articles are used. The comparison between different automatic summarization systems was made using standard performance evaluation. The performance evaluation is fully analyzed without depending on human evaluator. One program of analyzing the performance is coded in PERL language to produce a statistic of all summary results from those three automatic summarization systems. From the experimental results, it can be concluded that the Shvoong Summarization is the most effective automatic summarization system for single text document. |
format |
Thesis |
qualification_level |
Master's degree |
author |
Abd Munir, Nur Hafizah |
author_facet |
Abd Munir, Nur Hafizah |
author_sort |
Abd Munir, Nur Hafizah |
title |
Comparison of different automatic text summarization systems using standard performance evaluations |
title_short |
Comparison of different automatic text summarization systems using standard performance evaluations |
title_full |
Comparison of different automatic text summarization systems using standard performance evaluations |
title_fullStr |
Comparison of different automatic text summarization systems using standard performance evaluations |
title_full_unstemmed |
Comparison of different automatic text summarization systems using standard performance evaluations |
title_sort |
comparison of different automatic text summarization systems using standard performance evaluations |
granting_institution |
Universiti Teknologi Malaysia, Faculty of Computer Science and Information System |
granting_department |
Faculty of Computer Science and Information System |
publishDate |
2009 |
url |
http://eprints.utm.my/id/eprint/18202/1/NurhafizahAbdMunirMFSKSM2009.pdf |
_version_ |
1747815217584144384 |