Code clone detection using string based tree matching technique
Code cloning have been an issue in these few years as the number of available web application and stand alone software increase nowadays. The major consequences of cloning is that it would risk the maintenance process as there are many duplicated codes in the systems that practically increase the co...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2008
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/9508/1/NorfaradillaWahidMFSKSM2008.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-utm-ep.9508 |
---|---|
record_format |
uketd_dc |
spelling |
my-utm-ep.95082018-07-19T01:51:05Z Code clone detection using string based tree matching technique 2008-10 Wahid, Norfaradilla QA75 Electronic computers. Computer science Code cloning have been an issue in these few years as the number of available web application and stand alone software increase nowadays. The major consequences of cloning is that it would risk the maintenance process as there are many duplicated codes in the systems that practically increase the complexity of the system. There are many code clone detection techniques that can be found nowadays which generally can be group into string based, token based, tree based and semantic based. The aim of this project is to find out the possibility of using a technique of ontology mapping technique to solve the problem, but we are not using the real ontology for the clone detection. It has been prove that there is the possibility as it manages to detect clone code. In this thesis the clone detection is using two layers of detection; i.e. structural similarity and string based similarity. The structural similarity is by using subgraph miner where it capable to get the similar subtree between different files. And then we extract all elements of that particular subtree and treat the elements as a string. Two strings from different files then applied with similarity metric to know whether it is a clone pair. From the experimental result, we found that the system is language independent but the result is good in precision but not so good recall. It is also capable to detect two main types of clone, i.e identical clones and similar clones. 2008-10 Thesis http://eprints.utm.my/id/eprint/9508/ http://eprints.utm.my/id/eprint/9508/1/NorfaradillaWahidMFSKSM2008.pdf application/pdf en public masters Universiti Teknologi Malaysia, Faculty of Computer Science and Information System Faculty of Computer Science and Information System |
institution |
Universiti Teknologi Malaysia |
collection |
UTM Institutional Repository |
language |
English |
topic |
QA75 Electronic computers Computer science |
spellingShingle |
QA75 Electronic computers Computer science Wahid, Norfaradilla Code clone detection using string based tree matching technique |
description |
Code cloning have been an issue in these few years as the number of available web application and stand alone software increase nowadays. The major consequences of cloning is that it would risk the maintenance process as there are many duplicated codes in the systems that practically increase the complexity of the system. There are many code clone detection techniques that can be found nowadays which generally can be group into string based, token based, tree based and semantic based. The aim of this project is to find out the possibility of using a technique of ontology mapping technique to solve the problem, but we are not using the real ontology for the clone detection. It has been prove that there is the possibility as it manages to detect clone code. In this thesis the clone detection is using two layers of detection; i.e. structural similarity and string based similarity. The structural similarity is by using subgraph miner where it capable to get the similar subtree between different files. And then we extract all elements of that particular subtree and treat the elements as a string. Two strings from different files then applied with similarity metric to know whether it is a clone pair. From the experimental result, we found that the system is language independent but the result is good in precision but not so good recall. It is also capable to detect two main types of clone, i.e identical clones and similar clones. |
format |
Thesis |
qualification_level |
Master's degree |
author |
Wahid, Norfaradilla |
author_facet |
Wahid, Norfaradilla |
author_sort |
Wahid, Norfaradilla |
title |
Code clone detection using string based tree matching technique |
title_short |
Code clone detection using string based tree matching technique |
title_full |
Code clone detection using string based tree matching technique |
title_fullStr |
Code clone detection using string based tree matching technique |
title_full_unstemmed |
Code clone detection using string based tree matching technique |
title_sort |
code clone detection using string based tree matching technique |
granting_institution |
Universiti Teknologi Malaysia, Faculty of Computer Science and Information System |
granting_department |
Faculty of Computer Science and Information System |
publishDate |
2008 |
url |
http://eprints.utm.my/id/eprint/9508/1/NorfaradillaWahidMFSKSM2008.pdf |
_version_ |
1747814742835068928 |