Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique

A Data Grid is an organized collection of nodes in a wide area network that contribute computation, data storage, and applications. In a Data Grid, large numbers of users are distributed across a wide area environment that is dynamic and heterogeneous. Data management is one of the current issu...

Bibliographic Details
Main Author: A. Radi, Mohammed A.
Format: Thesis
Language: English
Published: 2009
Subjects:
Online Access: http://psasir.upm.edu.my/id/eprint/7150/1/FSKTM_2009_7a.pdf
id my-upm-ir.7150
record_format uketd_dc
spelling my-upm-ir.7150 2013-05-27T07:33:43Z Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique 2009-01 A. Radi, Mohammed A. Database management - Case studies Thesis http://psasir.upm.edu.my/id/eprint/7150/ http://psasir.upm.edu.my/id/eprint/7150/1/FSKTM_2009_7a.pdf application/pdf en public phd doctoral Universiti Putra Malaysia Computer Science and Information Technology English
institution Universiti Putra Malaysia
collection PSAS Institutional Repository
language English
topic Database management - Case studies
spellingShingle Database management - Case studies A. Radi, Mohammed A. Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique
description A Data Grid is an organized collection of nodes in a wide area network that contribute computation, data storage, and applications. In a Data Grid, large numbers of users are distributed across a wide area environment that is dynamic and heterogeneous. Data management is one of the current issues, where data transparency, consistency, fault tolerance, automatic management, and performance are the user parameters in a grid environment. Data management techniques must scale up while addressing the autonomy, dynamicity, and heterogeneity of the data resources. Data replication is a well-known technique used to reduce access latency and improve availability and performance in a distributed computing environment. Replication introduces the problem of maintaining consistency among the replicas when files are allowed to be updated. Update information should be propagated to all replicas to guarantee correct reads of the remote replicas. Asynchronous replication is a commonly agreed solution to the replica consistency problem. A few studies have addressed replica consistency in Data Grids. However, the techniques they introduce are neither efficient nor scalable. They cannot be used in a real Data Grid because they do not address the large number of replica sites, large-scale distribution, load balancing, or site autonomy, that is, the capability of a grid site to join and leave the grid community at any time. This thesis proposes a new asynchronous replication protocol called Update Propagation Grid (UPG) to maintain replica consistency over a large-scale data grid. In UPG, updates reach all on-line secondary replicas using a propagation technique based on nodes organized into a logical network structured as a two-dimensional grid. The proposed update propagation technique is a dynamic, hybrid push-pull technique that addresses the issues of site autonomy, efficiency, scalability, load balancing, and fairness.
Two performance analysis studies were conducted to compare the proposed technique with other techniques. The first study involves mathematical and simulation analysis. The second study is based on a Queuing Network Model. The results show that the proposed technique scales well with a high number of replica sites and with high request loads. The results also show a reduction in the average update reach time of 5% to 97%. Moreover, the results show that the proposed technique is capable of achieving load balancing while providing update propagation fairness.
format Thesis
qualification_name Doctor of Philosophy (PhD.)
qualification_level Doctorate
author A. Radi, Mohammed A.
author_facet A. Radi, Mohammed A.
author_sort A. Radi, Mohammed A.
title Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique
title_short Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique
title_full Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique
title_fullStr Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique
title_full_unstemmed Maintaining Replica Consistency Over Large-Scale Data Grid Using Update Propagation Technique
title_sort maintaining replica consistency over large-scale data grid using update propagation technique
granting_institution Universiti Putra Malaysia
granting_department Computer Science and Information Technology
publishDate 2009
url http://psasir.upm.edu.my/id/eprint/7150/1/FSKTM_2009_7a.pdf
_version_ 1747810663637450752