A new approach in infrequent itemset mining based on Eclat algorithm

Big data exponentially increases in the bulk of heterogeneous data. It requires an advanced technology to process, analyse and automatically transform the processed data into useful knowledge. Data mining has excellent potential in discovering the hidden knowledge within the databases. This hidden k...

Full description

Saved in:
Bibliographic Details
Main Author: Julaily Aida Jusoh (Author)
Format: Thesis Book
Language:English
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 03341cam a2200301 7i4500
001 0000098081
005 20210413090000.0
008 200310s2020 my eng
040 |a UniSZA 
050 0 0 |a QA76.9 
090 0 0 |a QA76.9   |b .J85 2020 
100 0 |a Julaily Aida Jusoh   |e author  
245 0 2 |a A new approach in infrequent itemset mining based on Eclat algorithm   |c Julaily Aida binti Jusoh. 
264 0 |c 2020. 
300 |a xxvi,190 leaves:   |b colour illustrations;   |c 30cm. 
336 |a text  |2 rdacontent 
337 |a unmediated  |2 rdamedia 
338 |a volume  |2 rdacarrier 
502 |a Thesis(Doctor of Philosophy) - Universiti Malaysia Terengganu,2020 
504 |a Includes bibliographical references (leaves 165-175) 
505 0 |a 1. Introduction -- 2. Literature review -- 3.Research methodology -- 4. R-eclat algorithm -- 5. Parallel r-eclat algorithm -- 6. Conclusion and future work 
520 |a Big data exponentially increases in the bulk of heterogeneous data. It requires an advanced technology to process, analyse and automatically transform the processed data into useful knowledge. Data mining has excellent potential in discovering the hidden knowledge within the databases. This hidden knowledge can lead to the association rule which may disclose useful pattern. Two significant patterns can be found in frequent and infrequent. Most of the previous infrequent mining techniques deal with the horizontal data format. evertheless, the current and emerging trend finds researchers dealing with a vertical data format. One example of a vertical rule mining algorithm is Equivalence Class Transformation (Eclat). The Eclat algorithm comprises four variants: tidset, diffset, sortdiffset and postdiffset which only employed for frequent itemset mining. This research will introduce a new version of the Eclat algorithm to fulfil the purpose of obtaining the infrequent itemset. In the early phase, a bit modification has been done on the Eclat algorithm and its variants which named R-Eclat where R refers to rare. As the previous Eclat algorithm, the R-Eclat also comprises four variants: r-tidset, r-diffset, r-sortdiffset and r-postdiffset. In the second phase, R-Eclat and its variants are executed by serial processing, but the duration of mining processing is time-consuming. In response to the promising results of mining in speedy processing time and less memory usage, R-Eclat is complemented with a parallel programming approach. At the third phase, a new parallel R-Eclat named as PR-Eclat is proposed to overcome the limitations of serial processing in speeding the running time. In PR-Eclat algorithms, the experimental results indicate that PR-Eclat outperforms the R-Eclat in average by 54% during execution time and the memory usage reduces at an average of 60% in infrequent itemset mining. This research confronts the issue of large database with single format only. Future work is recommended to highlight multiple databases and multiple data formats such as images, audio and video.  
610 2 0 |a Universiti Malaysia Terengganu --   |x Dissertations  
650 0 |a Algorithm Analysis and Problem Complexity  
650 0 |a Computer Graphics  
650 0 |a Data Structures  
710 2 |a Universiti Malaysia Terengganu  
999 |a 1000179010  |b Thesis  |c Reference  |e Tembila Bibliographic & Index Unit