Temporal integration based factorization to improve prediction accuracy of collaborative filtering
A recommender system provides users with personalized suggestions for items based on the user’s behaviour history. These systems often use collaborative filtering (CF) to analyse users’ preferences for items in the rating matrix. The rating matrix typically contains a high percentage of...
Main Author: | Al-Qasem, Al-Hadi Ismail Ahmed |
---|---|
Format: | Thesis |
Language: | English |
Published: | 2016 |
Subjects: | Recommender systems (Information filtering) |
Online Access: | http://psasir.upm.edu.my/id/eprint/69372/1/FSKTM%202016%2040%20IR.pdf |
id |
my-upm-ir.69372 |
---|---|
record_format |
uketd_dc |
spelling |
my-upm-ir.69372 2019-07-10T01:08:26Z Temporal integration based factorization to improve prediction accuracy of collaborative filtering 2016-11 Al-Qasem, Al-Hadi Ismail Ahmed A recommender system provides users with personalized suggestions for items based on the user’s behaviour history. These systems often use collaborative filtering (CF) to analyse users’ preferences for items in the rating matrix. The rating matrix typically contains a high percentage of unknown rating scores, which is known as the data sparsity problem. The data sparsity problem has been addressed by several approaches, such as Bayesian probabilistic models, machine learning, genetic algorithms, particle swarm optimization and matrix factorization. Matrix factorization combined with temporal approaches performs well in addressing the data sparsity problem, but its prediction accuracy remains low. Existing temporal-based factorization approaches use long-term and short-term preferences. The difference between them is that long-term preferences use the whole recorded preference history, while short-term preferences use the preferences recorded within a session (e.g. week, month, season, etc.). However, four issues arise when a factorization approach is adopted: latent feedback learning, score overfitting, drift in users’ interests and decay of items’ popularity over time. This study proposes three approaches: (i) the Ensemble Divide and Conquer (EDC) approach, which achieves accurate latent feedback learning; (ii) two personalized matrix factorization (MF) based temporal approaches, LongTemporalMF and ShortTemporalMF, which address overfitting during the optimization process, drift in users’ interests and decay of items’ popularity over time; and (iii) the TemporalMF++ approach, which addresses all four issues. The TemporalMF++ approach relies on the k-means algorithm and the bacterial foraging optimization algorithm. The Root Mean Squared Error (RMSE) metric is used to evaluate prediction accuracy. The proposed approaches are compared against factorization approaches such as Singular Value Decomposition, Baseline, Matrix Factorization and Neighbours based Baseline. In addition, the Temporal Dynamics, Short-Term based Latent, Short-Term based Baseline, Long-Term and Temporal Interaction approaches are used as benchmarks. The experiments use three real-world datasets: MovieLens, Epinions and Netflix Prize. The experimental results show that the TemporalMF++ approach achieves higher prediction accuracy than EDC, LongTemporalMF and ShortTemporalMF. It also achieves higher prediction accuracy than the factorization and temporal benchmark approaches. In summary, the TemporalMF++ approach is more effective at improving the prediction accuracy of CF by learning temporal behaviour. Recommender systems (Information filtering) 2016-11 Thesis http://psasir.upm.edu.my/id/eprint/69372/ http://psasir.upm.edu.my/id/eprint/69372/1/FSKTM%202016%2040%20IR.pdf text en public doctoral Universiti Putra Malaysia Recommender systems (Information filtering) |
institution |
Universiti Putra Malaysia |
collection |
PSAS Institutional Repository |
language |
English |
topic |
Recommender systems (Information filtering) |
spellingShingle |
Recommender systems (Information filtering) Al-Qasem, Al-Hadi Ismail Ahmed Temporal integration based factorization to improve prediction accuracy of collaborative filtering |
description |
A recommender system provides users with personalized suggestions for items based on the user’s behaviour history. These systems often use collaborative filtering (CF) to analyse users’ preferences for items in the rating matrix. The rating matrix typically contains a high percentage of unknown rating scores, which is known as the data sparsity problem. The data sparsity problem has been addressed by several approaches, such as Bayesian probabilistic models, machine learning, genetic algorithms, particle swarm optimization and matrix factorization. Matrix factorization combined with temporal approaches performs well in addressing the data sparsity problem, but its prediction accuracy remains low. Existing temporal-based factorization approaches use long-term and short-term preferences. The difference between them is that long-term preferences use the whole recorded preference history, while short-term preferences use the preferences recorded within a session (e.g. week, month, season, etc.). However, four issues arise when a factorization approach is adopted: latent feedback learning, score overfitting, drift in users’ interests and decay of items’ popularity over time. This study proposes three approaches: (i) the Ensemble Divide and Conquer (EDC) approach, which achieves accurate latent feedback learning; (ii) two personalized matrix factorization (MF) based temporal approaches, LongTemporalMF and ShortTemporalMF, which address overfitting during the optimization process, drift in users’ interests and decay of items’ popularity over time; and (iii) the TemporalMF++ approach, which addresses all four issues. The TemporalMF++ approach relies on the k-means algorithm and the bacterial foraging optimization algorithm. The Root Mean Squared Error (RMSE) metric is used to evaluate prediction accuracy. The proposed approaches are compared against factorization approaches such as Singular Value Decomposition, Baseline, Matrix Factorization and Neighbours based Baseline. In addition, the Temporal Dynamics, Short-Term based Latent, Short-Term based Baseline, Long-Term and Temporal Interaction approaches are used as benchmarks. The experiments use three real-world datasets: MovieLens, Epinions and Netflix Prize. The experimental results show that the TemporalMF++ approach achieves higher prediction accuracy than EDC, LongTemporalMF and ShortTemporalMF. It also achieves higher prediction accuracy than the factorization and temporal benchmark approaches. In summary, the TemporalMF++ approach is more effective at improving the prediction accuracy of CF by learning temporal behaviour. |
format |
Thesis |
qualification_level |
Doctorate |
author |
Al-Qasem, Al-Hadi Ismail Ahmed |
title |
Temporal integration based factorization to improve prediction accuracy of collaborative filtering |
granting_institution |
Universiti Putra Malaysia |
publishDate |
2016 |
url |
http://psasir.upm.edu.my/id/eprint/69372/1/FSKTM%202016%2040%20IR.pdf |
_version_ |
1747812689902567424 |
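
The abstract above evaluates matrix factorization approaches with the Root Mean Squared Error metric. As a rough illustration only, the sketch below shows a plain regularized matrix factorization trained by stochastic gradient descent on the known entries of a sparse rating matrix, together with an RMSE function. It is not the thesis's EDC, LongTemporalMF, ShortTemporalMF or TemporalMF++ method and includes no temporal component; the function names (`rmse`, `predict`, `factorize`), the hyperparameters and the toy ratings are all assumptions made for this example.

```python
import math
import random


def rmse(predicted, actual):
    """Root Mean Squared Error between predicted and true ratings."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return math.sqrt(squared / len(actual))


def predict(P, Q, user, item):
    """Predicted rating: dot product of the user and item factor vectors."""
    return sum(pf * qf for pf, qf in zip(P[user], Q[item]))


def factorize(ratings, n_users, n_items, k=2, lr=0.02, reg=0.02,
              epochs=500, seed=0):
    """Fit user factors P and item factors Q by SGD on observed ratings only.

    Hypothetical helper for illustration; hyperparameters are arbitrary.
    """
    rng = random.Random(seed)
    P = [[rng.uniform(0.0, 0.5) for _ in range(k)] for _ in range(n_users)]
    Q = [[rng.uniform(0.0, 0.5) for _ in range(k)] for _ in range(n_items)]
    for _ in range(epochs):
        for user, item, rating in ratings:
            err = rating - predict(P, Q, user, item)
            for f in range(k):
                pu, qi = P[user][f], Q[item][f]
                # Gradient step with L2 regularization to limit overfitting.
                P[user][f] += lr * (err * qi - reg * pu)
                Q[item][f] += lr * (err * pu - reg * qi)
    return P, Q


if __name__ == "__main__":
    # Toy (user, item, rating) triples; unlisted pairs are unknown ratings.
    toy = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0),
           (2, 1, 2.0), (2, 2, 5.0)]
    P, Q = factorize(toy, n_users=3, n_items=3)
    fitted = [predict(P, Q, u, i) for u, i, _ in toy]
    print("training RMSE:", rmse(fitted, [r for _, _, r in toy]))
```

Training only on the observed triples is what lets factorization cope with a sparse rating matrix: the unknown entries are never treated as zeros, and their predictions come from the learned factors. A short-term variant in the spirit of the abstract would restrict `ratings` to those recorded within one session before fitting, though how the thesis does this is not specified here.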