Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum

Speech is structured in cohesive sequences of information units defined with unique speaker-related intonation patterns, known as prosody. To capture the communicative intention of a speaker, listeners first focus on the change of rising and/or falling prosody patterns (known as speech events) on wo...

Full description

Saved in:
Bibliographic Details
Main Author: Mohamed Hanum, Haslizatul Fairuz
Format: Thesis
Language:English
Published: 2021
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/54987/1/54987.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-uitm-ir.54987
record_format uketd_dc
spelling my-uitm-ir.549872022-02-16T04:57:52Z Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum 2021-10 Mohamed Hanum, Haslizatul Fairuz Language. Linguistic theory. Comparative grammar Phonology. Phonetics Speech is structured in cohesive sequences of information units defined with unique speaker-related intonation patterns, known as prosody. To capture the communicative intention of a speaker, listeners first focus on the change of rising and/or falling prosody patterns (known as speech events) on words and phrasing to capture the degree of information conveyed. Secondly, into national breaks are used by speakers to convey speech content in a sequence of smaller sub-segments known as phrasing. Incorrect perception of the events and boundaries causes the communicative intention to be falsely understood by listeners. Thus, previous works evaluate the role of boundaries in languages like English, Swedish, Japanese, and Chinese to help automatic speech recognition and understanding domain. The main issue addressed by previous work on boundary classification is the disagreement on prosody event labeling. Boundaries are falsely classified due to limitations on expert evaluation of prosody events and lack of a prosody standard. Also, evaluating how the pause and intonation variations discriminate between a phrase with a continual content from other subsequent phrases that carry the end (final) content is still an open question. Limited work on the role of prosody is often defined based on specific language lexical rules, which degrades understanding of phrasing and speech content for the under-resourced language. As related research are limited to prosody stress indicating prominence on individual Malay words; thus, processes for defining the prosody governing Malay phrase boundaries are essential. This research aims to evaluate the correlates of prosody on boundaries with a deep (finality) or a shallow (continual) content. Dataset for the boundary classification task contains formal statements recorded from the debate session from the Malaysian Parliamentary Speech recordings. In the first phase, a refined set of Rapid Phrase Prosody Tasks (RPIT) instructions labelled the speech signal with perceived continual or final content. Responses from four male and four female volunteers are analysed using KAPPA and Krippendorff analysis to construct a Malay phrase (MySP) boundaries dataset with an average of 85% agreement on perceived boundary labels from the first research phase. Pitch regression and the rise-fall-connection (RFC) contour parameters are used to model the role of prosody on the perceived boundaries. A new phrase strength correlates computed as slope feature from the highest nuclei point towards the end of the boundary word region. The role of each vector combined with the word and silence durations to signify each deep (final) and shallow (continual) boundary is tested with supervised classifiers. The supervised K-Nearest Neighbour (KNN), Random Forest (RF), and Logistic Regression (Log-Reg) models predicted the boundary classes with up to 75% accuracy. A higher degree of slope excursion is observed on boundaries that the listeners perceived as the deep boundaries (evaluated with finality in speech content through the RPIT) and used to improve classification results on up to 20% of the falsely classified boundaries. This study contributes to the Malay prosody knowledge and classification of phrasing the boundaries 2021-10 Thesis https://ir.uitm.edu.my/id/eprint/54987/ https://ir.uitm.edu.my/id/eprint/54987/1/54987.pdf text en public phd doctoral Universiti Teknologi MARA Faculty of Computer and Mathematical Sciences Abdullah, Nur Atiqah Sia (Assoc. Prof. Ts. Dr. )
institution Universiti Teknologi MARA
collection UiTM Institutional Repository
language English
advisor Abdullah, Nur Atiqah Sia (Assoc. Prof. Ts. Dr. )
topic Language
Linguistic theory
Comparative grammar
Language
Linguistic theory
Comparative grammar
spellingShingle Language
Linguistic theory
Comparative grammar
Language
Linguistic theory
Comparative grammar
Mohamed Hanum, Haslizatul Fairuz
Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
description Speech is structured in cohesive sequences of information units defined with unique speaker-related intonation patterns, known as prosody. To capture the communicative intention of a speaker, listeners first focus on the change of rising and/or falling prosody patterns (known as speech events) on words and phrasing to capture the degree of information conveyed. Secondly, into national breaks are used by speakers to convey speech content in a sequence of smaller sub-segments known as phrasing. Incorrect perception of the events and boundaries causes the communicative intention to be falsely understood by listeners. Thus, previous works evaluate the role of boundaries in languages like English, Swedish, Japanese, and Chinese to help automatic speech recognition and understanding domain. The main issue addressed by previous work on boundary classification is the disagreement on prosody event labeling. Boundaries are falsely classified due to limitations on expert evaluation of prosody events and lack of a prosody standard. Also, evaluating how the pause and intonation variations discriminate between a phrase with a continual content from other subsequent phrases that carry the end (final) content is still an open question. Limited work on the role of prosody is often defined based on specific language lexical rules, which degrades understanding of phrasing and speech content for the under-resourced language. As related research are limited to prosody stress indicating prominence on individual Malay words; thus, processes for defining the prosody governing Malay phrase boundaries are essential. This research aims to evaluate the correlates of prosody on boundaries with a deep (finality) or a shallow (continual) content. Dataset for the boundary classification task contains formal statements recorded from the debate session from the Malaysian Parliamentary Speech recordings. In the first phase, a refined set of Rapid Phrase Prosody Tasks (RPIT) instructions labelled the speech signal with perceived continual or final content. Responses from four male and four female volunteers are analysed using KAPPA and Krippendorff analysis to construct a Malay phrase (MySP) boundaries dataset with an average of 85% agreement on perceived boundary labels from the first research phase. Pitch regression and the rise-fall-connection (RFC) contour parameters are used to model the role of prosody on the perceived boundaries. A new phrase strength correlates computed as slope feature from the highest nuclei point towards the end of the boundary word region. The role of each vector combined with the word and silence durations to signify each deep (final) and shallow (continual) boundary is tested with supervised classifiers. The supervised K-Nearest Neighbour (KNN), Random Forest (RF), and Logistic Regression (Log-Reg) models predicted the boundary classes with up to 75% accuracy. A higher degree of slope excursion is observed on boundaries that the listeners perceived as the deep boundaries (evaluated with finality in speech content through the RPIT) and used to improve classification results on up to 20% of the falsely classified boundaries. This study contributes to the Malay prosody knowledge and classification of phrasing the boundaries
format Thesis
qualification_name Doctor of Philosophy (PhD.)
qualification_level Doctorate
author Mohamed Hanum, Haslizatul Fairuz
author_facet Mohamed Hanum, Haslizatul Fairuz
author_sort Mohamed Hanum, Haslizatul Fairuz
title Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_short Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_full Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_fullStr Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_full_unstemmed Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_sort prosodic contours for classification of perceived boundaries on malay phrasing / haslizatul fairuz mohamed hanum
granting_institution Universiti Teknologi MARA
granting_department Faculty of Computer and Mathematical Sciences
publishDate 2021
url https://ir.uitm.edu.my/id/eprint/54987/1/54987.pdf
_version_ 1783734900594049024