Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara

Classification of speech into voiced, unvoiced and silence (V/UV/S) regions is an important process in many speech processing applications such as speech synthesis, segmentation and speech recognition system. Two such measures are investigated with respect to their ability to discern voiced/unvoiced...

詳細記述

書誌詳細
第一著者: Abdul Rahman, Ahmad Idil
フォーマット: 学位論文
言語:英語
出版事項: 2005
主題:
オンライン・アクセス:http://eprints.utm.my/34995/1/AhmadIdilAbdulMFKE2005.pdf
_version_ 1846216315126153216
author Abdul Rahman, Ahmad Idil
author_facet Abdul Rahman, Ahmad Idil
author_sort Abdul Rahman, Ahmad Idil
description Classification of speech into voiced, unvoiced and silence (V/UV/S) regions is an important process in many speech processing applications such as speech synthesis, segmentation and speech recognition system. Two such measures are investigated with respect to their ability to discern voiced/unvoiced and silence segments of speech. They are the Instantaneous Energy (IE) and Local Time Correlation (LTC) method. Both IE and LTC methods are recently proposed technique for nonstationary signal analysis and have been successfully applied to speech processing. A comparative study was made using these two algorithms for classifying a given speech segment into two classes: voiced/unvoiced speech and silence. IE and LTC methods were proposed to remove all the silent intervals in speech sample. Experiment are carried out using Linear Predictive Coding (LPC) and Dynamic Time Warping (DTW) for isolated digit recognition in Bahasa Malaysia. The technique without silent removal LPC-DTW gives a recognition accuracy of 98.28%. With detection and removing of silent interval, both technique IE-LPCDTW and LTC-LPC-DTW gives a recognition accuracy of 98%. The system then are applied for training and testing for connected digit recognition. The segmentation of input string of the digits are carried out using IE and LTC techniques. Connected digit recognition using IE-LPC-DTW had 93.3% digit accuracy and 78% digit string. However using LTC-LPC-DTW the performance decreased to 93.2% and 77.7% respectively.
format Thesis
id uthm-34995
institution Universiti Teknologi Malaysia
language English
publishDate 2005
record_format eprints
spelling uthm-349952017-10-11T04:36:19Z http://eprints.utm.my/34995/ Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara Abdul Rahman, Ahmad Idil Unspecified Classification of speech into voiced, unvoiced and silence (V/UV/S) regions is an important process in many speech processing applications such as speech synthesis, segmentation and speech recognition system. Two such measures are investigated with respect to their ability to discern voiced/unvoiced and silence segments of speech. They are the Instantaneous Energy (IE) and Local Time Correlation (LTC) method. Both IE and LTC methods are recently proposed technique for nonstationary signal analysis and have been successfully applied to speech processing. A comparative study was made using these two algorithms for classifying a given speech segment into two classes: voiced/unvoiced speech and silence. IE and LTC methods were proposed to remove all the silent intervals in speech sample. Experiment are carried out using Linear Predictive Coding (LPC) and Dynamic Time Warping (DTW) for isolated digit recognition in Bahasa Malaysia. The technique without silent removal LPC-DTW gives a recognition accuracy of 98.28%. With detection and removing of silent interval, both technique IE-LPCDTW and LTC-LPC-DTW gives a recognition accuracy of 98%. The system then are applied for training and testing for connected digit recognition. The segmentation of input string of the digits are carried out using IE and LTC techniques. Connected digit recognition using IE-LPC-DTW had 93.3% digit accuracy and 78% digit string. However using LTC-LPC-DTW the performance decreased to 93.2% and 77.7% respectively. 2005 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/34995/1/AhmadIdilAbdulMFKE2005.pdf Abdul Rahman, Ahmad Idil (2005) Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara. Masters thesis, Universiti Teknologi Malaysia, Faculty of Electrical Engineering. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:61230?queryType=vitalDismax&query=Teknik-teknik+mengenalpasti+sela+masa+senyap+&public=true
spellingShingle Unspecified
Abdul Rahman, Ahmad Idil
Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara
title Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara
title_full Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara
title_fullStr Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara
title_full_unstemmed Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara
title_short Teknik-teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara
title_sort teknik teknik mengenalpasti sela masa senyap dalam sistem pengecaman suara
topic Unspecified
url http://eprints.utm.my/34995/1/AhmadIdilAbdulMFKE2005.pdf
url-record http://eprints.utm.my/34995/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:61230?queryType=vitalDismax&query=Teknik-teknik+mengenalpasti+sela+masa+senyap+&public=true
work_keys_str_mv AT abdulrahmanahmadidil teknikteknikmengenalpastiselamasasenyapdalamsistempengecamansuara