MERAL Myanmar Education Research and Learning Portal
Item
{"_buckets": {"deposit": "1bc7f1eb-80e3-415d-a7ff-0e5eada2431f"}, "_deposit": {"id": "3092", "owners": [], "pid": {"revision_id": 0, "type": "recid", "value": "3092"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/3092", "sets": ["user-ytu"]}, "communities": ["ytu"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Normal and Whispered Speech Recognition Systems for Myanmar Digits", "subitem_1551255648112": "en"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "\u003cp\u003eNowadays, Automatic speech recognition (ASR) technology comes as the popular innovation in human machine interaction. This technology allows a computer to recognize the spoken words and convert them to text data. In designing the computer systems that recognize spoken words, one of the challenging tasks is to be recognized spoken Myanmar digits. In this paper we focus on recognizing Myanmar digits spoken by normal voice and whispered voice. Myanmar digits recognition system for both types has been developed by using Hidden Markov Model in HTK tools and Mel Frequency Cepstral Coefficients (MFCC) technique has been used to convert the speech waveform into a set of feature vectors for recognizing the vocalization of a word. In our experiments, HMMbased acoustic and language models are used to evaluate the performance of speech recognizer for both speaker dependent and speaker independent. According to the experimental results, the performance of speaker dependent speech recognition system for normal voice and whispered voice are 90% and 88.7% respectively. The performance of speaker independent speech recognition system for normal voice and whispered voice are 67.3% and 65.7% respectively. We found that the performance of both type of speaker dependent is higher than those of speaker independent.\u003c/p\u003e"}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "Automatic Speech Recognition"}, {"interim": "Hidden Markov Model"}, {"interim": "HTK tools"}, {"interim": "MFCC"}, {"interim": "Dependent and Independent Speakers"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2019-04-01"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "Normal and Whispered Speech Recognition Systems for Myanmar Digits.pdf", "filesize": [{"value": "744 Kb"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "mimetype": "application/pdf", "size": 744000.0, "url": {"url": "https://meral.edu.mm/record/3092/files/Normal and Whispered Speech Recognition Systems for Myanmar Digits.pdf"}, "version_id": "725126d1-dcae-41a3-a67d-2751c4eee667"}]}, "item_1583103131163": {"attribute_name": "Journal articles", "attribute_value_mlt": [{"subitem_issue": "Issue 11", "subitem_journal_title": "International Journal of Science and Engineering Applications", "subitem_pages": "465-469", "subitem_volume": "Volume 7"}]}, "item_1583103147082": {"attribute_name": "Conference papers", "attribute_value_mlt": [{"subitem_acronym": "", "subitem_c_date": "", "subitem_conference_title": "", "subitem_part": "", "subitem_place": "", "subitem_session": "", "subitem_website": ""}]}, "item_1583103211336": {"attribute_name": "Books/reports/chapters", "attribute_value_mlt": [{"subitem_book_title": "", "subitem_isbn": "", "subitem_pages": "", "subitem_place": "", "subitem_publisher": ""}]}, "item_1583103233624": {"attribute_name": "Thesis/dissertations", "attribute_value_mlt": [{"subitem_awarding_university": "", "subitem_supervisor(s)": [{"subitem_supervisor": ""}]}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Nyein Nyein Oo"}, {"subitem_authors_fullname": "Masaru Yamashita"}, {"subitem_authors_fullname": "Shoichi Matsunaga"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Journal article"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2018-11-14"}, "item_1583159847033": {"attribute_name": "Identifier", "attribute_value": "10.5281/zenodo.2620095"}, "item_title": "Normal and Whispered Speech Recognition Systems for Myanmar Digits", "item_type_id": "21", "owner": "1", "path": ["1596119372420"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000003092", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2019-04-01"}, "publish_date": "2019-04-01", "publish_status": "0", "recid": "3092", "relation": {}, "relation_version_is_last": true, "title": ["Normal and Whispered Speech Recognition Systems for Myanmar Digits"], "weko_shared_id": -1}
Normal and Whispered Speech Recognition Systems for Myanmar Digits
http://hdl.handle.net/20.500.12678/0000003092
http://hdl.handle.net/20.500.12678/000000309284f12bd1-0721-46fc-8a0a-9cbd2bcb366f
1bc7f1eb-80e3-415d-a7ff-0e5eada2431f
Name / File | License | Actions |
---|---|---|
![]() |
|
Publication type | ||||||
---|---|---|---|---|---|---|
Journal article | ||||||
Upload type | ||||||
Publication | ||||||
Title | ||||||
Title | Normal and Whispered Speech Recognition Systems for Myanmar Digits | |||||
Language | en | |||||
Publication date | 2018-11-14 | |||||
Authors | ||||||
Nyein Nyein Oo | ||||||
Masaru Yamashita | ||||||
Shoichi Matsunaga | ||||||
Description | ||||||
<p>Nowadays, Automatic speech recognition (ASR) technology comes as the popular innovation in human machine interaction. This technology allows a computer to recognize the spoken words and convert them to text data. In designing the computer systems that recognize spoken words, one of the challenging tasks is to be recognized spoken Myanmar digits. In this paper we focus on recognizing Myanmar digits spoken by normal voice and whispered voice. Myanmar digits recognition system for both types has been developed by using Hidden Markov Model in HTK tools and Mel Frequency Cepstral Coefficients (MFCC) technique has been used to convert the speech waveform into a set of feature vectors for recognizing the vocalization of a word. In our experiments, HMMbased acoustic and language models are used to evaluate the performance of speech recognizer for both speaker dependent and speaker independent. According to the experimental results, the performance of speaker dependent speech recognition system for normal voice and whispered voice are 90% and 88.7% respectively. The performance of speaker independent speech recognition system for normal voice and whispered voice are 67.3% and 65.7% respectively. We found that the performance of both type of speaker dependent is higher than those of speaker independent.</p> | ||||||
Keywords | ||||||
Automatic Speech Recognition, Hidden Markov Model, HTK tools, MFCC, Dependent and Independent Speakers | ||||||
Identifier | 10.5281/zenodo.2620095 | |||||
Journal articles | ||||||
Issue 11 | ||||||
International Journal of Science and Engineering Applications | ||||||
465-469 | ||||||
Volume 7 | ||||||
Conference papers | ||||||
Books/reports/chapters | ||||||
Thesis/dissertations |