MERAL Myanmar Education Research and Learning Portal
Item
{"_buckets": {"deposit": "25d3332e-0403-4721-9af1-ce5f05bd1884"}, "_deposit": {"created_by": 73, "id": "8005", "owner": "73", "owners": [73], "owners_ext": {"displayname": "", "username": ""}, "pid": {"revision_id": 0, "type": "depid", "value": "8005"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/00008005", "sets": ["user-miit"]}, "communities": ["miit"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Morpheme-Based Myanmar Word Segmenter", "subitem_1551255648112": "en"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "\"Myanmar script has no fixed delimiters between words or syllables. Therefore,\nto achieve meaningful and correct segmented words from the text is a\nchallenging task. This paper has proposed a morpheme-based Myanmar word\ntokenizer which combines rule-based syllable breaking and dictionary lookup\nsyllable merging methods with longest string matching approach. The\nproposed approach is tested on a Monolingual dictionary that contains useful\ninformation for the word segmentation. It also contains above 32,581 words\nincluding headwords, stop words and essential words with Myanmar3 font.\nThese words are collected from Myanmar and Essential Words dictionaries.\nAccording to the experimental results, it can provide the promising\nsegmentation accuracy of Myanmar text.\nKEYWORDS: Syllable breaking; Morpheme; \""}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "Syllable Breaking, Morpheme"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2021-02-09"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "Morpheme-Based Myanmar Word Segmenter.pdf", "filesize": [{"value": "1.1 MB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_3", "mimetype": "application/pdf", "size": 1100000.0, "url": {"url": "https://meral.edu.mm/record/8005/files/Morpheme-Based Myanmar Word Segmenter.pdf"}, "version_id": "8dbe0bf7-7440-4061-85c7-03659db08df9"}]}, "item_1583103131163": {"attribute_name": "Journal articles", "attribute_value_mlt": [{"subitem_issue": "Issue 5", "subitem_journal_title": "Interantional Journal of Trend in Sceentific Research and Development", "subitem_pages": "Pages 911-914", "subitem_volume": "Volume 3"}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Sin Thi Yar Myint"}, {"subitem_authors_fullname": "Hanni Htun"}, {"subitem_authors_fullname": "Myat Myo Nwe Wai"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Journal article"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2019-08-06"}, "item_title": "Morpheme-Based Myanmar Word Segmenter", "item_type_id": "21", "owner": "73", "path": ["1582963674932", "1597396989070"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000008005", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2021-02-09"}, "publish_date": "2021-02-09", "publish_status": "0", "recid": "8005", "relation": {}, "relation_version_is_last": true, "title": ["Morpheme-Based Myanmar Word Segmenter"], "weko_shared_id": -1}
Morpheme-Based Myanmar Word Segmenter
http://hdl.handle.net/20.500.12678/0000008005
http://hdl.handle.net/20.500.12678/00000080050631fa9d-98a4-41bc-9891-48428cbf3ef8
25d3332e-0403-4721-9af1-ce5f05bd1884
Name / File | License | Actions |
---|---|---|
![]() |
Publication type | ||||||
---|---|---|---|---|---|---|
Journal article | ||||||
Upload type | ||||||
Publication | ||||||
Title | ||||||
Title | Morpheme-Based Myanmar Word Segmenter | |||||
Language | en | |||||
Publication date | 2019-08-06 | |||||
Authors | ||||||
Sin Thi Yar Myint | ||||||
Hanni Htun | ||||||
Myat Myo Nwe Wai | ||||||
Description | ||||||
"Myanmar script has no fixed delimiters between words or syllables. Therefore, to achieve meaningful and correct segmented words from the text is a challenging task. This paper has proposed a morpheme-based Myanmar word tokenizer which combines rule-based syllable breaking and dictionary lookup syllable merging methods with longest string matching approach. The proposed approach is tested on a Monolingual dictionary that contains useful information for the word segmentation. It also contains above 32,581 words including headwords, stop words and essential words with Myanmar3 font. These words are collected from Myanmar and Essential Words dictionaries. According to the experimental results, it can provide the promising segmentation accuracy of Myanmar text. KEYWORDS: Syllable breaking; Morpheme; " |
||||||
Keywords | ||||||
Syllable Breaking, Morpheme | ||||||
Journal articles | ||||||
Issue 5 | ||||||
Interantional Journal of Trend in Sceentific Research and Development | ||||||
Pages 911-914 | ||||||
Volume 3 |