MERAL Myanmar Education Research and Learning Portal
Item
{"_buckets": {"deposit": "92191fff-cf9d-4f2e-9f5f-b21aa2b15c76"}, "_deposit": {"created_by": 48, "id": "7113", "owner": "48", "owners": [48], "owners_ext": {"displayname": "kyarnyoaye", "username": "kyarnyoaye"}, "pid": {"revision_id": 0, "type": "depid", "value": "7113"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/00007113", "sets": ["1597824175385", "user-ucsy"]}, "communities": ["ucsy"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Dependency Head Annotation for Myanmar Dependency Treebank", "subitem_1551255648112": "my"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "Complete manual annotation of dependency treebank needs resources like annotators and\nannotation tools and takes long time and has high possibility of inconsistent annotations\nfor free word order languages such as Myanmar. This paper describes a dependency head\nannotation scheme with Universal part-of-speech and Universal Dependencies for\nMyanmar dependency treebank. Currently 22,810 sentences and 680,218 tokens were\nannotated from three corpora for Myanmar dependency treebank. Some language specific\nissues are also described with examples. Raw syntactic structures were annotated\nautomatically by UDPipe according to the Universal Dependencies based on Universalpart-of-speech tag scheme. Then unsupervised annotated dependency head structures have\nbeen manually updated in post processing. To be reliable and speedy post process with\nreduced errors for manual updating, selected sentences were added to the training data\nafter being updated. After that the model has been retrained and the remaining sentences\nwere parsed by UDPipe. Post processing was repeated until all sentences were updated.\nSome specifications of dependency annotation schemes in sentences encountered in post\nprocessing are presented with examples. For parsing performance of annotated data, cross\nvalidation tests and parsing experiments were performed. Moreover, annotated treebank\ndata have also been evaluated by CoNLL 2017 evaluation script for parsing performance.\nResults of parsing experiments and evaluation are also reported by unlabeled and labeled\nattachment scores and demonstrated that the proposed method is a suitable way for\nbuilding Myanmar dependency trees. Moreover, syntax structures of treebank are also\nanalyzed and syntax information is also presented. This dependency head annotation for\ndependency treebank is the first work for Myanmar language as far as we know."}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "Dependency head"}, {"interim": "Universal Dependencies"}, {"interim": "Treebank"}, {"interim": "Annotation schemes"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2020-12-31"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "Dependency Head Annotation for Myanmar Dependency Treebank.pdf", "filesize": [{"value": "667 KB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_0", "mimetype": "application/pdf", "size": 667000.0, "url": {"url": "https://meral.edu.mm/record/7113/files/Dependency Head Annotation for Myanmar Dependency Treebank.pdf"}, "version_id": "9fda2eb6-f581-44ec-bc38-ce6cf7d5024e"}]}, "item_1583103131163": {"attribute_name": "Journal articles", "attribute_value_mlt": [{"subitem_issue": "6", "subitem_journal_title": "Advances in Science, Technology and Engineering Systems Journal", "subitem_pages": "788-800", "subitem_volume": "5"}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Hnin Thu Zar Aye"}, {"subitem_authors_fullname": "Win Pa Pa"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Journal article"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2020-11-24"}, "item_1583159847033": {"attribute_name": "Identifier", "attribute_value": "10.25046/aj050694"}, "item_title": "Dependency Head Annotation for Myanmar Dependency Treebank", "item_type_id": "21", "owner": "48", "path": ["1597824175385"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000007113", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2020-12-31"}, "publish_date": "2020-12-31", "publish_status": "0", "recid": "7113", "relation": {}, "relation_version_is_last": true, "title": ["Dependency Head Annotation for Myanmar Dependency Treebank"], "weko_shared_id": -1}
Dependency Head Annotation for Myanmar Dependency Treebank
http://hdl.handle.net/20.500.12678/0000007113
http://hdl.handle.net/20.500.12678/0000007113b0fc098a-5fd7-434d-8aa5-5fbc6b9c3a3d
92191fff-cf9d-4f2e-9f5f-b21aa2b15c76
Name / File | License | Actions |
---|---|---|
![]() |
Publication type | ||||||
---|---|---|---|---|---|---|
Journal article | ||||||
Upload type | ||||||
Publication | ||||||
Title | ||||||
Title | Dependency Head Annotation for Myanmar Dependency Treebank | |||||
Language | my | |||||
Publication date | 2020-11-24 | |||||
Authors | ||||||
Hnin Thu Zar Aye | ||||||
Win Pa Pa | ||||||
Description | ||||||
Complete manual annotation of dependency treebank needs resources like annotators and annotation tools and takes long time and has high possibility of inconsistent annotations for free word order languages such as Myanmar. This paper describes a dependency head annotation scheme with Universal part-of-speech and Universal Dependencies for Myanmar dependency treebank. Currently 22,810 sentences and 680,218 tokens were annotated from three corpora for Myanmar dependency treebank. Some language specific issues are also described with examples. Raw syntactic structures were annotated automatically by UDPipe according to the Universal Dependencies based on Universalpart-of-speech tag scheme. Then unsupervised annotated dependency head structures have been manually updated in post processing. To be reliable and speedy post process with reduced errors for manual updating, selected sentences were added to the training data after being updated. After that the model has been retrained and the remaining sentences were parsed by UDPipe. Post processing was repeated until all sentences were updated. Some specifications of dependency annotation schemes in sentences encountered in post processing are presented with examples. For parsing performance of annotated data, cross validation tests and parsing experiments were performed. Moreover, annotated treebank data have also been evaluated by CoNLL 2017 evaluation script for parsing performance. Results of parsing experiments and evaluation are also reported by unlabeled and labeled attachment scores and demonstrated that the proposed method is a suitable way for building Myanmar dependency trees. Moreover, syntax structures of treebank are also analyzed and syntax information is also presented. This dependency head annotation for dependency treebank is the first work for Myanmar language as far as we know. |
||||||
Keywords | ||||||
Dependency head, Universal Dependencies, Treebank, Annotation schemes | ||||||
Identifier | 10.25046/aj050694 | |||||
Journal articles | ||||||
6 | ||||||
Advances in Science, Technology and Engineering Systems Journal | ||||||
788-800 | ||||||
5 |