2024-03-29T11:11:06Z
https://meral.edu.mm/oai
oai:meral.edu.mm:recid/3876
2021-12-13T01:12:29Z
1582963302567:1597824273898
user-ucsy
Chunk Tagged Corpus Creation for Myanmar Language
Myint, Phyu Hninn
Htwe, Tin Myat
Thein, Ni Lar
In the applications of Natural languageprocessing (NLP), sentence analysis is one of theimportant phases for machine translationsystems. Currently, no mature deep analysis thathas been worked done is available for Myanmarlanguage. To perform shallow parsing onsentences, the chunk identification is afundamental task. The POS tagged corpuscreation has been proposed in [8] and in thispaper, we have proposed a methodology forbuilding chunk tagged corpus for MyanmarLanguage. We use the POS tagged corpus that isproposed in [8] and identify chunks in MyanmarPOS tagged texts. Our approach uses rule-basedon how to identify all chunks in a Myanmarsentence. As a preprocessing step, normalizationof POS tags is needed to perform in order toproduce finer tags. Hence, normalization rulesare also developed. After normalization, chunkrules are applied to tag chunk for these finertags. Our chunk tagged corpus is very useful inMyanmar to English machine translation system.
2011-05-05
http://hdl.handle.net/20.500.12678/0000003876
https://meral.edu.mm/records/3876