MERAL Myanmar Education Research and Learning Portal
Item
{"_buckets": {"deposit": "1a715df1-e735-44a6-93d8-31d6912e9467"}, "_deposit": {"created_by": 45, "id": "6273", "owner": "45", "owners": [45], "owners_ext": {"displayname": "", "username": ""}, "pid": {"revision_id": 0, "type": "recid", "value": "6273"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/6273", "sets": ["1605779935331", "user-uit"]}, "communities": ["uit"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Feature Selection for Categorization of Online News Articles in Myanmar Language", "subitem_1551255648112": "en"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "In text mining, the feature selection plays an important role to reduce the high dimensionality of feature space. It can improve the accuracy of the document clustering process and help to avoid overfitting problem. Nowadays, the enormous amount of news article documents is widely available on the internet due to the rapid development of the web. Consequently, there is an urgent need to extract useful content from overloaded information. The categorization of online text documents is crucial to avoid information overload and it can help readers to find rapidly their interesting topic. The problem arises for text categorization is the large number of features space. This study has two phases, documents preprocessing and feature selection. Document preprocessing contains documents collection, syllable segmentation, word segmentation, removing stop words for extracting features from the collection of Myanmar online news documents including sport, health, crime etc. In this study, TF-IDF weighting method is adapted for feature selection. The experimental result shows the adapted TF-IDF method has higher performance than based TF-IDF method."}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "Feature Selection"}, {"interim": "TF-IDF"}, {"interim": "Syllable Segmentation"}, {"interim": "Word Segmentation"}, {"interim": "Myanmar Online News"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2020-11-19"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "Feature Selection for Categorization of Online News Articles in Myanmar Language.pdf", "filesize": [{"value": "1.5 Mb"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensefree": "© 2017 ICAIT", "licensetype": "license_free", "mimetype": "application/pdf", "size": 1500000.0, "url": {"url": "https://meral.edu.mm/record/6273/files/Feature Selection for Categorization of Online News Articles in Myanmar Language.pdf"}, "version_id": "bc569236-dfc7-47b0-aadc-8ed8417095ab"}]}, "item_1583103147082": {"attribute_name": "Conference papers", "attribute_value_mlt": [{"subitem_acronym": "ICAIT-2017", "subitem_c_date": "1-2 November, 2017", "subitem_conference_title": "1st International Conference on Advanced Information Technologies", "subitem_place": "Yangon, Myanmar", "subitem_session": "Natural Language Processing", "subitem_website": "https://www.uit.edu.mm/icait-2017/"}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Myat Sapal Phyu"}, {"subitem_authors_fullname": "Win Win Thant"}, {"subitem_authors_fullname": "Thet Thet Zin"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Conference paper"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2017-11-02"}, "item_title": "Feature Selection for Categorization of Online News Articles in Myanmar Language", "item_type_id": "21", "owner": "45", "path": ["1605779935331"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000006273", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2020-11-19"}, "publish_date": "2020-11-19", "publish_status": "0", "recid": "6273", "relation": {}, "relation_version_is_last": true, "title": ["Feature Selection for Categorization of Online News Articles in Myanmar Language"], "weko_shared_id": -1}
Feature Selection for Categorization of Online News Articles in Myanmar Language
http://hdl.handle.net/20.500.12678/0000006273
http://hdl.handle.net/20.500.12678/000000627376559a07-59e5-4ea9-b8ca-2c10793e90c5
1a715df1-e735-44a6-93d8-31d6912e9467
Name / File | License | Actions |
---|---|---|
Feature Selection for Categorization of Online News Articles in Myanmar Language.pdf (1.5 Mb)
|
© 2017 ICAIT
|
Publication type | ||||||
---|---|---|---|---|---|---|
Conference paper | ||||||
Upload type | ||||||
Publication | ||||||
Title | ||||||
Title | Feature Selection for Categorization of Online News Articles in Myanmar Language | |||||
Language | en | |||||
Publication date | 2017-11-02 | |||||
Authors | ||||||
Myat Sapal Phyu | ||||||
Win Win Thant | ||||||
Thet Thet Zin | ||||||
Description | ||||||
In text mining, the feature selection plays an important role to reduce the high dimensionality of feature space. It can improve the accuracy of the document clustering process and help to avoid overfitting problem. Nowadays, the enormous amount of news article documents is widely available on the internet due to the rapid development of the web. Consequently, there is an urgent need to extract useful content from overloaded information. The categorization of online text documents is crucial to avoid information overload and it can help readers to find rapidly their interesting topic. The problem arises for text categorization is the large number of features space. This study has two phases, documents preprocessing and feature selection. Document preprocessing contains documents collection, syllable segmentation, word segmentation, removing stop words for extracting features from the collection of Myanmar online news documents including sport, health, crime etc. In this study, TF-IDF weighting method is adapted for feature selection. The experimental result shows the adapted TF-IDF method has higher performance than based TF-IDF method. | ||||||
Keywords | ||||||
Feature Selection, TF-IDF, Syllable Segmentation, Word Segmentation, Myanmar Online News | ||||||
Conference papers | ||||||
ICAIT-2017 | ||||||
1-2 November, 2017 | ||||||
1st International Conference on Advanced Information Technologies | ||||||
Yangon, Myanmar | ||||||
Natural Language Processing | ||||||
https://www.uit.edu.mm/icait-2017/ |