MERAL Myanmar Education Research and Learning Portal

  1. University of Information Technology
  2. International Conference on Advanced Information Technologies

Feature Selection for Categorization of Online News Articles in Myanmar Language

http://hdl.handle.net/20.500.12678/0000006273
Name / File
Feature Selection for Categorization of Online News Articles in Myanmar Language.pdf (1.5 Mb)
License
© 2017 ICAIT
Publication type
Conference paper
Upload type
Publication
Title
Feature Selection for Categorization of Online News Articles in Myanmar Language
Language
en
Publication date
2017-11-02
Authors
Myat Sapal Phyu
Win Win Thant
Thet Thet Zin
Description
In text mining, feature selection plays an important role in reducing the high dimensionality of the feature space. It can improve the accuracy of the document clustering process and help avoid overfitting. An enormous number of news articles is now available on the internet owing to the rapid development of the web, so there is an urgent need to extract useful content from this overload of information. Categorizing online text documents is crucial for avoiding information overload and helps readers quickly find topics of interest. The main problem in text categorization is the large size of the feature space. This study has two phases: document preprocessing and feature selection. Document preprocessing comprises document collection, syllable segmentation, word segmentation, and stop-word removal, in order to extract features from a collection of Myanmar online news documents covering sport, health, crime, and other topics. In this study, the TF-IDF weighting method is adapted for feature selection. The experimental results show that the adapted TF-IDF method performs better than the baseline TF-IDF method.
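The feature-selection phase described above can be illustrated with a minimal Python sketch. It shows only the standard TF-IDF weighting, since this record does not describe how the authors adapted it. It also assumes that each document has already been segmented into word tokens, so syllable and word segmentation are not shown. The function names `tfidf_scores` and `select_features`, the stop-word set, and the cutoff `k` are hypothetical and chosen for illustration.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return one {term: tf-idf} dict per tokenized document.

    Assumption: plain TF-IDF with tf = count / document length and
    idf = log(N / df). This is not the adapted variant from the paper.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def select_features(docs, stop_words, k):
    """Drop stop words, then keep the k terms with the highest peak TF-IDF."""
    filtered = [[t for t in doc if t not in stop_words] for doc in docs]
    best = {}
    for doc_scores in tfidf_scores(filtered):
        for term, score in doc_scores.items():
            best[term] = max(best.get(term, 0.0), score)
    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(best.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:k]]
```

A term that occurs in every document receives an IDF of zero and therefore ranks last, which is why TF-IDF tends to favor terms that are concentrated in a few documents. Ranking each term by its highest score across documents is one possible selection rule among several; averaging the scores would be an equally reasonable choice.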
Keywords
Feature Selection, TF-IDF, Syllable Segmentation, Word Segmentation, Myanmar Online News
Conference papers
ICAIT-2017
1-2 November, 2017
1st International Conference on Advanced Information Technologies
Yangon, Myanmar
Natural Language Processing
https://www.uit.edu.mm/icait-2017/

Versions

Ver.1 2020-11-19 15:36:50.792428