Item

Deposited date 2019-06-27
OAI identifier oai:meral.edu.mm:recid/3101

Efficient Document Clustering System Based on Probability Distribution of K-Means (PD K-Means) Model

http://hdl.handle.net/20.500.12678/0000003101
Name / File
Efficient Document Clustering System Basedon Probability Distribution ofK-Means(PD K-Means) Model Tin Thu Zar Win (Ph D IJSHRE journal).pdf (152 Kb)
Publication type
Journal article
Upload type
Publication
Title
Title Efficient Document Clustering System Based on Probability Distribution of K-Means (PD K-Means) Model
Language en
Publication date 2017-10-06
Authors
Tin Thu Zar Win
Nang Aye Aye Htwe
Moe Moe Aye
Description
In a document clustering system, documents with the same similarity scores may fall into different clusters rather than the same cluster, because the similarity distance between pairs of documents is calculated from geometric measurements. To address this, a probability distribution of K-Means (PD K-Means) algorithm is proposed. In this system, documents are clustered using the proposed probability distribution equation instead of a similarity measure between objects. It also solves the initial centroid problem of K-Means by using the Systematic Selection of Initial Centroid (SSIC) approach. As a result, it generates compact and stable results and eliminates the initial cluster problem of K-Means. In the experiments, F-measure values increase by about 0.28 on the 20NewsGroup dataset, and by 0.26 on R8 and 0.14 on R52 from the Reuter21578 datasets. The evaluations demonstrate that the proposed solution outperforms the original method and can be applied to various standard and unsupervised datasets.
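The description reports its gains as F-measure values but does not say how the F-measure is computed. As an illustration only, the sketch below uses one common clustering F-measure: for each true class it takes the best F1 score over all clusters, then averages those scores weighted by class size. This definition is an assumption and may differ from the one used in the paper. The function name `clustering_f_measure` and the toy labels are hypothetical, and the code does not implement PD K-Means or SSIC.

```python
from collections import Counter


def clustering_f_measure(true_labels, cluster_ids):
    """Class-weighted F-measure: best F1 over clusters for each true class.

    Assumed definition for illustration; not taken from the paper.
    """
    if len(true_labels) != len(cluster_ids):
        raise ValueError("label sequences must have equal length")
    n = len(true_labels)
    class_sizes = Counter(true_labels)
    cluster_sizes = Counter(cluster_ids)
    # Number of documents shared by each (class, cluster) pair.
    overlap = Counter(zip(true_labels, cluster_ids))

    total = 0.0
    for cls, n_cls in class_sizes.items():
        best = 0.0
        for clu, n_clu in cluster_sizes.items():
            n_both = overlap[(cls, clu)]
            if n_both == 0:
                continue
            precision = n_both / n_clu
            recall = n_both / n_cls
            best = max(best, 2 * precision * recall / (precision + recall))
        # Weight each class by its share of the documents.
        total += (n_cls / n) * best
    return total


# Toy data (hypothetical): six documents in two topics.
truth = ["sport", "sport", "sport", "politics", "politics", "politics"]
perfect = clustering_f_measure(truth, [0, 0, 0, 1, 1, 1])
mixed = clustering_f_measure(truth, [0, 0, 1, 1, 1, 1])
print(f"perfect={perfect:.3f} mixed={mixed:.3f}")
```

Under this definition a clustering that matches the true classes scores 1.0, and moving a single document into the wrong cluster lowers the score. That gives a sense of scale for reported increases such as 0.28.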
Keywords
Initial Centroid, Probability Distribution, PD K-Means, SSIC
Identifier 10.5281/zenodo.3131916
Journal articles
Issue 8
International Journal of Software & Hardware Research in Engineering
pp. 1-6
Volume 5