MERAL Myanmar Education Research and Learning Portal
Item
{"_buckets": {"deposit": "2d73fdad-eadf-47f5-bd0e-89dcf4b48eb2"}, "_deposit": {"id": "3818", "owners": [], "pid": {"revision_id": 0, "type": "recid", "value": "3818"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/3818", "sets": ["user-ucsy"]}, "communities": ["ucsy"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Elimination noisy information on web page", "subitem_1551255648112": "en"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "Web page typically contains manyinformation blocks. They are navigation panels,copyright and privacy notices and advertisements.These blocks are useful for business purposes.These blocks are called as the noisy blocks whichcan harm web data mining. And so, eliminatingthese noises is of great importance. The noisyblocks usually share some common contents andpresentation styles. The main contents of web pageare different in the common presentation styles.Based on this observation, a site style tree (SST) ispresented in this system to capture the commonpresentation styles and actual contents. Aninformation based algorithm is used to determinewhich parts of the SST represent noises and whichparts represent the main contents of the site.Experimental results show that eliminating noisyinformation on web pages will be effective for webdata mining. The system shows how much noisyinformation blocks can be removed from webpages depending upon file size. The users canchoose desired web page and this system willeliminate unnecessary noise by using noisedetection and web page cleaning algorithm."}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "noise detection"}, {"interim": "noise elimination"}, {"interim": "web mining"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2019-07-31"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "54136.pdf", "filesize": [{"value": "440 Kb"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 440000.0, "url": {"url": "https://meral.edu.mm/record/3818/files/54136.pdf"}, "version_id": "c76222f9-d083-49cf-a503-b7aca5edea6c"}]}, "item_1583103131163": {"attribute_name": "Journal articles", "attribute_value_mlt": [{"subitem_issue": "", "subitem_journal_title": "Fourth Local Conference on Parallel and Soft Computing", "subitem_pages": "", "subitem_volume": ""}]}, "item_1583103147082": {"attribute_name": "Conference papers", "attribute_value_mlt": [{"subitem_acronym": "", "subitem_c_date": "", "subitem_conference_title": "", "subitem_part": "", "subitem_place": "", "subitem_session": "", "subitem_website": ""}]}, "item_1583103211336": {"attribute_name": "Books/reports/chapters", "attribute_value_mlt": [{"subitem_book_title": "", "subitem_isbn": "", "subitem_pages": "", "subitem_place": "", "subitem_publisher": ""}]}, "item_1583103233624": {"attribute_name": "Thesis/dissertations", "attribute_value_mlt": [{"subitem_awarding_university": "", "subitem_supervisor(s)": [{"subitem_supervisor": ""}]}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Sone, Aye Pyae"}, {"subitem_authors_fullname": "Nwe, Nwe"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Article"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2009-12-30"}, "item_1583159847033": {"attribute_name": "Identifier", "attribute_value": "http://onlineresource.ucsy.edu.mm/handle/123456789/1546"}, "item_title": "Elimination noisy information on web page", "item_type_id": "21", "owner": "1", "path": ["1597824273898"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000003818", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2019-07-31"}, "publish_date": "2019-07-31", "publish_status": "0", "recid": "3818", "relation": {}, "relation_version_is_last": true, "title": ["Elimination noisy information on web page"], "weko_shared_id": -1}
Elimination noisy information on web page
http://hdl.handle.net/20.500.12678/0000003818
http://hdl.handle.net/20.500.12678/000000381894878837-e6c1-489f-8466-9fd0d7297e4a
2d73fdad-eadf-47f5-bd0e-89dcf4b48eb2
Name / File | License | Actions |
---|---|---|
54136.pdf (440 Kb)
|
|
Publication type | ||||||
---|---|---|---|---|---|---|
Article | ||||||
Upload type | ||||||
Publication | ||||||
Title | ||||||
Title | Elimination noisy information on web page | |||||
Language | en | |||||
Publication date | 2009-12-30 | |||||
Authors | ||||||
Sone, Aye Pyae | ||||||
Nwe, Nwe | ||||||
Description | ||||||
Web page typically contains manyinformation blocks. They are navigation panels,copyright and privacy notices and advertisements.These blocks are useful for business purposes.These blocks are called as the noisy blocks whichcan harm web data mining. And so, eliminatingthese noises is of great importance. The noisyblocks usually share some common contents andpresentation styles. The main contents of web pageare different in the common presentation styles.Based on this observation, a site style tree (SST) ispresented in this system to capture the commonpresentation styles and actual contents. Aninformation based algorithm is used to determinewhich parts of the SST represent noises and whichparts represent the main contents of the site.Experimental results show that eliminating noisyinformation on web pages will be effective for webdata mining. The system shows how much noisyinformation blocks can be removed from webpages depending upon file size. The users canchoose desired web page and this system willeliminate unnecessary noise by using noisedetection and web page cleaning algorithm. | ||||||
Keywords | ||||||
noise detection, noise elimination, web mining | ||||||
Identifier | http://onlineresource.ucsy.edu.mm/handle/123456789/1546 | |||||
Journal articles | ||||||
Fourth Local Conference on Parallel and Soft Computing | ||||||
Conference papers | ||||||
Books/reports/chapters | ||||||
Thesis/dissertations |