Log in
Language:

MERAL Myanmar Education Research and Learning Portal

  • Top
  • Universities
  • Ranking
To
lat lon distance
To

Field does not validate



Index Link

Index Tree

Please input email address.

WEKO

One fine body…

WEKO

One fine body…

Item

{"_buckets": {"deposit": "12c36bba-b52d-417c-bca2-8a32b3568a1b"}, "_deposit": {"id": "4418", "owners": [], "pid": {"revision_id": 0, "type": "recid", "value": "4418"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/4418", "sets": ["user-ucsy"]}, "communities": ["ucsy"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Noise Elimination from Web Page in Web Content Mining", "subitem_1551255648112": "en_US"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "Nowadays, a large number of web pagescontained useful information is oftenaccompanied by a large amount of noise such asbanner advertisements, navigation bars,copyright notices, etc. These noise data canseriously harm for web miners by extractingwhole document rather than the informativecontent and also retrieve non-relevant results. Itis also important to distinguish valuableinformation from noisy data within a single webpage. The web pages are constructed not onlymain contents information like productinformation in shopping domain, job informationin a job domain but also advertisements bar,static content like navigation panels, copyrightsections, etc. When web documents areprocessed, the main content is surrounded bynoise in the retrieved data. To tackle theseissues, a noise elimination process is describedby using html tags and main content is retrievedby using gomory-hu tree."}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "noise elimination"}, {"interim": "block splitting"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2019-10-25"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "Noise Elimination from Web Page in Web Content Mining.pdf", "filesize": [{"value": "765 Kb"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 765000.0, "url": {"url": "https://meral.edu.mm/record/4418/files/Noise Elimination from Web Page in Web Content Mining.pdf"}, "version_id": "70cecd2c-a18b-4cb8-9d2b-cde3873c15b0"}]}, "item_1583103131163": {"attribute_name": "Journal articles", "attribute_value_mlt": [{"subitem_issue": "", "subitem_journal_title": "Thirteenth International Conference On Computer Applications (ICCA 2015)", "subitem_pages": "", "subitem_volume": ""}]}, "item_1583103147082": {"attribute_name": "Conference papers", "attribute_value_mlt": [{"subitem_acronym": "", "subitem_c_date": "", "subitem_conference_title": "", "subitem_part": "", "subitem_place": "", "subitem_session": "", "subitem_website": ""}]}, "item_1583103211336": {"attribute_name": "Books/reports/chapters", "attribute_value_mlt": [{"subitem_book_title": "", "subitem_isbn": "", "subitem_pages": "", "subitem_place": "", "subitem_publisher": ""}]}, "item_1583103233624": {"attribute_name": "Thesis/dissertations", "attribute_value_mlt": [{"subitem_awarding_university": "", "subitem_supervisor(s)": [{"subitem_supervisor": ""}]}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Linn, Khaing Wah Wah"}, {"subitem_authors_fullname": "Phyu, Sabai"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Article"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2015-02-05"}, "item_1583159847033": {"attribute_name": "Identifier", "attribute_value": "http://onlineresource.ucsy.edu.mm/handle/123456789/2352"}, "item_title": "Noise Elimination from Web Page in Web Content Mining", "item_type_id": "21", "owner": "1", "path": ["1597824273898"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000004418", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2019-10-25"}, "publish_date": "2019-10-25", "publish_status": "0", "recid": "4418", "relation": {}, "relation_version_is_last": true, "title": ["Noise Elimination from Web Page in Web Content Mining"], "weko_shared_id": -1}
  1. University of Computer Studies, Yangon
  2. Conferences

Noise Elimination from Web Page in Web Content Mining

http://hdl.handle.net/20.500.12678/0000004418
http://hdl.handle.net/20.500.12678/0000004418
df77532b-c939-4eb6-a3a9-1315fda35336
12c36bba-b52d-417c-bca2-8a32b3568a1b
None
Preview
Name / File License Actions
Noise Noise Elimination from Web Page in Web Content Mining.pdf (765 Kb)
Publication type
Article
Upload type
Publication
Title
Title Noise Elimination from Web Page in Web Content Mining
Language en_US
Publication date 2015-02-05
Authors
Linn, Khaing Wah Wah
Phyu, Sabai
Description
Nowadays, a large number of web pagescontained useful information is oftenaccompanied by a large amount of noise such asbanner advertisements, navigation bars,copyright notices, etc. These noise data canseriously harm for web miners by extractingwhole document rather than the informativecontent and also retrieve non-relevant results. Itis also important to distinguish valuableinformation from noisy data within a single webpage. The web pages are constructed not onlymain contents information like productinformation in shopping domain, job informationin a job domain but also advertisements bar,static content like navigation panels, copyrightsections, etc. When web documents areprocessed, the main content is surrounded bynoise in the retrieved data. To tackle theseissues, a noise elimination process is describedby using html tags and main content is retrievedby using gomory-hu tree.
Keywords
noise elimination, block splitting
Identifier http://onlineresource.ucsy.edu.mm/handle/123456789/2352
Journal articles
Thirteenth International Conference On Computer Applications (ICCA 2015)
Conference papers
Books/reports/chapters
Thesis/dissertations
Back
0
0
views
downloads
See details
Views Downloads

Versions

Ver.1 2020-09-01 14:45:06.946556
Show All versions

Share

Mendeley Twitter Facebook Print Addthis

Export

OAI-PMH
  • OAI-PMH DublinCore
Other Formats
  • JSON

Confirm


Back to MERAL


Back to MERAL