MERAL Myanmar Education Research and Learning Portal
-
RootNode
Item
{"_buckets": {"deposit": "45c8d427-943e-42a6-8d16-53837dfb39a6"}, "_deposit": {"created_by": 45, "id": "5343", "owner": "45", "owners": [45], "owners_ext": {"displayname": "", "username": ""}, "pid": {"revision_id": 0, "type": "recid", "value": "5343"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/5343", "sets": ["user-uit"]}, "communities": ["uit"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Extraction of Reliable Information from the Web", "subitem_1551255648112": "en"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "Information extraction is one of the methods\nto retrieve information from complex web pages. With\nthe use of multiple algorithms, intelligence,\nknowledge base, knowledge acquisition and filtering,\npeople nowadays can benefited with the use of\ninformation extraction. Such application has been\napplied in several dimensions, such as new\ntranscripts, insurance in formation, and weather\nreports. This proposed system extracts required\nlaptop data from relevant web pages and convert\nthem into a standard database. This paper uses\nSTALKER algorithm to generate the rules for\nextracting the laptop information. The extract ed data\nare matched and recognized with built in keyword\nand entity tables using Named Entity Recognition\n(NER). And then, the system produces the required\nextracted information. By using this system, the user\ncan get the meaningful laptop information and it also\nprovides the user with easy access and time saving."}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "Information Extraction"}, {"interim": "Named Entity Recognition"}, {"interim": "Web mining"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2020-09-14"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "Extraction of Reliable Information from the Web.pdf", "filesize": [{"value": "691 Kb"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_0", "mimetype": "application/pdf", "size": 691000.0, "url": {"url": "https://meral.edu.mm/record/5343/files/Extraction of Reliable Information from the Web.pdf"}, "version_id": "7ef087bd-5536-4fba-a7f6-26acf7a6ad53"}]}, "item_1583103147082": {"attribute_name": "Conference papers", "attribute_value_mlt": [{"subitem_acronym": "PSC", "subitem_c_date": "16 December, 2010", "subitem_conference_title": "FIFTH LOCAL CONFERENCE ON PARALLEL AND SOFT COMPUTING", "subitem_place": "University of Computer Studies, Yangon, Myanmar", "subitem_website": "https://www.ucsy.edu.mm/FifthPSC.do"}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Cherry Soe"}, {"subitem_authors_fullname": "Thandar Lwin"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Conference paper"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2010-12-16"}, "item_title": "Extraction of Reliable Information from the Web", "item_type_id": "21", "owner": "45", "path": ["1596102391527"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000005343", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2020-09-14"}, "publish_date": "2020-09-14", "publish_status": "0", "recid": "5343", "relation": {}, "relation_version_is_last": true, "title": ["Extraction of Reliable Information from the Web"], "weko_shared_id": -1}
Extraction of Reliable Information from the Web
http://hdl.handle.net/20.500.12678/0000005343
http://hdl.handle.net/20.500.12678/00000053434507e35d-950e-480d-a4f7-f4edb8cb09e2
45c8d427-943e-42a6-8d16-53837dfb39a6
Name / File | License | Actions |
---|---|---|
![]() |
Publication type | ||||||
---|---|---|---|---|---|---|
Conference paper | ||||||
Upload type | ||||||
Publication | ||||||
Title | ||||||
Title | Extraction of Reliable Information from the Web | |||||
Language | en | |||||
Publication date | 2010-12-16 | |||||
Authors | ||||||
Cherry Soe | ||||||
Thandar Lwin | ||||||
Description | ||||||
Information extraction is one of the methods to retrieve information from complex web pages. With the use of multiple algorithms, intelligence, knowledge base, knowledge acquisition and filtering, people nowadays can benefited with the use of information extraction. Such application has been applied in several dimensions, such as new transcripts, insurance in formation, and weather reports. This proposed system extracts required laptop data from relevant web pages and convert them into a standard database. This paper uses STALKER algorithm to generate the rules for extracting the laptop information. The extract ed data are matched and recognized with built in keyword and entity tables using Named Entity Recognition (NER). And then, the system produces the required extracted information. By using this system, the user can get the meaningful laptop information and it also provides the user with easy access and time saving. |
||||||
Keywords | ||||||
Information Extraction, Named Entity Recognition, Web mining | ||||||
Conference papers | ||||||
PSC | ||||||
16 December, 2010 | ||||||
FIFTH LOCAL CONFERENCE ON PARALLEL AND SOFT COMPUTING | ||||||
University of Computer Studies, Yangon, Myanmar | ||||||
https://www.ucsy.edu.mm/FifthPSC.do |