MERAL Myanmar Education Research and Learning Portal
Item
Deposited date: 2019-07-04
Ordering URL for Focused Web Crawlers using Effective Prioritization
http://hdl.handle.net/20.500.12678/0000004698
Field | Value
---|---
Publication type | Article
Upload type | Publication
Title | Ordering URL for Focused Web Crawlers using Effective Prioritization
Language | en
Publication date | 2012-02-28
Authors | Min, Nandar Win; Hlaing, Aye Nandar
Description | Obtaining important pages rapidly is very useful when a crawler cannot visit the entire Web in a reasonable amount of time. One approach is to use a focused crawler, which tries to download only pages on a pre-defined topic in order to avoid irrelevant web documents and reduce network traffic. It can also minimize the overall number of downloaded Web pages for processing and maximize the percentage of relevant pages. In this paper, we present the order in which a focused crawler should visit the URLs it has seen in order to obtain more “important” pages first. During crawling, a Naive Bayes classifier with four feature representations is used to enhance correctness for a specific topic. To sort URLs, we use a Priority equation that gives every page a score.
Keywords | focused crawler, learning focused crawler, Naive Bayes classifier, similarity space model
Identifier | http://onlineresource.ucsy.edu.mm/handle/123456789/395
Journal articles | Tenth International Conference On Computer Applications (ICCA 2012)
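
The description outlines a crawl frontier ordered by a per-page score, with a Naive Bayes classifier judging topical relevance. The following is a minimal sketch of that general idea, not the authors' method: this record does not give the paper's Priority equation or its four feature representations, so the sketch assumes the score is simply the classifier's posterior probability of the "relevant" class computed from anchor text. The names `TinyNaiveBayes` and `Frontier`, the training snippets, and the `example.org` URLs are all hypothetical.

```python
import heapq
import math
from collections import Counter


class TinyNaiveBayes:
    """Multinomial Naive Bayes over word counts with Laplace smoothing."""

    def __init__(self):
        self.word_counts = {}        # label -> Counter of words
        self.doc_counts = Counter()  # label -> number of training docs
        self.vocab = set()

    def fit(self, docs, labels):
        for text, label in zip(docs, labels):
            words = text.lower().split()
            self.word_counts.setdefault(label, Counter()).update(words)
            self.doc_counts[label] += 1
            self.vocab.update(words)
        return self

    def prob(self, text, label):
        """Posterior P(label | text), normalized over all known labels."""
        total_docs = sum(self.doc_counts.values())
        log_scores = {}
        for lab, counts in self.word_counts.items():
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(self.doc_counts[lab] / total_docs)
            for word in text.lower().split():
                score += math.log((counts[word] + 1) / denom)
            log_scores[lab] = score
        top = max(log_scores.values())  # subtract the max for stability
        norm = sum(math.exp(s - top) for s in log_scores.values())
        return math.exp(log_scores[label] - top) / norm


class Frontier:
    """URL frontier that pops the highest-scoring unseen URL first."""

    def __init__(self):
        self._heap = []
        self._seen = set()
        self._counter = 0

    def push(self, url, score):
        if url in self._seen:
            return  # each URL is queued at most once
        self._seen.add(url)
        # heapq is a min-heap, so negate the score; the counter breaks ties
        heapq.heappush(self._heap, (-score, self._counter, url))
        self._counter += 1

    def pop(self):
        neg_score, _, url = heapq.heappop(self._heap)
        return url, -neg_score

    def __len__(self):
        return len(self._heap)


# Hypothetical training data: two on-topic and two off-topic snippets.
clf = TinyNaiveBayes().fit(
    ["web crawler search engine index", "crawler url frontier link",
     "football match goal score", "recipe cooking dinner"],
    ["relevant", "relevant", "irrelevant", "irrelevant"],
)

# Hypothetical discovered links, keyed by URL with their anchor text.
candidates = {
    "http://example.org/a": "football league match",
    "http://example.org/b": "focused crawler url ordering",
    "http://example.org/c": "dinner recipe",
}
frontier = Frontier()
for url, anchor_text in candidates.items():
    frontier.push(url, clf.prob(anchor_text, "relevant"))

first_url, first_score = frontier.pop()
```

With this toy data the on-topic link is popped first. A real crawler would fetch each popped page, extract its links, score them, and push them back onto the frontier; the actual scoring formula would have to come from the paper itself.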