2024-03-28T08:38:18Z
https://meral.edu.mm/oai
oai:meral.edu.mm:recid/4891
2022-03-24T23:13:08Z
1582963302567:1597824273898
user-ucsy
Implementation of focused crawler by using machine learning approach
Ye, Yamin Shwe
Soe, Khin Mar
The rapid growth of web generated on theinternet by millions of users poses many challengesfor general purpose search engines (for example,scaling).Typically a general purpose search engineconsists of three main parts, Crawler, Indexer andQuery processing system. The crawlers of a generalpurpose search engine crawl every page. Soproblems arise when we need to retrieve onlycorresponding portion of the web, especially for atopic or a group of topic. Such requirement can befulfilled by a domain specific crawler or focusedcrawler. Focused crawler crawls only those pagesthat are interested by the system. A focused crawlertraverses the web selecting out relevant pages to apredefined topic and neglecting those out of concern.The focused crawler determines which portion of theweb is relevant and which is not. That can be doneby several machine learning approach used in textcategorization. This thesis proposes a focusedcrawler by using neural network. It can be used tobuild general purpose domain specific search engine.
2010-12-16
http://hdl.handle.net/20.500.12678/0000004891
https://meral.edu.mm/records/4891