<?xml version='1.0' encoding='UTF-8'?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-06-24T06:46:34Z</responseDate>
  <request verb="GetRecord" identifier="oai:meral.edu.mm:recid/4917" metadataPrefix="oai_dc">https://meral.edu.mm/oai</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:meral.edu.mm:recid/4917</identifier>
        <datestamp>2021-12-13T03:37:29Z</datestamp>
        <setSpec>1582963302567:1597824273898</setSpec>
        <setSpec>user-ucsy</setSpec>
      </header>
      <metadata>
        <oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns="http://www.w3.org/2001/XMLSchema" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
          <dc:title>Web Page Categorization Based on Content and Data Extraction for Academic Community</dc:title>
          <dc:creator>Phyu, Sabai</dc:creator>
          <dc:creator>Linn, Khaing Wah Wah</dc:creator>
          <dc:description>The web is a large amount of data and difficult tosearch information or data of user interest (ITacademic field). Therefore, it needs to categorize formeet user’s interesting field easily. Web pagecategorization help improve the quality of web search.In this paper, we proposed a framework for web dataextraction by using categorized web pages to improvedata extraction accuracy and result. Firstly, thenumbers of test web pages are defined as inputs. Weuse page segmentation algorithm (VIPS) to performsegmentation these pages to achieve content structurefor web page cleaning and to evaluate informative ormain content block. These main contents arecategorized by using Support Vector Machine (SVM)which gives accurate and efficient result. Thesecategorized web pages are stored into the database(IT library) to output data accurately when user query.</dc:description>
          <dc:date>2014-02-17</dc:date>
          <dc:identifier>http://hdl.handle.net/20.500.12678/0000004917</dc:identifier>
          <dc:identifier>https://meral.edu.mm/records/4917</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
