MERAL Myanmar Education Research and Learning Portal
Item
{"_buckets": {"deposit": "a6779e61-489a-4285-aafc-86cd3c680aa7"}, "_deposit": {"id": "3420", "owners": [], "pid": {"revision_id": 0, "type": "recid", "value": "3420"}, "status": "published"}, "_oai": {"id": "oai:meral.edu.mm:recid/3420", "sets": ["user-ucsy"]}, "communities": ["ucsy"], "item_1583103067471": {"attribute_name": "Title", "attribute_value_mlt": [{"subitem_1551255647225": "Page Segmentation and Document Layout Analysis for Scanned Image by using Smearing Algorithm", "subitem_1551255648112": "en"}]}, "item_1583103085720": {"attribute_name": "Description", "attribute_value_mlt": [{"interim": "This paper presents a feature-based system which utilizes domain knowledge to segment and classify scanned image documents. Documents usually consists of a mixture of text and image. Text block possesses an interesting property that the x-profile or y-profile of text block is a periodic pattern. Image block possesses generate the connectivity histogram by summing the number of dark pixels with the same connectivity value. Initially, one-scan run-length smearing algorithm (RLSA) with block merging is proposed to segment the document. After segmentation process, the next task is to classify the segmented block. The classification task is then performed based on the rules induced from the features or primitives associated with each document. In this system, proper use of domain knowledge is proved to be effective in accelerating the segmentation speed and decreasing the classification error."}]}, "item_1583103108160": {"attribute_name": "Keywords", "attribute_value_mlt": [{"interim": "one-scan run-length smearing"}, {"interim": "block merging"}, {"interim": "connectivity histogram"}, {"interim": "text block"}, {"interim": "image block"}]}, "item_1583103120197": {"attribute_name": "Files", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_access", "date": [{"dateType": "Available", "dateValue": "2019-07-22"}], "displaytype": "preview", "download_preview_message": "", "file_order": 0, "filename": "psc2010paper (226).pdf", "filesize": [{"value": "599 Kb"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 599000.0, "url": {"url": "https://meral.edu.mm/record/3420/files/psc2010paper (226).pdf"}, "version_id": "9e9e5132-1862-4c17-b44e-929f4c804a89"}]}, "item_1583103131163": {"attribute_name": "Journal articles", "attribute_value_mlt": [{"subitem_issue": "", "subitem_journal_title": "Fifth Local Conference on Parallel and Soft Computing", "subitem_pages": "", "subitem_volume": ""}]}, "item_1583103147082": {"attribute_name": "Conference papers", "attribute_value_mlt": [{"subitem_acronym": "", "subitem_c_date": "", "subitem_conference_title": "", "subitem_part": "", "subitem_place": "", "subitem_session": "", "subitem_website": ""}]}, "item_1583103211336": {"attribute_name": "Books/reports/chapters", "attribute_value_mlt": [{"subitem_book_title": "", "subitem_isbn": "", "subitem_pages": "", "subitem_place": "", "subitem_publisher": ""}]}, "item_1583103233624": {"attribute_name": "Thesis/dissertations", "attribute_value_mlt": [{"subitem_awarding_university": "", "subitem_supervisor(s)": [{"subitem_supervisor": ""}]}]}, "item_1583105942107": {"attribute_name": "Authors", "attribute_value_mlt": [{"subitem_authors": [{"subitem_authors_fullname": "Htun, Nay Win"}, {"subitem_authors_fullname": "Ko, Lin Min"}]}]}, "item_1583108359239": {"attribute_name": "Upload type", "attribute_value_mlt": [{"interim": "Publication"}]}, "item_1583108428133": {"attribute_name": "Publication type", "attribute_value_mlt": [{"interim": "Article"}]}, "item_1583159729339": {"attribute_name": "Publication date", "attribute_value": "2010-12-16"}, "item_1583159847033": {"attribute_name": "Identifier", "attribute_value": "http://onlineresource.ucsy.edu.mm/handle/123456789/1173"}, "item_title": "Page Segmentation and Document Layout Analysis for Scanned Image by using Smearing Algorithm", "item_type_id": "21", "owner": "1", "path": ["1597824273898"], "permalink_uri": "http://hdl.handle.net/20.500.12678/0000003420", "pubdate": {"attribute_name": "Deposited date", "attribute_value": "2019-07-22"}, "publish_date": "2019-07-22", "publish_status": "0", "recid": "3420", "relation": {}, "relation_version_is_last": true, "title": ["Page Segmentation and Document Layout Analysis for Scanned Image by using Smearing Algorithm"], "weko_shared_id": -1}
Page Segmentation and Document Layout Analysis for Scanned Image by using Smearing Algorithm
http://hdl.handle.net/20.500.12678/0000003420
http://hdl.handle.net/20.500.12678/0000003420ad9a3de2-2a0f-41fc-9948-0f1177dd7c8d
a6779e61-489a-4285-aafc-86cd3c680aa7
Name / File | License | Actions |
---|---|---|
![]() |
|
Publication type | ||||||
---|---|---|---|---|---|---|
Article | ||||||
Upload type | ||||||
Publication | ||||||
Title | ||||||
Title | Page Segmentation and Document Layout Analysis for Scanned Image by using Smearing Algorithm | |||||
Language | en | |||||
Publication date | 2010-12-16 | |||||
Authors | ||||||
Htun, Nay Win | ||||||
Ko, Lin Min | ||||||
Description | ||||||
This paper presents a feature-based system which utilizes domain knowledge to segment and classify scanned image documents. Documents usually consists of a mixture of text and image. Text block possesses an interesting property that the x-profile or y-profile of text block is a periodic pattern. Image block possesses generate the connectivity histogram by summing the number of dark pixels with the same connectivity value. Initially, one-scan run-length smearing algorithm (RLSA) with block merging is proposed to segment the document. After segmentation process, the next task is to classify the segmented block. The classification task is then performed based on the rules induced from the features or primitives associated with each document. In this system, proper use of domain knowledge is proved to be effective in accelerating the segmentation speed and decreasing the classification error. | ||||||
Keywords | ||||||
one-scan run-length smearing, block merging, connectivity histogram, text block, image block | ||||||
Identifier | http://onlineresource.ucsy.edu.mm/handle/123456789/1173 | |||||
Journal articles | ||||||
Fifth Local Conference on Parallel and Soft Computing | ||||||
Conference papers | ||||||
Books/reports/chapters | ||||||
Thesis/dissertations |