Show simple item record

dc.contributor.authorChakraborty, Arpita
dc.contributor.authorBlumenstein, Michael
dc.contributor.editorLisa OConner
dc.date.accessioned2018-02-21T05:47:04Z
dc.date.available2018-02-21T05:47:04Z
dc.date.issued2016
dc.identifier.doi10.1109/DAS.2016.77
dc.identifier.urihttp://hdl.handle.net/10072/123844
dc.description.abstractWe propose a holistic, dynamic method to preserve text content with zero tolerance while removing marginal noise for historical handwritten document images. The key idea is to identify and analyze the region between the sharp peak at the edge and page frame of the text content at each margin. Depending on the proximity of the sharp peak to the text, the text content is then extracted from the document image. This method automatically adapts thresholds for each single document image and is directly applicable to gray-scale images. The proposed method is evaluated on four diverse handwritten historical datasets: Queensland State Archive (QSA), Saint Gall, Parzival and the Prosecution Project. Experimental results show that the proposed method achieves higher accuracy compared with other methods tested on the Saint Gall and Parzival datasets, whilst for the other two Australian datasets, which have been introduced here for the first time, the results are very encouraging.
dc.description.peerreviewedYes
dc.languageEnglish
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.publisher.placeUnited States
dc.relation.ispartofconferencename12th IAPR International Workshop on Document Analysis Systems (DAS)
dc.relation.ispartofconferencetitlePROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016)
dc.relation.ispartofdatefrom2016-04-11
dc.relation.ispartofdateto2016-04-14
dc.relation.ispartoflocationGREECE
dc.relation.ispartofpagefrom329
dc.relation.ispartofpagefrom6 pages
dc.relation.ispartofpageto334
dc.relation.ispartofpageto6 pages
dc.subject.fieldofresearchArtificial intelligence not elsewhere classified
dc.subject.fieldofresearchcode460299
dc.titlePreserving Text Content from Historical Handwritten Documents
dc.typeConference output
dc.type.descriptionE1 - Conferences
dc.type.codeE - Conference Publications
gro.facultyGriffith Sciences, School of Information and Communication Technology
gro.hasfulltextNo Full Text
gro.griffith.authorBlumenstein, Michael M.
gro.griffith.authorChakraborty, Arpita


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Conference outputs
    Contains papers delivered by Griffith authors at national and international conferences.

Show simple item record