Texture for Script Identification

Loading...
Thumbnail Image
File version
Author(s)
Busch, A
Boles, WW
Sridharan, S
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)

David J. Kriegman

Date
2005
Size

1029400 bytes

70153 bytes

File type(s)

application/pdf

text/plain

Location
License
Abstract

The problem of determining the script and language of a document image has a number of important applications in the field of document analysis, such as indexing and sorting of large collections of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate the use of texture as a tool for determining the script of a document image, based on the observation that text has a distinct visual texture. An experimental evaluation of a number of commonly used texture features is conducted on a newly created script database, providing a qualitative measure of which features are most appropriate for this task. Strategies for improving classification results in situations with limited training data and multiple font types are also proposed.

Journal Title

IEEE Transactions on Pattern Analysis and Machine Intelligence

Conference Title
Book Title
Edition
Volume

27

Issue

11

Thesis Type
Degree Program
School
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Item Access Status
Note
Access the data
Related item(s)
Subject

Information systems

Persistent link to this record
Citation
Collections