Automatic categorization of bioscience literature containing QTL information
Author(s)
Zhong, Y
Quan, X
Yu, X
Gao, Q
Peng, J
Abstract
In this paper, we introduce text categorization methods to address the problem of classifying literature that contains Quantitative Trait Locus (QTL) information. Our work focuses on building an automatic categorization system, based on Support Vector Machines (SVM), that targets the QTL information of various species. We propose a text representation strategy combining words and phrases that effectively improves classification accuracy. By studying literature containing QTL information and other species-related publications, we determined representative phrases and detected abbreviations to form a second set of features. Together with the words selected by chi-square value, these two sets of features were used to represent text samples. We used a portion of the QTL-related literature for particular species to construct the system experimentally, and then tested it on data from multiple plants and species. The experimental results indicate that our work may help further research on constructing QTL information databases.
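The feature representation described in the abstract can be illustrated with a minimal sketch. It assumes that the abstract's word-selection score is the standard chi-square statistic for a two-class problem, and that phrase and abbreviation features are simple presence flags. The function names, the toy documents, and the phrase list are all hypothetical and are not taken from the paper. The resulting vectors could then be passed to any SVM implementation, which is not shown here.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def chi_square_scores(docs, labels):
    """Chi-square score of each word against the positive class (label 1)."""
    n = len(docs)
    n_pos = sum(labels)
    n_neg = n - n_pos
    pos_df, neg_df = Counter(), Counter()
    for text, label in zip(docs, labels):
        for word in set(tokenize(text)):
            (pos_df if label == 1 else neg_df)[word] += 1
    scores = {}
    for word in set(pos_df) | set(neg_df):
        a = pos_df[word]   # positive documents containing the word
        b = neg_df[word]   # negative documents containing the word
        c = n_pos - a      # positive documents without the word
        d = n_neg - b      # negative documents without the word
        denom = (a + c) * (b + d) * (a + b) * (c + d)
        scores[word] = n * (a * d - c * b) ** 2 / denom if denom else 0.0
    return scores


def select_words(scores, k):
    """Keep the k highest-scoring words (ties broken alphabetically)."""
    return sorted(scores, key=lambda w: (-scores[w], w))[:k]


def featurize(text, words, phrases):
    """Binary vector: selected-word flags followed by phrase flags."""
    tokens = tokenize(text)
    token_set = set(tokens)
    padded = " " + " ".join(tokens) + " "
    word_part = [1 if w in token_set else 0 for w in words]
    phrase_part = [1 if " " + p.lower() + " " in padded else 0 for p in phrases]
    return word_part + phrase_part


# Hypothetical toy data: label 1 = contains QTL information, 0 = does not.
docs = [
    "Mapping a quantitative trait locus for grain yield in rice",
    "QTL mapping of plant height in maize",
    "Protein folding dynamics in yeast cells",
    "Gene expression in maize under drought stress",
]
labels = [1, 1, 0, 0]

# Hypothetical phrase and abbreviation features.
phrases = ["quantitative trait locus", "qtl"]

scores = chi_square_scores(docs, labels)
words = select_words(scores, 1)
vectors = [featurize(d, words, phrases) for d in docs]
```

Keeping the two feature sets as separate segments of one vector makes it easy to compare a words-only representation against the combined one by dropping the trailing phrase flags.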
Journal Title
International Journal of Simulation: Systems, Science and Technology
Volume
17
Issue
15
Citation
Zhong, Y; Quan, X; Yu, X; Gao, Q; Peng, J, Automatic categorization of bioscience literature containing QTL information, International Journal of Simulation: Systems, Science and Technology, 2016, 17 (15), pp. 5.1-5.10