CNN-Meth: A Tool to Accurately Predict Lysine Methylation Sites Using Evolutionary Information-Based Protein Modeling
Files
File version
Accepted Manuscript (AM)
Author(s)
Sharma, A
Dehzangi, I
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
KC, Dukka B
Date
Size
File type(s)
Location
License
Abstract
Lysine methylation is a crucial posttranslational modification influencing both histone and nonhistone protein functions. Disruptions in lysine methyltransferase activity are linked to numerous diseases, including various cancers and developmental disorders. Accurate identification of lysine methylation sites is essential for early diagnosis and therapeutic development. Here, we present CNN-Meth, a newly developed Web-based utility that employs a convolutional neural network (CNN) to predict lysine methylation sites. CNN-Meth leverages evolutionary, structural, and physicochemical data alongside binary encoding for its training process. Evolutionary and structural features used to build CNN-Meth are extracted using protein modeling, which works similarly to using Protein Language Models (PLM). Unlike traditional approaches that rely on manually extracted features, CNN-Meth uses CNNs for automated feature extraction, ensuring minimal information loss. This novel methodology enhances prediction accuracy, achieving 96.0% Accuracy, 85.1% Sensitivity, 96.4% Specificity, and a Matthew’s Correlation Coefficient (MCC) of 0.65. This demonstrates the possible effectiveness of using PLM to predict Methylation sites as a future direction. The CNN-Meth tool and its source code are readily accessible at https://github.com/MLBC-lab/CNN-Meth, providing a robust resource for researchers and clinicians.
Journal Title
Conference Title
Book Title
Large Language Models (LLMs) in Protein Bioinformatics
Edition
1st
Volume
2941
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
This work is covered by copyright. You must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a specified licence, refer to the licence for details of permitted re-use. If you believe that this work infringes copyright please make a copyright takedown request using the form at https://www.griffith.edu.au/copyright-matters.
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation
Spadaro, A; Sharma, A; Dehzangi, I, CNN-Meth: A Tool to Accurately Predict Lysine Methylation Sites Using Evolutionary Information-Based Protein Modeling, Large Language Models (LLMs) in Protein Bioinformatics, 2025, 1st, 2941, pp. 177-187