Multi-view Pre-trained Model for Code Vulnerability Identification

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Jiang, Xuxiang
Xiao, Yinhao
Wang, Jun
Zhang, Wei
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)

Wang, L

Segal, M

Chen, J

Qiu, T

Date
2022
Size
File type(s)
Location

Dalian, China

License
Abstract

Vulnerability identification is crucial for cyber security in the software-related industry. Early identification methods require significant manual efforts in crafting features or annotating vulnerable code. Although the recent pre-trained models alleviate this issue, they overlook the multiple rich structural information contained in the code itself. In this paper, we propose a novel Multi-View Pre-Trained Model (MV-PTM) that encodes both sequential and multi-type structural information of the source code and uses contrastive learning to enhance code representations. The experiments conducted on two public datasets demonstrate the superiority of MV-PTM. In particular, MV-PTM improves GraphCodeBERT by 3.36% on average in terms of F1 score.

Journal Title
Conference Title

Wireless Algorithms, Systems, and Applications: 17th International Conference, WASA 2022, Dalian, China, November 24–26, 2022, Proceedings, Part III

Book Title
Edition
Volume

13473

Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2022 Springer Cham. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher.The original publication is available at www.springerlink.com

Item Access Status
Note
Access the data
Related item(s)
Subject

Machine learning

System and network security

CLONE DETECTION

Computer Science

Computer Science, Artificial Intelligence

Contrastive learning

Pre-trained model

Persistent link to this record
Citation

Jiang, X; Xiao, Y; Wang, J; Zhang, W, Multi-view Pre-trained Model for Code Vulnerability Identification, Wireless Algorithms, Systems, and Applications: 17th International Conference, WASA 2022, Dalian, China, November 24–26, 2022, Proceedings, Part III, 2022, 13473, pp. 127-135