
CodeBERT‐Attack: Adversarial attack against source code deep learning models via pre‐trained model.

Authors :
Zhang, Huangzhao
Lu, Shuai
Li, Zhuo
Jin, Zhi
Ma, Lei
Liu, Yang
Li, Ge
Source :
Journal of Software: Evolution & Process. Mar 2024, Vol. 36, Issue 3, p1-29. 29p.
Publication Year :
2024

Abstract

Over the past few years, the software engineering (SE) community has widely employed deep learning (DL) techniques in many source code processing tasks. As in other domains such as computer vision and natural language processing (NLP), state‐of‐the‐art DL techniques for source code processing can still suffer from adversarial vulnerability, where minor code perturbations mislead a DL model's inference. Efficiently detecting such vulnerability to expose the risks at an early stage is an essential step and of great importance for further enhancement. This paper proposes CodeBERT‐Attack (CBA), a novel, effective black‐box adversarial attack method that produces high‐quality adversarial examples, built on the powerful large pre‐trained model CodeBERT and targeting DL models for source code processing. CBA locates vulnerable positions through masking and leverages the power of CodeBERT to generate perturbations that preserve the textual naturalness of the code. We turn CodeBERT against both DL models and CodeBERT models further fine‐tuned for specific downstream tasks, successfully misleading these victim models into erroneous outputs. In addition, by leveraging the power of CodeBERT, CBA can effectively generate adversarial examples that are less perceptible to programmers. Our in‐depth evaluation on two typical source code classification tasks (i.e., functionality classification and code clone detection), against the most widely adopted LSTM and the powerful fine‐tuned CodeBERT models, demonstrates the advantages of our proposed technique in terms of both effectiveness and efficiency. Furthermore, our results also show (1) that pre‐training may further help CodeBERT gain resilience against perturbations and (2) that certain pre‐training tasks may be beneficial for adversarial robustness. [ABSTRACT FROM AUTHOR]
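The abstract outlines CBA's core loop: mask a position in the code, let CodeBERT's masked-language-model head propose natural-looking substitutes, and keep substitutions that degrade the victim model's confidence. The sketch below is a minimal, illustrative rendering of that idea, not the authors' released implementation: it assumes the public microsoft/codebert-base-mlm checkpoint from Hugging Face transformers, uses a hypothetical victim_confidence stub in place of a real black-box victim, and omits the paper's constraints that keep substitutions syntactically and semantically valid.

```python
# Minimal sketch of a mask-and-substitute attack in the spirit of CBA.
# NOT the authors' code: victim_confidence is a hypothetical stand-in for
# any black-box classifier, and no identifier/semantics checks are applied.
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base-mlm")
mlm = AutoModelForMaskedLM.from_pretrained("microsoft/codebert-base-mlm")
mlm.eval()

def victim_confidence(code: str) -> float:
    """Hypothetical black-box victim stub: should return the victim
    classifier's confidence in its current (correct) prediction.
    This dummy score only makes the sketch executable end to end."""
    return (hash(code) % 1000) / 1000.0

def attack(code: str, top_k: int = 10) -> str:
    ids = tokenizer(code, truncation=True, return_tensors="pt")["input_ids"][0]
    base = victim_confidence(code)
    # Mask each non-special token in turn; greedily keep the CodeBERT-proposed
    # substitute that lowers the victim's confidence (a vulnerability proxy).
    for pos in range(1, len(ids) - 1):  # skip <s> and </s>
        masked = ids.clone()
        masked[pos] = tokenizer.mask_token_id
        with torch.no_grad():
            logits = mlm(masked.unsqueeze(0)).logits[0, pos]
        for cand in logits.topk(top_k).indices.tolist():
            if cand == ids[pos].item():
                continue  # skip the original token
            trial = ids.clone()
            trial[pos] = cand
            perturbed = tokenizer.decode(trial[1:-1])
            score = victim_confidence(perturbed)
            if score < base:  # confidence dropped: keep this perturbation
                ids, base, code = trial, score, perturbed
                break
    return code
```

A greedy first-improvement step is used here purely for brevity; the paper's actual search order, candidate filtering, and validity constraints may differ.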

Details

Language :
English
ISSN :
2047-7473
Volume :
36
Issue :
3
Database :
Academic Search Index
Journal :
Journal of Software: Evolution & Process
Publication Type :
Academic Journal
Accession number :
175945959
Full Text :
https://doi.org/10.1002/smr.2571