Abstract
Code authorship attribution is the problem of identifying authors of programming language codes through the stylistic features in their codes, a topic that recently witnessed significant interest with outstanding performance. In this work, we present SCAE, a code authorship obfuscation technique that leverages a Seq2Seq code transformer called StructCoder. SCAE customizes StructCoder, a system designed initially for function-level code translation from one language to another (e.g., Java to C#), using transfer learning. SCAE improved the efficiency at a slight accuracy degradation compared to existing work. We also reduced the processing time by ≈ 68% while maintaining an 85% transformation success rate and up to 95.77% evasion success rate in the untargeted setting.
Original language | English |
---|---|
Title of host publication | Computational Data and Social Networks - 12th International Conference, CSoNet 2023, Proceedings |
Editors | Minh Hoàng Hà, Xingquan Zhu, My T. Thai |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 83-92 |
Number of pages | 10 |
ISBN (Print) | 9789819706686 |
DOIs | |
State | Published - 2024 |
Event | 12th International Conference on Computational Data and Social Networks, CSoNet 2023 - Hanoi, Viet Nam Duration: 11 Dec 2023 → 13 Dec 2023 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 14479 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 12th International Conference on Computational Data and Social Networks, CSoNet 2023 |
---|---|
Country/Territory | Viet Nam |
City | Hanoi |
Period | 11/12/23 → 13/12/23 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
Keywords
- Code Authorship Evasion Attack
- Code Authorship Identification
- Machine Learning Identification
- Program Stylistic Features
- Software Forensics