Tossing in the Dark: Practical {Bit-Flipping} on Gray-box Deep Neural Networks for Runtime Trojan Injection

Zihao Wang; Di Tang; XiaoFeng Wang; Wei He; Zhaoyang Geng; Wenhao Wang

Authors:

Zihao Wang, Di Tang, and XiaoFeng Wang, Indiana University Bloomington; Wei He, Zhaoyang Geng, and Wenhao Wang, SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences

Abstract:

Although Trojan attacks on deep neural networks (DNNs) have been extensively studied, the threat of run-time Trojan injection has only recently been brought to attention. Unlike data poisoning attacks that target the training stage of a DNN model, a run-time attack executes an exploit such as Rowhammer on memory to flip the bits of the target model and thereby implant a Trojan. This threat is stealthier but more challenging, as it requires flipping a set of bits in the target model to introduce an effective Trojan without noticeably downgrading the model's accuracy. This has been achieved only under the less realistic assumption that the target model is fully shared with the adversary through memory, thus enabling them to flip bits across all model layers, including the last few layers.

For the first time, we have investigated run-time Trojan Injection under a more realistic gray-box scenario. In this scenario, a model is perceived in an encoder-decoder manner: the encoder is public and shared through memory, while the decoder is private and so considered to be black-box and inaccessible to unauthorized parties. To address the unique challenge posed by the black-box decoder to Trojan injection in this scenario, we developed a suite of innovative techniques. Using these techniques, we constructed our gray-box attack, Groan, which stands out as both effective and stealthy. Our experiments show that Groan is capable of injecting a highly effective Trojan into the target model, while also largely preserving its performance, even in the presence of state-of-theart memory protection.

Zihao Wang, Indiana University Bloomington

Di Tang, Indiana University Bloomington

XiaoFeng Wang, Indiana University Bloomington

Wei He, SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences

Zhaoyang Geng, SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences

Wenhao Wang, SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX

@inproceedings {294647,
author = {Zihao Wang and Di Tang and XiaoFeng Wang and Wei He and Zhaoyang Geng and Wenhao Wang},
title = {Tossing in the Dark: Practical {Bit-Flipping} on Gray-box Deep Neural Networks for Runtime Trojan Injection},
booktitle = {33rd USENIX Security Symposium (USENIX Security 24)},
year = {2024},
isbn = {978-1-939133-44-1},
address = {Philadelphia, PA},
pages = {1331--1348},
url = {https://www.usenix.org/conference/usenixsecurity24/presentation/wang-zihao-tossing},
publisher = {USENIX Association},
month = aug
}

Download

Wang PDF

Wang Paper (Prepublication) PDF

Tossing in the Dark: Practical Bit-Flipping on Gray-box Deep Neural Networks for Runtime Trojan Injection