Protecting Deep Learning Systems against Hardware-Oriented Vulnerabilities

Artificial intelligence (AI) has recently approached or even surpassed human-level performance in many applications. However, the successful deployment of AI requires sufficient robustness against adversarial attacks of all types and in all phases of the model life cycle. Although much progress has been made in enhancing the robustness of AI algorithms, there is a lack of systematic studies on hardware-oriented vulnerabilities and countermeasures, which also creates a demand for AI security education. Given this pressing need, this project aims to explore novel hardware-oriented adversarial AI concepts and to develop fundamental defensive strategies against such vulnerabilities to protect next-generation AI systems.

This project has four thrusts. In Thrust 1, this project will explore new adversarial attacks on deep neural network systems, featuring the design of an algorithm-hardware collaborative backdoor attack. In Thrust 2, it will develop methodologies that incorporate the hardware aspect into defenses for enhancing adversarial robustness against vulnerabilities in the untrusted semiconductor supply chain. In Thrust 3, this project will develop novel signature embedding frameworks to protect the integrity of deep neural network models in the untrusted model-building supply chain. Finally, in Thrust 4, it will develop model recovery strategies as an innovative approach to mitigating hardware-oriented fault attacks in the untrusted user space.

This project will yield novel methodologies for ensuring trust in AI systems from both the algorithm and hardware perspectives to meet the future needs of commercial products and national defense. In addition, it will catalyze advances in emerging AI applications across a broad range of sectors, including healthcare, autonomous vehicles, and the Internet of Things (IoT), triggering widespread implementation of AI in mobile and edge devices. New theories and techniques developed in this project will be integrated into undergraduate and graduate education and used to raise public awareness and promote understanding of the importance of AI security.


Team Members


Research

Adversarial ML in Network Security

Deep learning is a powerful tool for combating contemporary security threats. However, the neural network models that power this technology have been shown to be vulnerable in adversarial settings. Our research has explored the vulnerability of such systems and demonstrated that deep learning deployed in a security application is a viable attack target whose compromise can undermine the security of the overall system.
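As one illustrative instance of this threat model, the sketch below crafts an FGSM-style adversarial perturbation against a hypothetical flow-feature classifier. The model, inputs, and perturbation budget are placeholder assumptions for illustration, not the exact setup from our papers.

    import torch
    import torch.nn.functional as F

    def fgsm_perturb(model, x, y, epsilon=0.05):
        # Craft an FGSM-style adversarial example for a batch of
        # flow-feature vectors x with true labels y; epsilon bounds the
        # L-infinity perturbation applied to each feature.
        x_adv = x.clone().detach().requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        loss.backward()
        # Step in the direction that increases the loss, then clamp the
        # features back to a valid normalized range.
        x_adv = x_adv + epsilon * x_adv.grad.sign()
        return x_adv.clamp(0.0, 1.0).detach()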

Trojans in DL Hardware

As deep learning develops further, hardware optimized for neural network computation will become increasingly valuable. These resources will become a target for adversaries, who could embed hardware Trojans, that is, malicious modifications to the designs of these systems. Meanwhile, adversaries can introduce backdoors into a neural network to control the model during inference. This is often achieved through modifications to the model's parameters that alter its functionality to allow manipulation by the adversary without affecting typical usage. To evaluate the capabilities and vulnerabilities in such settings, we developed a method for embedding hardware Trojans into a device so that they introduce backdoors into a deep neural network executed on it. We also developed a novel method of embedding backdoors into deep learning models by altering the function of their base mathematical operations.
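To make the operation-level backdoor idea concrete, the following sketch shows a hypothetical tampered activation function that behaves like a standard ReLU except when a rare trigger value appears in its input. The trigger and payload here are illustrative assumptions; the actual attack operates at the hardware/operator level rather than in Python.

    import torch
    import torch.nn as nn

    class TrojanedReLU(nn.Module):
        # Illustrative stand-in for a tampered base operation: identical
        # to ReLU on typical data, but injects a bias whenever a specific
        # trigger value appears in the input, steering downstream layers.
        def __init__(self, trigger_value=13.37, payload=5.0):
            super().__init__()
            self.trigger_value = trigger_value
            self.payload = payload

        def forward(self, x):
            out = torch.relu(x)
            # The trigger fires only on inputs containing the exact
            # trigger value, so clean-data accuracy is unaffected.
            if torch.isclose(x, torch.tensor(self.trigger_value)).any():
                out = out + self.payload
            return out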

Hardware Watermarks for DNN Accelerators

Adversaries may target well-optimized deep learning hardware for theft, diverting profits from its rightful owner and disincentivizing technological development. Designers therefore need a method of watermarking their hardware to determine whether an adversary is fraudulently using their intellectual property. Our research developed an algorithm for embedding watermarks that identify deep learning hardware accelerators.
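A hedged sketch of the verification side of such a scheme is shown below: the owner queries the suspect accelerator with secret key inputs and checks how often it reproduces the designated watermark responses. The function names, key inputs, and threshold are assumptions for illustration and do not describe the embedding algorithm itself.

    import torch

    def verify_watermark(accelerator_fn, key_inputs, expected_outputs,
                         threshold=0.9):
        # Black-box check: run the secret key inputs through the suspect
        # hardware and measure how often its predictions match the
        # designated watermark responses.
        with torch.no_grad():
            preds = accelerator_fn(key_inputs).argmax(dim=1)
        match_rate = (preds == expected_outputs).float().mean().item()
        return match_rate >= threshold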

Class-oriented Poisoning Attacks

Poisoning attacks mainly target either a model's availability (i.e., subverting overall performance) or its integrity (i.e., causing misclassification of specific input instances). Our work advanced poisoning availability attacks to a per-class basis, focusing on manipulating the malicious behavior induced for each class. We proposed gradient-based class-oriented algorithms to achieve these adversarial goals.
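The sketch below is a simplified assumption rather than the exact objective from the paper: it takes one gradient step that pushes poison samples toward a chosen target class so that their damage concentrates on that class.

    import torch
    import torch.nn.functional as F

    def class_oriented_poison_step(model, x_poison, y_poison,
                                   target_class, lr=0.01):
        # One illustrative crafting step: perturb the poison samples so
        # their predictions drift toward a chosen target class.
        x_poison = x_poison.clone().detach().requires_grad_(True)
        logits = model(x_poison)
        target_labels = torch.full_like(y_poison, target_class)
        loss = F.cross_entropy(logits, target_labels)
        loss.backward()
        # Signed-gradient update, in the spirit of gradient-based
        # poison crafting.
        return (x_poison - lr * x_poison.grad.sign()).detach()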

Clean-label Poisoning Availability Attacks

Poisoning availability attacks degrade overall model accuracy and cause a denial of service by contaminating the training dataset. However, it is challenging to generate effective and stealthy poisoned data at a large scale. We developed a framework that optimizes feature space representations of poisoned data and employs generative adversarial nets to generate high-quality clean-label poisoned data.
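As a minimal sketch of the feature-space optimization component (omitting the GAN generator used in the full framework, with all names and bounds assumed for illustration), a poison can be crafted by nudging a clean-labeled base image so its features approach a target representation while the pixel-space change stays within a small budget.

    import torch

    def feature_space_poison(feature_extractor, x_base, target_feature,
                             steps=100, lr=0.01, eps=8 / 255):
        # Projected-gradient optimization of a clean-label poison: move
        # the sample's feature representation toward a target while
        # keeping the pixel-space change imperceptible.
        delta = torch.zeros_like(x_base, requires_grad=True)
        for _ in range(steps):
            feat = feature_extractor(x_base + delta)
            loss = torch.norm(feat - target_feature)
            loss.backward()
            with torch.no_grad():
                delta -= lr * delta.grad.sign()
                delta.clamp_(-eps, eps)   # stay within the perturbation budget
                delta.grad.zero_()
        return (x_base + delta).detach()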

Defense Against Poisoning Attacks in DNN

Data poisoning attacks are an emerging threat to deep neural networks. Adversaries inject a small fraction of carefully crafted poisoned data into the training dataset, introducing malicious behaviors into DNNs trained on it. Our research developed defensive strategies against different poisoning attacks, such as availability attacks and backdoor attacks.
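The sketch below shows one simple sanitization strategy along these lines, offered purely as an assumed illustration rather than the project's specific defenses: score each training sample by its loss under a reference model and discard the highest-loss fraction, which tends to contain poisoned points.

    import torch
    import torch.nn.functional as F

    def filter_suspect_samples(model, train_x, train_y, keep_fraction=0.9):
        # Score every training sample by its loss under a reference model
        # and keep only the lowest-loss fraction; anomalously high-loss
        # samples are treated as poisoning suspects.
        with torch.no_grad():
            losses = F.cross_entropy(model(train_x), train_y,
                                     reduction="none")
        keep = losses.argsort()[: int(keep_fraction * len(losses))]
        return train_x[keep], train_y[keep]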


Related Publications

  1. Joseph Clements and Yingjie Lao, "DeepHardMark: Towards Watermarking Neural Network Hardware", Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22).
  2. Bingyin Zhao and Yingjie Lao, "CLPA: Clean-Label Poisoning Availability Attacks Using Generative Adversarial Nets", Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22).
  3. Bingyin Zhao and Yingjie Lao, "Towards Class-Oriented Poisoning Attacks Against Neural Networks", 2022 Winter Conference on Applications of Computer Vision (WACV 2022). [paper]
  4. Joseph Clements, Yuzhe Yang, Ankur Sharma, Hongxin Hu and Yingjie Lao, "Rallying Adversarial Techniques against Deep Learning for Network Security", IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021). [paper]
  5. Joseph Clements and Yingjie Lao, "Hardware Trojan Design on Neural Networks", 2019 IEEE International Symposium on Circuits and Systems (ISCAS 2019). [paper]
  6. Joseph Clements and Yingjie Lao, "Hardware Trojan Attacks on Neural Networks". [paper]
  7. Bingyin Zhao and Yingjie Lao, "Resilience of Pruned Neural Network Against Poisoning Attack", 2018 13th International Conference on Malicious and Unwanted Software (MALWARE). [paper]
  8. Joseph Clements and Yingjie Lao, "Backdoor Attacks on Neural Network Operations", 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP). [paper]