TY  - EJOU
AU  - Sameen, Maria
AU  - Hwang, Seong Oun

TI  - DISTINÏCT: Data poISoning atTacks dectectIon usiNg optÏmized jaCcard disTance
T2  - Computers, Materials & Continua

PY  - 2022
VL  - 73
IS  - 3
SN  - 1546-2226

AB  - Machine Learning (ML) systems often undergo re-training to improve their predictions and classifications. This re-training process opens a loophole and poses a security threat: adversaries exploit it to mount data poisoning attacks, in which the adversary manipulates the training dataset to degrade the ML system’s performance. Data poisoning attacks are challenging to detect, and even more difficult to respond to, particularly in the Internet of Things (IoT) environment. To address this problem, we propose DISTINÏCT, the first proactive data poisoning attack detection framework using distance measures. We found that Jaccard Distance (JD), among other distance measures, can be used in DISTINÏCT, and we further improved JD to obtain an Optimized JD (OJD) with lower time and space complexity. Our security analysis, which considers key features of adversarial attacks, shows that DISTINÏCT is secure against data poisoning attacks. We conclude that the proposed OJD-based DISTINÏCT is effective and efficient against data poisoning attacks in IoT applications with large volumes of streaming data, where timely detection is critical.
KW  - Data poisoning attacks; detection framework; Jaccard distance (JD); optimized Jaccard distance (OJD); security analysis

DO  - 10.32604/cmc.2022.031091