An Optimized Density-based Algorithm for Anomaly Detection in High Dimensional Datasets

Main Article Content

Adeel Shiraz Hashmi
Mohammad Najmud Doja
Tanvir Ahmad

Abstract


In this study, the authors aim to propose an optimized density-based algorithm for anomaly detection with focus on high-dimensional datasets. The optimization is achieved by optimizing the input parameters of the algorithm using firefly meta-heuristic. The performance of different similarity measures for the algorithm is compared including both L1 and L2 norms to identify the most efficient similarity measure for high-dimensional datasets. The algorithm is optimized further in terms of speed and scalability by using Apache Spark big data platform. The experiments were conducted on publicly available datasets, and the results were evaluated on various performance metrics like execution time, accuracy, sensitivity, and specificity.

Article Details

Section
Research Papers