Adaptive real‐time anomaly detection in cloud infrastructures. (4th August 2017)
- Record Type:
- Journal Article
- Title:
- Adaptive real‐time anomaly detection in cloud infrastructures. (4th August 2017)
- Main Title:
- Adaptive real‐time anomaly detection in cloud infrastructures
- Authors:
- Agrawal, Bikash
Wiktorski, Tomasz
Rong, Chunming - Other Names:
- Carretero Jesus guestEditor.
Garcia‐Blas Javier guestEditor.
Nakano Koji guestEditor.
Mueller Peter guestEditor.
Grosu Daniel guestEditor.
Zheng Sheng guestEditor.
Xu Li guestEditor.
Xu Zheng guestEditor.
Yen Neil guestEditor.
Sugumaran Vijayan guestEditor. - Abstract:
- Summary: Cloud computing has become increasingly popular, which has led many individuals and organizations towards cloud storage systems. This move is motivated by benefits such as shared storage, computation, and transparent service among a massive number of users. However, cloud‐computing systems require the maintenance of complex and large‐scale systems with practically unavoidable runtime problems caused by hardware and software faults. Large systems are very complex due to heterogeneity, dynamicity, scalability, hidden complexity, and time limitations. Automatic anomaly detection is a critical technique for managing such complex cloud resources. This paper proposes a scalable model for automatic anomaly detection on a large system like a cloud. The anomaly detection process is capable of issuing a correct early warning of unusual behavior in dynamic environments after learning the system characteristic of normal operation. To detect unusual activity in the cloud, we need to monitor the data center and collect cloud performance logs. In this paper, we propose an adaptive anomaly detection mechanism, which investigates principal components of the performance metrics. It transforms the performance metrics into a low‐rank matrix and calculates the orthogonal distance using the Robust PCA algorithm. The proposed model updates itself recursively, while learning and adjusting the new threshold value, to minimize reconstruction errors. This paper also investigates robustSummary: Cloud computing has become increasingly popular, which has led many individuals and organizations towards cloud storage systems. This move is motivated by benefits such as shared storage, computation, and transparent service among a massive number of users. However, cloud‐computing systems require the maintenance of complex and large‐scale systems with practically unavoidable runtime problems caused by hardware and software faults. Large systems are very complex due to heterogeneity, dynamicity, scalability, hidden complexity, and time limitations. Automatic anomaly detection is a critical technique for managing such complex cloud resources. This paper proposes a scalable model for automatic anomaly detection on a large system like a cloud. The anomaly detection process is capable of issuing a correct early warning of unusual behavior in dynamic environments after learning the system characteristic of normal operation. To detect unusual activity in the cloud, we need to monitor the data center and collect cloud performance logs. In this paper, we propose an adaptive anomaly detection mechanism, which investigates principal components of the performance metrics. It transforms the performance metrics into a low‐rank matrix and calculates the orthogonal distance using the Robust PCA algorithm. The proposed model updates itself recursively, while learning and adjusting the new threshold value, to minimize reconstruction errors. This paper also investigates robust principal component analysis in distributed environments using Apache Spark as the underlying framework. It specifically addresses cases in which normal operation might exhibit multiple hidden modes. The accuracy and sensitivity of the model were tested on Amazon CloudWatch datasets, and Yahoo! datasets. The model achieved an accuracy of 88.54 % . … (more)
- Is Part Of:
- Concurrency and computation. Volume 29:Number 24(2017)
- Journal:
- Concurrency and computation
- Issue:
- Volume 29:Number 24(2017)
- Issue Display:
- Volume 29, Issue 24 (2017)
- Year:
- 2017
- Volume:
- 29
- Issue:
- 24
- Issue Sort Value:
- 2017-0029-0024-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2017-08-04
- Subjects:
- anomaly detection -- cloud computing -- lambda architecture -- outlier detection -- real‐time -- robust PCA -- SVD
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.4193 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 5422.xml