CASOS: a subspace method for anomaly detection in high dimensional astronomical databases. (6th November 2012)
- Record Type:
- Journal Article
- Title:
- CASOS: a subspace method for anomaly detection in high dimensional astronomical databases. (6th November 2012)
- Main Title:
- CASOS: a subspace method for anomaly detection in high dimensional astronomical databases
- Authors:
- Henrion, Marc
Hand, David J.
Gandy, Axel
Mortlock, Daniel J. - Abstract:
- <abstract abstract-type="main" xml:lang="en"> <title>Abstract</title> <p>We develop a novel algorithm for detecting anomalies. Our method has been developed to suit the challenging task of detecting anomalous sources in cross‐matched astronomical survey data. Our algorithm computes anomaly scores in lower‐dimensional subspaces of the data. By subspaces we mean, in this work, subsets of the original data variables. Our technique presents several advantages over existing methods: it can work directly on data with missing values; it addresses some of the problems posed by high‐dimensional data spaces; it is less susceptible to a masking effect from irrelevant features; it can be easily adapted to suit specific needs and it allows an easier interpretation of why a given object has a high combined anomaly score. One drawback of our method is that it cannot detect outliers that are only apparent in high‐dimensional spaces. Anomaly scores are computed using a nearest neighbor (NN) technique, but the algorithm works with any other method computing numerical anomaly scores. We demonstrate the properties of our algorithm and evaluate its performance on both simulated and real datasets. We show that it is capable of outperforming state‐of‐the‐art, full‐dimensional approaches in some situations. © 2013 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 6: 53–72, 2013</p> </abstract>
- Is Part Of:
- Statistical analysis and data mining. Volume 6:Number 1(2013)
- Journal:
- Statistical analysis and data mining
- Issue:
- Volume 6:Number 1(2013)
- Issue Display:
- Volume 6, Issue 1 (2013)
- Year:
- 2013
- Volume:
- 6
- Issue:
- 1
- Issue Sort Value:
- 2013-0006-0001-0000
- Page Start:
- 53
- Page End:
- 72
- Publication Date:
- 2012-11-06
- Subjects:
- Data mining -- Statistical methods -- Periodicals
006.312 - Journal URLs:
- http://www3.interscience.wiley.com/journal/112701062/home ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/sam.11167 ↗
- Languages:
- English
- ISSNs:
- 1932-1864
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8447.424100
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 3207.xml