Optimizing automatic morphological classification of galaxies with machine learning and deep learning using Dark Energy Survey imaging. Issue 3 (19th February 2020)
- Record Type:
- Journal Article
- Title:
- Optimizing automatic morphological classification of galaxies with machine learning and deep learning using Dark Energy Survey imaging. Issue 3 (19th February 2020)
- Main Title:
- Optimizing automatic morphological classification of galaxies with machine learning and deep learning using Dark Energy Survey imaging
- Authors:
- Cheng, Ting-Yun
Conselice, Christopher J
Aragón-Salamanca, Alfonso
Li, Nan
Bluck, Asa F L
Hartley, Will G
Annis, James
Brooks, David
Doel, Peter
García-Bellido, Juan
James, David J
Kuehn, Kyler
Kuropatkin, Nikolay
Smith, Mathew
Sobreira, Flavia
Tarle, Gregory - Abstract:
- ABSTRACT: There are several supervised machine learning methods used for the application of automated morphological classification of galaxies; however, there has not yet been a clear comparison of these different methods using imaging data, or an investigation for maximizing their effectiveness. We carry out a comparison between several common machine learning methods for galaxy classification [Convolutional Neural Network (CNN), K-nearest neighbour, logistic regression, Support Vector Machine, Random Forest, and Neural Networks] by using Dark Energy Survey (DES) data combined with visual classifications from the Galaxy Zoo 1 project (GZ1). Our goal is to determine the optimal machine learning methods when using imaging data for galaxy classification. We show that CNN is the most successful method of these ten methods in our study. Using a sample of ∼2800 galaxies with visual classification from GZ1, we reach an accuracy of ∼0.99 for the morphological classification of ellipticals and spirals. The further investigation of the galaxies that have a different ML and visual classification but with high predicted probabilities in our CNN usually reveals the incorrect classification provided by GZ1. We further find the galaxies having a low probability of being either spirals or ellipticals are visually lenticulars (S0), demonstrating that supervised learning is able to rediscover that this class of galaxy is distinct from both ellipticals and spirals. We confirm thatABSTRACT: There are several supervised machine learning methods used for the application of automated morphological classification of galaxies; however, there has not yet been a clear comparison of these different methods using imaging data, or an investigation for maximizing their effectiveness. We carry out a comparison between several common machine learning methods for galaxy classification [Convolutional Neural Network (CNN), K-nearest neighbour, logistic regression, Support Vector Machine, Random Forest, and Neural Networks] by using Dark Energy Survey (DES) data combined with visual classifications from the Galaxy Zoo 1 project (GZ1). Our goal is to determine the optimal machine learning methods when using imaging data for galaxy classification. We show that CNN is the most successful method of these ten methods in our study. Using a sample of ∼2800 galaxies with visual classification from GZ1, we reach an accuracy of ∼0.99 for the morphological classification of ellipticals and spirals. The further investigation of the galaxies that have a different ML and visual classification but with high predicted probabilities in our CNN usually reveals the incorrect classification provided by GZ1. We further find the galaxies having a low probability of being either spirals or ellipticals are visually lenticulars (S0), demonstrating that supervised learning is able to rediscover that this class of galaxy is distinct from both ellipticals and spirals. We confirm that ∼2.5 per cent galaxies are misclassified by GZ1 in our study. After correcting these galaxies' labels, we improve our CNN performance to an average accuracy of over 0.99 (accuracy of 0.994 is our best result). … (more)
- Is Part Of:
- Monthly notices of the Royal Astronomical Society. Volume 493:Issue 3(2020)
- Journal:
- Monthly notices of the Royal Astronomical Society
- Issue:
- Volume 493:Issue 3(2020)
- Issue Display:
- Volume 493, Issue 3 (2020)
- Year:
- 2020
- Volume:
- 493
- Issue:
- 3
- Issue Sort Value:
- 2020-0493-0003-0000
- Page Start:
- 4209
- Page End:
- 4228
- Publication Date:
- 2020-02-19
- Subjects:
- methods: data analysis -- methods: statistical -- galaxies: structure
Astronomy -- Periodicals
Periodicals
520.5 - Journal URLs:
- http://mnras.oxfordjournals.org/ ↗
http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1365-2966 ↗
http://www.blackwell-synergy.com/issuelist.asp?journal=mnr ↗
http://www.blackwell-synergy.com/loi/mnr ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/mnras/staa501 ↗
- Languages:
- English
- ISSNs:
- 0035-8711
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5943.000000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15085.xml