Calibration of score based likelihood ratio estimation in automated forensic facial image comparison. (May 2022)
- Record Type:
- Journal Article
- Title:
- Calibration of score based likelihood ratio estimation in automated forensic facial image comparison. (May 2022)
- Main Title:
- Calibration of score based likelihood ratio estimation in automated forensic facial image comparison
- Authors:
- Rodriguez, Andrea Macarulla
Geradts, Zeno
Worring, Marcel - Abstract:
- Abstract: Forensic facial image comparison lacks a methodological standardization and empirical validation. We aim to address this problem by assessing the potential of machine learning to support the human expert in the courtroom. To yield valid evidence in court, decision making systems for facial image comparison should not only be accurate, they should also provide a calibrated confidence measure. This confidence is best conveyed using a score-based likelihood ratio. In this study we compare the performance of different calibrations for such scores. The score, either a distance or a similarity, is converted to a likelihood ratio using three types of calibration following similar techniques as applied in forensic fields such as speaker comparison and DNA matching, but which have not yet been tested in facial image comparison. The calibration types tested are: naive, quality score based on typicality, and feature-based. As transparency is essential in forensics, we focus on state-of-the-art open software and study their power compared to a state-of-the-art commercial system. With the European Network of Forensic Science Institutes (ENFSI) Proficiency tests as benchmark, calibration results on three public databases namely Labeled Faces in the Wild, SC Face and ForenFace show that both quality score and feature based calibration outperform naive calibration. Overall, the commercial system outperforms open software when evaluating these Likelihood Ratios. In general, weAbstract: Forensic facial image comparison lacks a methodological standardization and empirical validation. We aim to address this problem by assessing the potential of machine learning to support the human expert in the courtroom. To yield valid evidence in court, decision making systems for facial image comparison should not only be accurate, they should also provide a calibrated confidence measure. This confidence is best conveyed using a score-based likelihood ratio. In this study we compare the performance of different calibrations for such scores. The score, either a distance or a similarity, is converted to a likelihood ratio using three types of calibration following similar techniques as applied in forensic fields such as speaker comparison and DNA matching, but which have not yet been tested in facial image comparison. The calibration types tested are: naive, quality score based on typicality, and feature-based. As transparency is essential in forensics, we focus on state-of-the-art open software and study their power compared to a state-of-the-art commercial system. With the European Network of Forensic Science Institutes (ENFSI) Proficiency tests as benchmark, calibration results on three public databases namely Labeled Faces in the Wild, SC Face and ForenFace show that both quality score and feature based calibration outperform naive calibration. Overall, the commercial system outperforms open software when evaluating these Likelihood Ratios. In general, we conclude that calibration implemented before likelihood ratio estimation is recommended. Furthermore, in terms of performance the commercial system is preferred over open software. As open software is more transparent, more research on open software is urged for. Highlights: Automated Face Comparison systems can help the forensic investigator in court. Likelihood Ratio (LR) is an appropriate method to convey Face Comparison assessments. Applying filters to calibration improves the LR estimation compared to naive calibration. Commercial software performance outperforms open software. … (more)
- Is Part Of:
- Forensic science international. Volume 334(2022)
- Journal:
- Forensic science international
- Issue:
- Volume 334(2022)
- Issue Display:
- Volume 334, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 334
- Issue:
- 2022
- Issue Sort Value:
- 2022-0334-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-05
- Subjects:
- Facial image comparison -- Calibration -- Deep learning -- Forensic science -- Score based likelihood ratio
Medical jurisprudence -- Periodicals
Chemistry, Forensic -- Periodicals
Forensic Medicine -- Periodicals
Médecine légale -- Périodiques
Chimie légale -- Périodiques
Gerechtelijke geneeskunde
Gerechtelijke chemie
Gerechtelijke psychiatrie
Chemistry, Forensic
Medical jurisprudence
Electronic journals
Periodicals
Electronic journals
614.1 - Journal URLs:
- http://www.clinicalkey.com.au/dura/browse/journalIssue/03790738 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/03790738 ↗
http://www.sciencedirect.com/science/journal/03790738 ↗
http://infotrac.galegroup.com/itw/infomark/1/1/1/purl=rc18_EAIM_0__jn+%22Forensic+Science+International%22?sw_aep=stand ↗
http://www.elsevier.com/homepage/elecserv.htt ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.forsciint.2022.111239 ↗
- Languages:
- English
- ISSNs:
- 0379-0738
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3987.764000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21235.xml