Multi-source uncertain entity resolution: Transforming holocaust victim reports into people. (April 2017)
- Record Type:
- Journal Article
- Title:
- Multi-source uncertain entity resolution: Transforming holocaust victim reports into people. (April 2017)
- Main Title:
- Multi-source uncertain entity resolution: Transforming holocaust victim reports into people
- Authors:
- Sagi, Tomer
Gal, Avigdor
Barkol, Omer
Bergman, Ruth
Avram, Alexander
- Abstract:
- In this work we present a multi-source uncertain entity resolution model and show its implementation in a use case of Yad Vashem, the central repository of Holocaust-era information. The Yad Vashem dataset is unique with respect to classic entity resolution, by virtue of both being massively multi-source and requiring multi-level entity resolution. With today's abundance of information sources, this project motivates the use of multi-source resolution on a big-data scale. We instantiate the proposed model using the MFIBlocks entity resolution algorithm and a machine learning approach based on decision trees to transform soft clusters into a ranked clustering of records, representing possible entities. An extensive empirical evaluation demonstrates the unique properties of this dataset that make it a good candidate for multi-source entity resolution. We conclude by proposing avenues for future research in this realm.
Highlights: Uncertain Entity Resolution allows creating multiple narratives from complementary sources of data. The approach was demonstrated during a unique project performed on the Yad Vashem Names database. Algorithms implementing the approach were empirically evaluated on a tagged subset in various configurations and against equivalent algorithms. The accurate and insightful results are being integrated into Yad Vashem systems and user applications.
- Is Part Of:
- Information systems. Volume 65(2017)
- Journal:
- Information systems
- Issue:
- Volume 65(2017)
- Issue Display:
- Volume 65, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 65
- Issue:
- 2017
- Issue Sort Value:
- 2017-0065-2017-0000
- Page Start:
- 124
- Page End:
- 136
- Publication Date:
- 2017-04
- Subjects:
- Uncertain entity resolution -- Blocking -- Holocaust
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7
- Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.is.2016.12.003
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 13048.xml