Toponym matching through deep neural networks. Issue 2 (1st February 2018)
- Record Type:
- Journal Article
- Title:
- Toponym matching through deep neural networks. Issue 2 (1st February 2018)
- Main Title:
- Toponym matching through deep neural networks
- Authors:
- Santos, Rui
Murrieta-Flores, Patricia
Calado, Pável
Martins, Bruno - Abstract:
- ABSTRACT: Toponym matching, i.e. pairing strings that represent the same real-world location, is a fundamental problemfor several practical applications. The current state-of-the-art relies on string similarity metrics, either specifically developed for matching place names or integrated within methods that combine multiple metrics. However, these methods all rely on common sub-strings in order to establish similarity, and they do not effectively capture the character replacements involved in toponym changes due to transliterations or to changes in language and culture over time. In this article, we present a novel matching approach, leveraging a deep neural network to classify pairs of toponyms as either matching or nonmatching. The proposed network architecture uses recurrent nodes to build representations from the sequences of bytes that correspond to the strings that are to be matched. These representations are then combined and passed to feed-forward nodes, finally leading to a classification decision. We present the results of a wide-ranging evaluation on the performance of the proposed method, using a large dataset collected from the GeoNames gazetteer. These results show that the proposed method can significantly outperform individual similarity metrics from previous studies, as well as previous methods based on supervised machine learning for combining multiple metrics.
- Is Part Of:
- International journal of geographical information science. Volume 32:Issue 2(2018)
- Journal:
- International journal of geographical information science
- Issue:
- Volume 32:Issue 2(2018)
- Issue Display:
- Volume 32, Issue 2 (2018)
- Year:
- 2018
- Volume:
- 32
- Issue:
- 2
- Issue Sort Value:
- 2018-0032-0002-0000
- Page Start:
- 324
- Page End:
- 348
- Publication Date:
- 2018-02-01
- Subjects:
- Toponym matching -- duplicate detection -- approximate string matching -- deep neural networks -- recurrent neural networks -- geographic information retrieval
Geography -- Data processing -- Periodicals
Information storage and retrieval systems -- Periodicals
Géomatique -- Périodiques
Systèmes d'information -- Périodiques
910.285 - Journal URLs:
- http://www.tandfonline.com/loi/tgis20 ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/13658816.2017.1390119 ↗
- Languages:
- English
- ISSNs:
- 1365-8816
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.266150
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5368.xml