Machine learning innovations in address matching: A practical comparison of word2vec and CRFs. Issue 2 (8th April 2019)
- Record Type:
- Journal Article
- Title:
- Machine learning innovations in address matching: A practical comparison of word2vec and CRFs. Issue 2 (8th April 2019)
- Main Title:
- Machine learning innovations in address matching: A practical comparison of word2vec and CRFs
- Authors:
- Comber, Sam
Arribas‐Bel, Daniel - Abstract:
- Abstract: Record linkage is a frequent obstacle to unlocking the benefits of integrated (spatial) data sources. In the absence of unique identifiers to directly join records, practitioners often rely on text‐based approaches for resolving candidate pairs of records to a match. In geographic information science, spatial record linkage is a form of geocoding that pertains to the resolution of text‐based linkage between pairs of addresses into matches and non‐matches. These approaches link text‐based address sequences, integrating sources of data that would otherwise remain in isolation. While recent innovations in machine learning have been introduced in the wider record linkage literature, there is significant potential to apply machine learning to the address matching sub‐field of geographic information science. As a response, this paper introduces two recent developments in text‐based machine learning—conditional random fields and word2vec—that have not been applied to address matching, evaluating their comparative strengths and drawbacks.
- Is Part Of:
- Transactions in GIS. Volume 23:Issue 2(2019)
- Journal:
- Transactions in GIS
- Issue:
- Volume 23:Issue 2(2019)
- Issue Display:
- Volume 23, Issue 2 (2019)
- Year:
- 2019
- Volume:
- 23
- Issue:
- 2
- Issue Sort Value:
- 2019-0023-0002-0000
- Page Start:
- 334
- Page End:
- 348
- Publication Date:
- 2019-04-08
- Subjects:
- Geographic information systems -- Periodicals
910.285 - Journal URLs:
- http://www.blackwell-synergy.com/servlet/useragent?func=showIssues&code=tgis ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/tgis.12522 ↗
- Languages:
- English
- ISSNs:
- 1361-1682
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 9020.502000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9812.xml