A hybrid method for Chinese address segmentation. Issue 1 (2nd January 2018)
- Record Type:
- Journal Article
- Title:
- A hybrid method for Chinese address segmentation. Issue 1 (2nd January 2018)
- Main Title:
- A hybrid method for Chinese address segmentation
- Authors:
- Li, Lin
Wang, Wei
He, Biao
Zhang, Yu - Abstract:
- ABSTRACT: Chinese address segmentation is a serious challenge in geographic information system geocoding. Most previous studies have relied on predefined gazetteers without considering the information contained by a raw address corpus. In this paper, a hybrid method employing both rule-based and statistical methods is proposed for Chinese address segmentation without a predefined gazetteer. This approach utilizes statistical methods to extract address information from a raw address corpus and a rule-based method to segment Chinese addresses. Two typical statistical methods and their combinations with rule-based methods are compared with the hybrid method in an experiment involving approximately 460, 000 address items in Shenzhen City, China. The experimental results indicate that the proposed method achieves an F -score of over 0.8, which is better than those of existing methods, thus validating the proposed method.
- Is Part Of:
- International journal of geographical information science. Volume 32:Issue 1(2018)
- Journal:
- International journal of geographical information science
- Issue:
- Volume 32:Issue 1(2018)
- Issue Display:
- Volume 32, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 32
- Issue:
- 1
- Issue Sort Value:
- 2018-0032-0001-0000
- Page Start:
- 30
- Page End:
- 48
- Publication Date:
- 2018-01-02
- Subjects:
- Geocoding -- Chinese address segmentation without gazetteers -- hybrid method
Geography -- Data processing -- Periodicals
Information storage and retrieval systems -- Periodicals
Géomatique -- Périodiques
Systèmes d'information -- Périodiques
910.285 - Journal URLs:
- http://www.tandfonline.com/loi/tgis20 ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/13658816.2017.1379084 ↗
- Languages:
- English
- ISSNs:
- 1365-8816
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.266150
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5332.xml