Adapting the Edinburgh Geoparser for Historical Georeferencing. Issue 1 (March 2015)
- Record Type:
- Journal Article
- Title:
- Adapting the Edinburgh Geoparser for Historical Georeferencing. Issue 1 (March 2015)
- Main Title:
- Adapting the Edinburgh Geoparser for Historical Georeferencing
- Authors:
- Alex, Beatrice
Byrne, Kate
Grover, Claire
Tobin, Richard - Abstract:
- Abstract : Place name mentions in text may have more than one potential referent (e.g. Peru, the country vs. Peru, the city in Indiana). The Edinburgh Language Technology Group (LTG) has developed the Edinburgh Geoparser, a system that can automatically recognise place name mentions in text and disambiguate them with respect to a gazetteer. The recognition step is required to identify location mentions in a given piece of text. The subsequent disambiguation step, generally referred to as georesolution, grounds location mentions to their corresponding gazetteer entries with latitude and longitude values, for example, to visualise them on a map. Geoparsing is not only useful for mapping purposes but also for making document collections more accessible as it can provide additional metadata about the geographical content of documents. Combined with other information mined from text such as person names and date expressions, complex relations between such pieces of information can be identified. The Edinburgh Geoparser can be used with several gazetteers including Unlock and GeoNames to process a variety of input texts. The original version of the Geoparser was a demonstrator configured for modern text. Since then, it has been adapted to georeference historic and ancient text collections as well as modern-day newspaper text. 1, 2, 3, 4 Currently, the LTG is involved in three research projects applying the Geoparser to historical text collections of very different types and for aAbstract : Place name mentions in text may have more than one potential referent (e.g. Peru, the country vs. Peru, the city in Indiana). The Edinburgh Language Technology Group (LTG) has developed the Edinburgh Geoparser, a system that can automatically recognise place name mentions in text and disambiguate them with respect to a gazetteer. The recognition step is required to identify location mentions in a given piece of text. The subsequent disambiguation step, generally referred to as georesolution, grounds location mentions to their corresponding gazetteer entries with latitude and longitude values, for example, to visualise them on a map. Geoparsing is not only useful for mapping purposes but also for making document collections more accessible as it can provide additional metadata about the geographical content of documents. Combined with other information mined from text such as person names and date expressions, complex relations between such pieces of information can be identified. The Edinburgh Geoparser can be used with several gazetteers including Unlock and GeoNames to process a variety of input texts. The original version of the Geoparser was a demonstrator configured for modern text. Since then, it has been adapted to georeference historic and ancient text collections as well as modern-day newspaper text. 1, 2, 3, 4 Currently, the LTG is involved in three research projects applying the Geoparser to historical text collections of very different types and for a variety of end-user applications. This paper discusses the ways in which we have customised the Geoparser for specific datasets and applications relevant to each project. … (more)
- Is Part Of:
- International journal of humanities and arts computing. Volume 9:Issue 1(2015)
- Journal:
- International journal of humanities and arts computing
- Issue:
- Volume 9:Issue 1(2015)
- Issue Display:
- Volume 9, Issue 1 (2015)
- Year:
- 2015
- Volume:
- 9
- Issue:
- 1
- Issue Sort Value:
- 2015-0009-0001-0000
- Page Start:
- 15
- Page End:
- 35
- Publication Date:
- 2015-03
- Subjects:
- Georeferencing -- georesolution -- text mining -- domain adaptation
Humanities -- Data processing -- Periodicals
Arts -- Data processing -- Periodicals
001.30285 - Journal URLs:
- http://www.euppublishing.com/journals ↗
http://www.eupjournals.com/journal/ijhac ↗ - DOI:
- 10.3366/ijhac.2015.0136 ↗
- Languages:
- English
- ISSNs:
- 1753-8548
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 540.xml