Development of a national-scale real-time Twitter data mining pipeline for social geodata on the potential impacts of flooding on communities. (May 2019)
- Record Type:
- Journal Article
- Title:
- Development of a national-scale real-time Twitter data mining pipeline for social geodata on the potential impacts of flooding on communities. (May 2019)
- Main Title:
- Development of a national-scale real-time Twitter data mining pipeline for social geodata on the potential impacts of flooding on communities
- Authors:
- Barker, J.L.P.
Macleod, C.J.A. - Abstract:
- Abstract: Social media, particularly Twitter, is increasingly used to improve resilience during extreme weather events/emergency management situations, including floods: by communicating potential risks and their impacts, and informing agencies and responders. In this paper, we developed a prototype national-scale Twitter data mining pipeline for improved stakeholder situational awareness during flooding events across Great Britain, by retrieving relevant social geodata, grounded in environmental data sources (flood warnings and river levels). With potential users we identified and addressed three research questions to develop this application, whose components constitute a modular architecture for real-time dashboards. First, polling national flood warning and river level Web data sources to obtain at-risk locations. Secondly, real-time retrieval of geotagged tweets, proximate to at-risk areas. Thirdly, filtering flood-relevant tweets with natural language processing and machine learning libraries, using word embeddings of tweets. We demonstrated the national-scale social geodata pipeline using over 420, 000 georeferenced tweets obtained between 20 and 29th June 2016. Highlights: Prototype real-time social geodata pipeline for flood events and demonstration dataset. National-scale flood warnings/river levels set 'at-risk areas' in Twitter API queries. Monitoring multiple locations (without keywords) retrieved current, geotagged tweets. Novel application of word embeddingsAbstract: Social media, particularly Twitter, is increasingly used to improve resilience during extreme weather events/emergency management situations, including floods: by communicating potential risks and their impacts, and informing agencies and responders. In this paper, we developed a prototype national-scale Twitter data mining pipeline for improved stakeholder situational awareness during flooding events across Great Britain, by retrieving relevant social geodata, grounded in environmental data sources (flood warnings and river levels). With potential users we identified and addressed three research questions to develop this application, whose components constitute a modular architecture for real-time dashboards. First, polling national flood warning and river level Web data sources to obtain at-risk locations. Secondly, real-time retrieval of geotagged tweets, proximate to at-risk areas. Thirdly, filtering flood-relevant tweets with natural language processing and machine learning libraries, using word embeddings of tweets. We demonstrated the national-scale social geodata pipeline using over 420, 000 georeferenced tweets obtained between 20 and 29th June 2016. Highlights: Prototype real-time social geodata pipeline for flood events and demonstration dataset. National-scale flood warnings/river levels set 'at-risk areas' in Twitter API queries. Monitoring multiple locations (without keywords) retrieved current, geotagged tweets. Novel application of word embeddings in flooding context identified relevant tweets. Pipeline extracts tweets to visualise using open-source libraries (SciKit Learn/Gensim). … (more)
- Is Part Of:
- Environmental modelling & software. Volume 115(2019)
- Journal:
- Environmental modelling & software
- Issue:
- Volume 115(2019)
- Issue Display:
- Volume 115, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 115
- Issue:
- 2019
- Issue Sort Value:
- 2019-0115-2019-0000
- Page Start:
- 213
- Page End:
- 227
- Publication Date:
- 2019-05
- Subjects:
- Flood management -- Twitter -- Volunteered geographic information -- Natural language processing -- Word embeddings -- Social geodata
Environmental monitoring -- Computer programs -- Periodicals
Ecology -- Computer simulation -- Periodicals
Digital computer simulation -- Periodicals
Computer software -- Periodicals
Environmental Monitoring -- Periodicals
Computer Simulation -- Periodicals
Environnement -- Surveillance -- Logiciels -- Périodiques
Écologie -- Simulation, Méthodes de -- Périodiques
Simulation par ordinateur -- Périodiques
Logiciels -- Périodiques
Computer software
Digital computer simulation
Ecology -- Computer simulation
Environmental monitoring -- Computer programs
Periodicals
Electronic journals
363.70015118 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13648152 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.envsoft.2018.11.013 ↗
- Languages:
- English
- ISSNs:
- 1364-8152
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3791.522800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9628.xml