Applying machine learning to understand water security and water access inequality in underserved colonia communities. (June 2023)
- Record Type:
- Journal Article
- Title:
- Applying machine learning to understand water security and water access inequality in underserved colonia communities. (June 2023)
- Main Title:
- Applying machine learning to understand water security and water access inequality in underserved colonia communities
- Authors:
- Gu, Zhining
Li, Wenwen
Hanemann, Michael
Tsai, Yushiou
Wutich, Amber
Westerhoff, Paul
Landes, Laura
Roque, Anais D.
Zheng, Madeleine
Velasco, Carmen A.
Porter, Sarah - Abstract:
- Abstract: This paper explores the application of machine learning to enhance our understanding of water accessibility issues in underserved communities called colonias located along the northern part of the United States–Mexico border. We analyzed >2000 such communities using data from the Rural Community Assistance Partnership (RCAP) and applied hierarchical clustering and the adaptive affinity propagation algorithm to automatically group colonias into clusters with different water access conditions. The Gower distance was introduced to make the algorithm capable of processing complex datasets containing both categorical and numerical attributes. To better understand and explain the clustering results derived from the machine learning process, we further applied a decision tree analysis algorithm to associate the input data with the derived clusters, to identify and rank the importance of factors that characterize different water access conditions in each cluster. Our results complement experts' priority rankings of water infrastructure needs, providing a more in-depth view of the water insecurity challenges that the colonias suffer from. As an automated and reproducible workflow combining a series of tools, the proposed machine learning pipeline represents an operationalized solution for conducting data-driven analysis to understand water access inequality. This pipeline can be adapted to analyze different datasets and decision scenarios. Highlights: Develop a machineAbstract: This paper explores the application of machine learning to enhance our understanding of water accessibility issues in underserved communities called colonias located along the northern part of the United States–Mexico border. We analyzed >2000 such communities using data from the Rural Community Assistance Partnership (RCAP) and applied hierarchical clustering and the adaptive affinity propagation algorithm to automatically group colonias into clusters with different water access conditions. The Gower distance was introduced to make the algorithm capable of processing complex datasets containing both categorical and numerical attributes. To better understand and explain the clustering results derived from the machine learning process, we further applied a decision tree analysis algorithm to associate the input data with the derived clusters, to identify and rank the importance of factors that characterize different water access conditions in each cluster. Our results complement experts' priority rankings of water infrastructure needs, providing a more in-depth view of the water insecurity challenges that the colonias suffer from. As an automated and reproducible workflow combining a series of tools, the proposed machine learning pipeline represents an operationalized solution for conducting data-driven analysis to understand water access inequality. This pipeline can be adapted to analyze different datasets and decision scenarios. Highlights: Develop a machine learning approach to understand the water insecurity issues in the underserved colonias communities. Introduce Gower distance into affinity propagation clustering algorithm to handle complex datasets. The automated, interpretable, and reproducible workflow can be easily adapted to other spatial analysis and decision tasks. … (more)
- Is Part Of:
- Computers, environment and urban systems. Volume 102(2023)
- Journal:
- Computers, environment and urban systems
- Issue:
- Volume 102(2023)
- Issue Display:
- Volume 102, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 102
- Issue:
- 2023
- Issue Sort Value:
- 2023-0102-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-06
- Subjects:
- Machine learning -- Water security -- Hierarchical clustering -- Adaptive affinity propagation -- Underserved communities
City planning -- Data processing -- Periodicals
Regional planning -- Data processing -- Periodicals
303.4834 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01989715 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compenvurbsys.2023.101969 ↗
- Languages:
- English
- ISSNs:
- 0198-9715
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.914000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 27079.xml