Natural Language Processing (NLP) in Qualitative Public Health Research: A Proof of Concept Study. Issue 1 (11th November 2019)
- Record Type:
- Journal Article
- Title:
- Natural Language Processing (NLP) in Qualitative Public Health Research: A Proof of Concept Study. Issue 1 (11th November 2019)
- Main Title:
- Natural Language Processing (NLP) in Qualitative Public Health Research: A Proof of Concept Study
- Authors:
- Leeson, William
Resnick, Adam
Alexander, Daniel
Rovers, John - Abstract:
- Qualitative data-analysis methods provide thick, rich descriptions of subjects' thoughts, feelings, and lived experiences but may be time-consuming, labor-intensive, or prone to bias. Natural language processing (NLP) is a machine learning technique from computer science that uses algorithms to analyze textual data. NLP allows processing of large amounts of data almost instantaneously. As researchers become conversant with NLP, it is becoming more frequently employed outside of computer science and shows promise as a tool to analyze qualitative data in public health. This is a proof of concept paper to evaluate the potential of NLP to analyze qualitative data. Specifically, we ask if NLP can support conventional qualitative analysis, and if so, what its role is. We compared a qualitative method of open coding with two forms of NLP, Topic Modeling, and Word2Vec to analyze transcripts from interviews conducted in rural Belize querying men about their health needs. All three methods returned a series of terms that captured ideas and concepts in subjects' responses to interview questions. Open coding returned 5–10 words or short phrases for each question. Topic Modeling returned a series of word-probability pairs that quantified how well a word captured the topic of a response. Word2Vec returned a list of words for each interview question ordered by which words were predicted to best capture the meaning of the passage. For most interview questions, all three methods returnedQualitative data-analysis methods provide thick, rich descriptions of subjects' thoughts, feelings, and lived experiences but may be time-consuming, labor-intensive, or prone to bias. Natural language processing (NLP) is a machine learning technique from computer science that uses algorithms to analyze textual data. NLP allows processing of large amounts of data almost instantaneously. As researchers become conversant with NLP, it is becoming more frequently employed outside of computer science and shows promise as a tool to analyze qualitative data in public health. This is a proof of concept paper to evaluate the potential of NLP to analyze qualitative data. Specifically, we ask if NLP can support conventional qualitative analysis, and if so, what its role is. We compared a qualitative method of open coding with two forms of NLP, Topic Modeling, and Word2Vec to analyze transcripts from interviews conducted in rural Belize querying men about their health needs. All three methods returned a series of terms that captured ideas and concepts in subjects' responses to interview questions. Open coding returned 5–10 words or short phrases for each question. Topic Modeling returned a series of word-probability pairs that quantified how well a word captured the topic of a response. Word2Vec returned a list of words for each interview question ordered by which words were predicted to best capture the meaning of the passage. For most interview questions, all three methods returned conceptually similar results. NLP may be a useful adjunct to qualitative analysis. NLP may be performed after data have undergone open coding as a check on the accuracy of the codes. Alternatively, researchers can perform NLP prior to open coding and use the results to guide their creation of their codebook. … (more)
- Is Part Of:
- International journal of qualitative methods. Volume 18:Issue 1(2019)
- Journal:
- International journal of qualitative methods
- Issue:
- Volume 18:Issue 1(2019)
- Issue Display:
- Volume 18, Issue 1 (2019)
- Year:
- 2019
- Volume:
- 18
- Issue:
- 1
- Issue Sort Value:
- 2019-0018-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2019-11-11
- Subjects:
- machine learning -- natural language processing -- open coding -- topic modeling -- Word2Vec -- public health research
Qualitative research -- Periodicals
Social sciences -- Philosophy -- Periodicals
Social sciences -- Research -- Periodicals
Social sciences -- Statistical methods -- Periodicals
Qualitative Research -- Periodicals
Qualitative research
Social sciences -- Philosophy
Social sciences -- Research
Periodicals
300.72 - Journal URLs:
- http://ijq.sagepub.com/ ↗
http://www.ualberta.ca/%7Eijqm/ ↗
http://www.sagepublications.com/ ↗ - DOI:
- 10.1177/1609406919887021 ↗
- Languages:
- English
- ISSNs:
- 1609-4069
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12251.xml