Assessing the similarity of surface linguistic features related to epilepsy across pediatric hospitals. (1st April 2014)
- Record Type:
- Journal Article
- Title:
- Assessing the similarity of surface linguistic features related to epilepsy across pediatric hospitals. (1st April 2014)
- Main Title:
- Assessing the similarity of surface linguistic features related to epilepsy across pediatric hospitals
- Authors:
- Connolly, Brian
Matykiewicz, Pawel
Bretonnel Cohen, K
Standridge, Shannon M
Glauser, Tracy A
Dlugos, Dennis J
Koh, Susan
Tham, Eric
Pestian, John - Abstract:
- Abstract: Objective The constant progress in computational linguistic methods provides amazing opportunities for discovering information in clinical text and enables the clinical scientist to explore novel approaches to care. However, these new approaches need evaluation. We describe an automated system to compare descriptions of epilepsy patients at three different organizations: Cincinnati Children's Hospital, the Children's Hospital Colorado, and the Children's Hospital of Philadelphia. To our knowledge, there have been no similar previous studies. Materials and methods In this work, a support vector machine (SVM)-based natural language processing (NLP) algorithm is trained to classify epilepsy progress notes as belonging to a patient with a specific type of epilepsy from a particular hospital. The same SVM is then used to classify notes from another hospital. Our null hypothesis is that an NLP algorithm cannot be trained using epilepsy-specific notes from one hospital and subsequently used to classify notes from another hospital better than a random baseline classifier. The hypothesis is tested using epilepsy progress notes from the three hospitals. Results We are able to reject the null hypothesis at the 95% level. It is also found that classification was improved by including notes from a second hospital in the SVM training sample. Discussion and conclusion With a reasonably uniform epilepsy vocabulary and an NLP-based algorithm able to use this uniformity to classifyAbstract: Objective The constant progress in computational linguistic methods provides amazing opportunities for discovering information in clinical text and enables the clinical scientist to explore novel approaches to care. However, these new approaches need evaluation. We describe an automated system to compare descriptions of epilepsy patients at three different organizations: Cincinnati Children's Hospital, the Children's Hospital Colorado, and the Children's Hospital of Philadelphia. To our knowledge, there have been no similar previous studies. Materials and methods In this work, a support vector machine (SVM)-based natural language processing (NLP) algorithm is trained to classify epilepsy progress notes as belonging to a patient with a specific type of epilepsy from a particular hospital. The same SVM is then used to classify notes from another hospital. Our null hypothesis is that an NLP algorithm cannot be trained using epilepsy-specific notes from one hospital and subsequently used to classify notes from another hospital better than a random baseline classifier. The hypothesis is tested using epilepsy progress notes from the three hospitals. Results We are able to reject the null hypothesis at the 95% level. It is also found that classification was improved by including notes from a second hospital in the SVM training sample. Discussion and conclusion With a reasonably uniform epilepsy vocabulary and an NLP-based algorithm able to use this uniformity to classify epilepsy progress notes across different hospitals, we can pursue automated comparisons of patient conditions, treatments, and diagnoses across different healthcare settings. … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 21:Number 5(2014:Sep.)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 21:Number 5(2014:Sep.)
- Issue Display:
- Volume 21, Issue 5 (2014)
- Year:
- 2014
- Volume:
- 21
- Issue:
- 5
- Issue Sort Value:
- 2014-0021-0005-0000
- Page Start:
- 866
- Page End:
- 870
- Publication Date:
- 2014-04-01
- Subjects:
- Multicenter -- Epilepsy -- Linguistics -- Support vector machines -- Text classification
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1136/amiajnl-2013-002601 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15452.xml