Natural Language Processing Algorithms for Normalizing Expressions of Synonymous Symptoms in Traditional Chinese Medicine. (11th October 2021)
- Record Type:
- Journal Article
- Title:
- Natural Language Processing Algorithms for Normalizing Expressions of Synonymous Symptoms in Traditional Chinese Medicine. (11th October 2021)
- Main Title:
- Natural Language Processing Algorithms for Normalizing Expressions of Synonymous Symptoms in Traditional Chinese Medicine
- Authors:
- Zhou, Lu
Liu, Shuangqiao
Li, Caiyan
Sun, Yuemeng
Zhang, Yizhuo
Li, Yuda
Yuan, Huimin
Sun, Yan
Xu, Fengqin
Li, Yuhang - Other Names:
- Ahmed Mediani Academic Editor.
- Abstract:
- Abstract : Background . The modernization of traditional Chinese medicine (TCM) demands systematic data mining using medical records. However, this process is hindered by the fact that many TCM symptoms have the same meaning but different literal expressions (i.e., TCM synonymous symptoms). This problem can be solved by using natural language processing algorithms to construct a high-quality TCM symptom normalization model for normalizing TCM synonymous symptoms to unified literal expressions. Methods . Four types of TCM symptom normalization models, based on natural language processing, were constructed to find a high-quality one: (1) a text sequence generation model based on a bidirectional long short-term memory (Bi-LSTM) neural network with an encoder-decoder structure; (2) a text classification model based on a Bi-LSTM neural network and sigmoid function; (3) a text sequence generation model based on bidirectional encoder representation from transformers (BERT) with sequence-to-sequence training method of unified language model (BERT-UniLM); (4) a text classification model based on BERT and sigmoid function (BERT-Classification). The performance of the models was compared using four metrics: accuracy, recall, precision, and F1-score. Results . The BERT-Classification model outperformed the models based on Bi-LSTM and BERT-UniLM with respect to the four metrics. Conclusions . The BERT-Classification model has superior performance in normalizing expressions of TCMAbstract : Background . The modernization of traditional Chinese medicine (TCM) demands systematic data mining using medical records. However, this process is hindered by the fact that many TCM symptoms have the same meaning but different literal expressions (i.e., TCM synonymous symptoms). This problem can be solved by using natural language processing algorithms to construct a high-quality TCM symptom normalization model for normalizing TCM synonymous symptoms to unified literal expressions. Methods . Four types of TCM symptom normalization models, based on natural language processing, were constructed to find a high-quality one: (1) a text sequence generation model based on a bidirectional long short-term memory (Bi-LSTM) neural network with an encoder-decoder structure; (2) a text classification model based on a Bi-LSTM neural network and sigmoid function; (3) a text sequence generation model based on bidirectional encoder representation from transformers (BERT) with sequence-to-sequence training method of unified language model (BERT-UniLM); (4) a text classification model based on BERT and sigmoid function (BERT-Classification). The performance of the models was compared using four metrics: accuracy, recall, precision, and F1-score. Results . The BERT-Classification model outperformed the models based on Bi-LSTM and BERT-UniLM with respect to the four metrics. Conclusions . The BERT-Classification model has superior performance in normalizing expressions of TCM synonymous symptoms. … (more)
- Is Part Of:
- Evidence-based complementary and alternative medicine. Volume 2021(2021)
- Journal:
- Evidence-based complementary and alternative medicine
- Issue:
- Volume 2021(2021)
- Issue Display:
- Volume 2021, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 2021
- Issue:
- 2021
- Issue Sort Value:
- 2021-2021-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-10-11
- Subjects:
- Alternative medicine -- Periodicals
615.505 - Journal URLs:
- http://ecam.oupjournals.org ↗
http://www.ncbi.nlm.nih.gov/pmc/journals/241/ ↗
http://www.hindawi.com/journals/ecam/ ↗ - DOI:
- 10.1155/2021/6676607 ↗
- Languages:
- English
- ISSNs:
- 1741-427X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3831.036630
British Library HMNTS - ELD Digital store - Ingest File:
- 19907.xml