Phenonizer: A Fine-Grained Phenotypic Named Entity Recognizer for Chinese Clinical Texts. (23rd March 2022)

Record Type:: Journal Article
Title:: Phenonizer: A Fine-Grained Phenotypic Named Entity Recognizer for Chinese Clinical Texts. (23rd March 2022)
Main Title:: Phenonizer: A Fine-Grained Phenotypic Named Entity Recognizer for Chinese Clinical Texts
Authors:: Zou, Qunsheng
Yang, Kuo
Shu, Zixin
Chang, Kai
Zheng, Qiguang
Zheng, Yi
Lu, Kezhi
Xu, Ning
Tian, Haoyu
Li, Xiaomeng
Yang, Yuxia
Zhou, Yana
Yu, Haibin
Zhang, Xiaoping
Xia, Jianan
Zhu, Qiang
Poon, Josiah
Poon, Simon
Zhang, Runshun
Li, Xiaodong
Zhou, Xuezhong
Other Names:: Liang, Zhaohui (Academic Editor)
Abstract:: Biomedical named entity recognition (BioNER) from clinical texts is a fundamental task for clinical data analysis due to the availability of large volumes of electronic medical record data, which are mostly in free text format, in real-world clinical settings. Clinical text data incorporates significant phenotypic medical entities (e.g., symptoms, diseases, and laboratory indexes), which could be used for profiling the clinical characteristics of patients in specific disease conditions (e.g., Coronavirus Disease 2019 (COVID-19)). However, general BioNER approaches mostly rely on coarse-grained annotations of phenotypic entities in benchmark text datasets. Owing to the numerous negation expressions of phenotypic entities (e.g., "no fever," "no cough," and "no hypertension") in clinical texts, this could not feed the subsequent data analysis process with well-prepared structured clinical data. In this paper, we developed the Human-machine Cooperative Phenotypic Spectrum Annotation System (http://www.tcmai.org/login, HCPSAS) and constructed a fine-grained Chinese clinical corpus. Thereafter, we proposed a phenotypic named entity recognizer: Phenonizer, which utilized BERT to capture character-level global contextual representation, extracted local contextual features combined with bidirectional long short-term memory, and finally obtained the optimal label sequences through conditional random field. The results on the COVID-19 dataset show that Phenonizer outperforms those …
Is Part Of:: BioMed research international. Volume 2022(2022)
Journal:: BioMed research international
Issue:: Volume 2022(2022)
Issue Display:: Volume 2022, Issue 2022 (2022)
Year:: 2022
Volume:: 2022
Issue:: 2022
Issue Sort Value:: 2022-2022-2022-0000
Publication Date:: 2022-03-23
Subjects:: Medicine -- Periodicals
Biology -- Periodicals
Biotechnology -- Periodicals
Life sciences -- Periodicals
610.5
Journal URLs:: https://www.hindawi.com/journals/bmri/
DOI:: 10.1155/2022/3524090
Languages:: English
ISSNs:: 2314-6133
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library HMNTS - ELD Digital store
Ingest File:: 21310.xml