A comparative study of pretrained language models for long clinical text. (30th November 2022)

Record Type:: Journal Article
Title:: A comparative study of pretrained language models for long clinical text. (30th November 2022)
Main Title:: A comparative study of pretrained language models for long clinical text
Authors:: Li, Yikuan
Wehbe, Ramsey M
Ahmad, Faraz S
Wang, Hanyin
Luo, Yuan
Abstract:: Abstract: Objective: Clinical knowledge-enriched transformer models (eg, ClinicalBERT) have state-of-the-art results on clinical natural language processing (NLP) tasks. One of the core limitations of these transformer models is the substantial memory consumption due to their full self-attention mechanism, which leads to the performance degradation in long clinical texts. To overcome this, we propose to leverage long-sequence transformer models (eg, Longformer and BigBird), which extend the maximum input sequence length from 512 to 4096, to enhance the ability to model long-term dependencies in long clinical texts. Materials and methods: Inspired by the success of long-sequence transformer models and the fact that clinical notes are mostly long, we introduce 2 domain-enriched language models, Clinical-Longformer and Clinical-BigBird, which are pretrained on a large-scale clinical corpus. We evaluate both language models using 10 baseline tasks including named entity recognition, question answering, natural language inference, and document classification tasks. Results: The results demonstrate that Clinical-Longformer and Clinical-BigBird consistently and significantly outperform ClinicalBERT and other short-sequence transformers in all 10 downstream tasks and achieve new state-of-the-art results. Discussion: Our pretrained language models provide the bedrock for clinical NLP using long texts. We have made our source code available at … (more)
Is Part Of:: Journal of the American Medical Informatics Association. Volume 30:Number 2(2023)
Journal:: Journal of the American Medical Informatics Association
Issue:: Volume 30:Number 2(2023)
Issue Display:: Volume 30, Issue 2 (2023)
Year:: 2023
Volume:: 30
Issue:: 2
Issue Sort Value:: 2023-0030-0002-0000
Page Start:: 340
Page End:: 347
Publication Date:: 2022-11-30
Subjects:: clinical natural language processing -- text classification -- named entity recognition -- question answering -- natural language inference
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285
Journal URLs:: http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗
DOI:: 10.1093/jamia/ocac225 ↗
Languages:: English
ISSNs:: 1067-5027
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 25160.xml