Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora. (February 2023)
- Record Type:
- Journal Article
- Title:
- Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora. (February 2023)
- Main Title:
- Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora
- Authors:
- Gajendran, Sudhakaran
Manjula, D.
Sugumaran, Vijayan
Hema, R. - Abstract:
- Abstract: The number of biomedical articles published is increasing rapidly over the years. Currently there are about 30 million articles in PubMed and over 25 million mentions in Medline. Among these fundamentals, Biomedical Named Entity Recognition (BioNER) and Biomedical Relation Extraction (BioRE) are the most essential in analysing the literature. In the biomedical domain, Knowledge Graph is used to visualize the relationships between various entities such as proteins, chemicals and diseases. Scientific publications have increased dramatically as a result of the search for treatments and potential cures for the new Coronavirus, but efficiently analysing, integrating, and utilising related sources of information remains a difficulty. In order to effectively combat the disease during pandemics like COVID-19, literature must be used quickly and effectively. In this paper, we introduced a fully automated framework consists of BERT-BiLSTM, Knowledge graph, and Representation Learning model to extract the top diseases, chemicals, and proteins related to COVID-19 from the literature. The proposed framework uses Named Entity Recognition models for disease recognition, chemical recognition, and protein recognition. Then the system uses the Chemical - Disease Relation Extraction and Chemical - Protein Relation Extraction models. And the system extracts the entities and relations from the CORD-19 dataset using the models. The system then creates a Knowledge Graph for the extractedAbstract: The number of biomedical articles published is increasing rapidly over the years. Currently there are about 30 million articles in PubMed and over 25 million mentions in Medline. Among these fundamentals, Biomedical Named Entity Recognition (BioNER) and Biomedical Relation Extraction (BioRE) are the most essential in analysing the literature. In the biomedical domain, Knowledge Graph is used to visualize the relationships between various entities such as proteins, chemicals and diseases. Scientific publications have increased dramatically as a result of the search for treatments and potential cures for the new Coronavirus, but efficiently analysing, integrating, and utilising related sources of information remains a difficulty. In order to effectively combat the disease during pandemics like COVID-19, literature must be used quickly and effectively. In this paper, we introduced a fully automated framework consists of BERT-BiLSTM, Knowledge graph, and Representation Learning model to extract the top diseases, chemicals, and proteins related to COVID-19 from the literature. The proposed framework uses Named Entity Recognition models for disease recognition, chemical recognition, and protein recognition. Then the system uses the Chemical - Disease Relation Extraction and Chemical - Protein Relation Extraction models. And the system extracts the entities and relations from the CORD-19 dataset using the models. The system then creates a Knowledge Graph for the extracted relations and entities. The system performs Representation Learning on this KG to get the embeddings of all entities and get the top related diseases, chemicals, and proteins with respect to COVID-19. Graphical Abstract: ga1 Highlights: A BERT-BiLSTM-CRF model to identify the biomedical named entities. A SciBERT model to extract the relations between the biomedical entities. Creating a Knowledge Graph for the COVID – 19 with entities as nodes and relations as edges. Use of Representation Learning to identify top related drugs, disease and proteins for COVID – 19. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 102(2023)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 102(2023)
- Issue Display:
- Volume 102, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 102
- Issue:
- 2023
- Issue Sort Value:
- 2023-0102-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-02
- Subjects:
- Biomedical Named Entity Recognition (BioNER) -- Relation Extraction (RE) -- Knowledge graph -- Representation learning -- BERT -- BiLSTM
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2022.107808 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 25136.xml