A novel statistical method predicts mutability of the genomic segments of the SARS-CoV-2 virus. (13th December 2022)
- Record Type:
- Journal Article
- Title:
- A novel statistical method predicts mutability of the genomic segments of the SARS-CoV-2 virus. (13th December 2022)
- Main Title:
- A novel statistical method predicts mutability of the genomic segments of the SARS-CoV-2 virus
- Authors:
- Darooneh, Amir Hossein
Przedborski, Michelle
Kohandel, Mohammad - Abstract:
- Abstract: The SARS-CoV-2 virus has made the largest pandemic of the 21st century, with hundreds of millions of cases and tens of millions of fatalities. Scientists all around the world are racing to develop vaccines and new pharmaceuticals to overcome the pandemic and offer effective treatments for COVID-19 disease. Consequently, there is an essential need to better understand how the pathogenesis of SARS-CoV-2 is affected by viral mutations and to determine the conserved segments in the viral genome that can serve as stable targets for novel therapeutics. Here, we introduce a text-mining method to estimate the mutability of genomic segments directly from a reference (ancestral) whole genome sequence. The method relies on calculating the importance of genomic segments based on their spatial distribution and frequency over the whole genome. To validate our approach, we perform a large-scale analysis of the viral mutations in nearly 80, 000 publicly available SARS-CoV-2 predecessor whole genome sequences and show that these results are highly correlated with the segments predicted by the statistical method used for keyword detection. Importantly, these correlations are found to hold at the codon and gene levels, as well as for gene coding regions. Using the text-mining method, we further identify codon sequences that are potential candidates for siRNA-based antiviral drugs. Significantly, one of the candidates identified in this work corresponds to the first seven codons of anAbstract: The SARS-CoV-2 virus has made the largest pandemic of the 21st century, with hundreds of millions of cases and tens of millions of fatalities. Scientists all around the world are racing to develop vaccines and new pharmaceuticals to overcome the pandemic and offer effective treatments for COVID-19 disease. Consequently, there is an essential need to better understand how the pathogenesis of SARS-CoV-2 is affected by viral mutations and to determine the conserved segments in the viral genome that can serve as stable targets for novel therapeutics. Here, we introduce a text-mining method to estimate the mutability of genomic segments directly from a reference (ancestral) whole genome sequence. The method relies on calculating the importance of genomic segments based on their spatial distribution and frequency over the whole genome. To validate our approach, we perform a large-scale analysis of the viral mutations in nearly 80, 000 publicly available SARS-CoV-2 predecessor whole genome sequences and show that these results are highly correlated with the segments predicted by the statistical method used for keyword detection. Importantly, these correlations are found to hold at the codon and gene levels, as well as for gene coding regions. Using the text-mining method, we further identify codon sequences that are potential candidates for siRNA-based antiviral drugs. Significantly, one of the candidates identified in this work corresponds to the first seven codons of an epitope of the spike glycoprotein, which is the only SARS-CoV-2 immunogenic peptide without a match to a human protein. … (more)
- Is Part Of:
- QRB discovery. Volume 3(2022)
- Journal:
- QRB discovery
- Issue:
- Volume 3(2022)
- Issue Display:
- Volume 3, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 3
- Issue:
- 2022
- Issue Sort Value:
- 2022-0003-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12-13
- Subjects:
- SARS-CoV2 -- Mutability -- Statistical Analysis -- Word Ranking
Biophysics -- Periodicals
571.4 - Journal URLs:
- https://www.cambridge.org/core/journals/qrb-discovery ↗
- DOI:
- 10.1017/qrd.2021.13 ↗
- Languages:
- English
- ISSNs:
- 2633-2892
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 23066.xml