Investigating DNA‐, RNA‐, and protein‐based features as a means to discriminate pathogenic synonymous variants. Issue 10 (10th July 2017)
- Record Type:
- Journal Article
- Title:
- Investigating DNA‐, RNA‐, and protein‐based features as a means to discriminate pathogenic synonymous variants. Issue 10 (10th July 2017)
- Main Title:
- Investigating DNA‐, RNA‐, and protein‐based features as a means to discriminate pathogenic synonymous variants
- Authors:
- Livingstone, Mark
Folkman, Lukas
Yang, Yuedong
Zhang, Ping
Mort, Matthew
Cooper, David N.
Liu, Yunlong
Stantic, Bela
Zhou, Yaoqi - Abstract:
- Abstract : The article presents a machine learning model, termed DDIG‐SN, as a means to discriminate disease‐causing synonymous variants. For training DDIG‐SN, we evaluated a range of DNA‐, RNA‐, and protein‐level features. We were able to show that the disease‐causing effects in the immediate proximity to exon‐intron junctions (1–3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4–69 bp). Abstract: Synonymous single‐nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA‐/RNA‐binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG‐SN) as a means to discriminate disease‐causing synonymous variants. The model was trained and evaluated on nearly 900 disease‐causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein‐stratified 10‐fold cross‐validation and independent testing, respectively. We were able to show that the disease‐causingAbstract : The article presents a machine learning model, termed DDIG‐SN, as a means to discriminate disease‐causing synonymous variants. For training DDIG‐SN, we evaluated a range of DNA‐, RNA‐, and protein‐level features. We were able to show that the disease‐causing effects in the immediate proximity to exon‐intron junctions (1–3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4–69 bp). Abstract: Synonymous single‐nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA‐/RNA‐binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG‐SN) as a means to discriminate disease‐causing synonymous variants. The model was trained and evaluated on nearly 900 disease‐causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein‐stratified 10‐fold cross‐validation and independent testing, respectively. We were able to show that the disease‐causing effects in the immediate proximity to exon–intron junctions (1–3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4–69 bp). The method is available as a part of the DDIG server athttp://sparks-lab.org/ddig . … (more)
- Is Part Of:
- Human mutation. Volume 38:Issue 10(2017)
- Journal:
- Human mutation
- Issue:
- Volume 38:Issue 10(2017)
- Issue Display:
- Volume 38, Issue 10 (2017)
- Year:
- 2017
- Volume:
- 38
- Issue:
- 10
- Issue Sort Value:
- 2017-0038-0010-0000
- Page Start:
- 1336
- Page End:
- 1347
- Publication Date:
- 2017-07-10
- Subjects:
- bioinformatics -- machine learning -- same‐sense variant -- silent mutation -- synonymous SNV
Human chromosome abnormalities -- Periodicals
Mutation (Biology) -- Periodicals
616.04205 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-1004 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/humu.23283 ↗
- Languages:
- English
- ISSNs:
- 1059-7794
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4336.217000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 4684.xml