09 Exploring the genotype-phenotype associations of colorectal cancer using vector space model. (5th December 2017)
- Record Type:
- Journal Article
- Title:
- 09 Exploring the genotype-phenotype associations of colorectal cancer using vector space model. (5th December 2017)
- Main Title:
- 09 Exploring the genotype-phenotype associations of colorectal cancer using vector space model
- Authors:
- Deng, N
NK, Du
Feng, YN
Wang, ZY
Duan, HL
Liu, F - Abstract:
- Abstract : Background: Colorectal cancer is a malignant tumour which endangers human lives. With the rapid development of molecular medicine, a great deal of research related to clinic-omics data has been published. Mining the association of genotype-phenotype data has been increasingly recognised as an effective way for early stage prediction of colorectal cancer. Methods: In this study, a literature text mining method was proposed for biomedical objects association using the Vector Space Model (VSM). For each article, we represented biomedical objects as the vectors of VSM. Gene symbols were denoted as the genotype objects, and the MeSH terms annotated from the literature were denoted as the phenotype objects. A TF-IDF algorithm was then used to quantitatively calculate the correlation between genotype and phenotype objects. Results: A total of 473 242 articles related to colorectal cancer were acquired from the MEDLINE database. We finally obtained 77 clinical terms and 490 genes highly related to colorectal cancer, resulting in 2125 associations between these clinical terms and genes. Biological pathway analysis by KEGG database demonstrated that genotype-phenotype association mining from our study covers all stages of the development of colorectal cancer, a number of which were at the early stage. These findings might become a beneficial complement of cancer translation research. Conclusion: Our study provides a biomedical literature mining method for cancerAbstract : Background: Colorectal cancer is a malignant tumour which endangers human lives. With the rapid development of molecular medicine, a great deal of research related to clinic-omics data has been published. Mining the association of genotype-phenotype data has been increasingly recognised as an effective way for early stage prediction of colorectal cancer. Methods: In this study, a literature text mining method was proposed for biomedical objects association using the Vector Space Model (VSM). For each article, we represented biomedical objects as the vectors of VSM. Gene symbols were denoted as the genotype objects, and the MeSH terms annotated from the literature were denoted as the phenotype objects. A TF-IDF algorithm was then used to quantitatively calculate the correlation between genotype and phenotype objects. Results: A total of 473 242 articles related to colorectal cancer were acquired from the MEDLINE database. We finally obtained 77 clinical terms and 490 genes highly related to colorectal cancer, resulting in 2125 associations between these clinical terms and genes. Biological pathway analysis by KEGG database demonstrated that genotype-phenotype association mining from our study covers all stages of the development of colorectal cancer, a number of which were at the early stage. These findings might become a beneficial complement of cancer translation research. Conclusion: Our study provides a biomedical literature mining method for cancer translational research such as construction of a precision medicine knowledge base, biomarker prediction/evaluation, and knowledge discovery in texts. Acknowledgements: Supported by the National key research and development program of China (No. 2016YFC0901703), and the Public Projects of Zhejiang Province, China (No. 2017C33064). … (more)
- Is Part Of:
- Journal of investigative medicine. Volume 65(2017)Supplement 7
- Journal:
- Journal of investigative medicine
- Issue:
- Volume 65(2017)Supplement 7
- Issue Display:
- Volume 65, Issue 7 (2017)
- Year:
- 2017
- Volume:
- 65
- Issue:
- 7
- Issue Sort Value:
- 2017-0065-0007-0000
- Page Start:
- A3
- Page End:
- A3
- Publication Date:
- 2017-12-05
- Subjects:
- Clinical medicine -- Periodicals
Medicine -- Research -- Periodicals
Medicine
Research -- United States
Clinical medicine
Medicine -- Research
Periodicals
616.075 - Journal URLs:
- http://journals.lww.com/jinvestigativemed/pages/default.aspx ↗
http://jim.bmj.com/ ↗
https://journals.sagepub.com/home/IMJ ↗
http://journals.lww.com ↗ - DOI:
- 10.1136/jim-2017-MEBabstracts.9 ↗
- Languages:
- English
- ISSNs:
- 1081-5589
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5008.010000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 18669.xml