A naive Bayes algorithm for tissue origin diagnosis (TOD‐Bayes) of synchronous multifocal tumors in the hepatobiliary and pancreatic system. Issue 2 (16th October 2017)
- Record Type:
- Journal Article
- Title:
- A naive Bayes algorithm for tissue origin diagnosis (TOD‐Bayes) of synchronous multifocal tumors in the hepatobiliary and pancreatic system. Issue 2 (16th October 2017)
- Main Title:
- A naive Bayes algorithm for tissue origin diagnosis (TOD‐Bayes) of synchronous multifocal tumors in the hepatobiliary and pancreatic system
- Authors:
- Jiang, Weiqin
Shen, Yifei
Ding, Yongfeng
Ye, Chuyu
Zheng, Yi
Zhao, Peng
Liu, Lulu
Tong, Zhou
Zhou, Linfu
Sun, Shuo
Zhang, Xingchen
Teng, Lisong
Timko, Michael P.
Fan, Longjiang
Fang, Weijia - Abstract:
- Abstract : Synchronous multifocal tumors are common in the hepatobiliary and pancreatic system but because of similarities in their histological features, oncologists have difficulty in identifying their precise tissue clonal origin through routine histopathological methods. To address this problem and assist in more precise diagnosis, we developed a computational approach for tissue origin diagnosis based on naive Bayes algorithm (TOD‐Bayes) using ubiquitous RNA‐Seq data. Massive tissue‐specific RNA‐Seq data sets were first obtained from The Cancer Genome Atlas (TCGA) and ∼1, 000 feature genes were used to train and validate the TOD‐Bayes algorithm. The accuracy of the model was >95% based on tenfold cross validation by the data from TCGA. A total of 18 clinical cancer samples (including six negative controls) with definitive tissue origin were subsequently used for external validation and 17 of the 18 samples were classified correctly in our study (94.4%). Furthermore, we included as cases studies seven tumor samples, taken from two individuals who suffered from synchronous multifocal tumors across tissues, where the efforts to make a definitive primary cancer diagnosis by traditional diagnostic methods had failed. Using our TOD‐Bayes analysis, the two clinical test cases were successfully diagnosed as pancreatic cancer (PC) and cholangiocarcinoma (CC), respectively, in agreement with their clinical outcomes. Based on our findings, we believe that the TOD‐Bayes algorithmAbstract : Synchronous multifocal tumors are common in the hepatobiliary and pancreatic system but because of similarities in their histological features, oncologists have difficulty in identifying their precise tissue clonal origin through routine histopathological methods. To address this problem and assist in more precise diagnosis, we developed a computational approach for tissue origin diagnosis based on naive Bayes algorithm (TOD‐Bayes) using ubiquitous RNA‐Seq data. Massive tissue‐specific RNA‐Seq data sets were first obtained from The Cancer Genome Atlas (TCGA) and ∼1, 000 feature genes were used to train and validate the TOD‐Bayes algorithm. The accuracy of the model was >95% based on tenfold cross validation by the data from TCGA. A total of 18 clinical cancer samples (including six negative controls) with definitive tissue origin were subsequently used for external validation and 17 of the 18 samples were classified correctly in our study (94.4%). Furthermore, we included as cases studies seven tumor samples, taken from two individuals who suffered from synchronous multifocal tumors across tissues, where the efforts to make a definitive primary cancer diagnosis by traditional diagnostic methods had failed. Using our TOD‐Bayes analysis, the two clinical test cases were successfully diagnosed as pancreatic cancer (PC) and cholangiocarcinoma (CC), respectively, in agreement with their clinical outcomes. Based on our findings, we believe that the TOD‐Bayes algorithm is a powerful novel methodology to accurately identify the tissue origin of synchronous multifocal tumors of unknown primary cancers using RNA‐Seq data and an important step toward more precision‐based medicine in cancer diagnosis and treatment. Abstract : What's new? Multiple tumors that develop from a single primary tumor within a short timeframe are often anatomically and histologically similar. As a result, such tumors frequently obscure the identification of tissue clonal origin, complicating diagnosis. To improve the accuracy of tissue origin identification, the authors of this report developed a Bayes algorithm using gene data sets specific for hepatobiliary and pancreatic tumors. The algorithm correctly classified tissue clonal origin more than 95% of the time for clinical samples of pancreatic cancer, cholangiocarcinoma, and hepatocellular carcinoma. The algorithm was also expanded to the precise diagnosis of cancer of unknown primary. … (more)
- Is Part Of:
- International journal of cancer. Volume 142:Issue 2(2018)
- Journal:
- International journal of cancer
- Issue:
- Volume 142:Issue 2(2018)
- Issue Display:
- Volume 142, Issue 2 (2018)
- Year:
- 2018
- Volume:
- 142
- Issue:
- 2
- Issue Sort Value:
- 2018-0142-0002-0000
- Page Start:
- 357
- Page End:
- 368
- Publication Date:
- 2017-10-16
- Subjects:
- synchronous multifocal tumors -- tissue origin -- RNA‐Seq -- naive Bayes algorithm -- hepatobiliary and pancreatic system
Cancer -- Periodicals
Cancer -- Prevention -- Periodicals
616.994 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1097-0215 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/ijc.31054 ↗
- Languages:
- English
- ISSNs:
- 0020-7136
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.156000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5477.xml