Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets. Issue 115 (7th December 2016)
- Record Type:
- Journal Article
- Title:
- Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets. Issue 115 (7th December 2016)
- Main Title:
- Feature extraction from resolution perspective for gas chromatography-mass spectrometry datasets
- Authors:
- Ma, Pan
Zhang, Zhimin
Zhou, Xinyi
Yun, Yonghuan
Liang, Yizeng
Lu, Hongmei - Abstract:
- Abstract : Automatic feature extraction from large-scale datasets is one of the major challenges when analyzing complex samples with gas chromatography-mass spectrometry (GC-MS). Abstract : Automatic feature extraction from large-scale datasets is one of the major challenges when analyzing complex samples with gas chromatography-mass spectrometry (GC-MS). The classic processing pipeline basically consists of noise filtering, baseline correction, peak detection, alignment, normalization and identification. The long pipeline makes the extracted features inconsistent with different methods and values of parameters. In this study, MS-Assisted Resolution of Signals (MARS) has been proposed to extract features automatically from resolution perspective for large-scale GC-MS datasets. Firstly, it divides complex data into small segments and searches the target zone by moving sub-window factor analysis (MSWFA). Then, improved iterative target transformation factor analysis (ITTFA) has been developed to extract features of the compound from complex datasets. MARS was systematically tested on a simulated dataset (5 samples), peppermint dataset (2 samples), red wine dataset (24 samples) and human plasma dataset (131 samples). The results show that MARS can extract features accurately, automatically, objectively and swiftly from these complex datasets at 2–3 minutes/chromatogram speed. The extracted features of overlapped peaks are comparable to the features resolved by MCR-ALS orAbstract : Automatic feature extraction from large-scale datasets is one of the major challenges when analyzing complex samples with gas chromatography-mass spectrometry (GC-MS). Abstract : Automatic feature extraction from large-scale datasets is one of the major challenges when analyzing complex samples with gas chromatography-mass spectrometry (GC-MS). The classic processing pipeline basically consists of noise filtering, baseline correction, peak detection, alignment, normalization and identification. The long pipeline makes the extracted features inconsistent with different methods and values of parameters. In this study, MS-Assisted Resolution of Signals (MARS) has been proposed to extract features automatically from resolution perspective for large-scale GC-MS datasets. Firstly, it divides complex data into small segments and searches the target zone by moving sub-window factor analysis (MSWFA). Then, improved iterative target transformation factor analysis (ITTFA) has been developed to extract features of the compound from complex datasets. MARS was systematically tested on a simulated dataset (5 samples), peppermint dataset (2 samples), red wine dataset (24 samples) and human plasma dataset (131 samples). The results show that MARS can extract features accurately, automatically, objectively and swiftly from these complex datasets at 2–3 minutes/chromatogram speed. The extracted features of overlapped peaks are comparable to the features resolved by MCR-ALS or PARAFAC2, and significantly better than XCMS. Furthermore, PLS-DA models of the human plasma dataset indicated that features extracted automatically by MARS are comparable or better than features extracted manually by experts with a GC-MS workstation. It has been implemented and open-sourced at Web:https://github.com/zmzhang/MARS . … (more)
- Is Part Of:
- RSC advances. Volume 6:Issue 115(2016)
- Journal:
- RSC advances
- Issue:
- Volume 6:Issue 115(2016)
- Issue Display:
- Volume 6, Issue 115 (2016)
- Year:
- 2016
- Volume:
- 6
- Issue:
- 115
- Issue Sort Value:
- 2016-0006-0115-0000
- Page Start:
- 113997
- Page End:
- 114004
- Publication Date:
- 2016-12-07
- Subjects:
- Chemistry -- Periodicals
540.5 - Journal URLs:
- http://pubs.rsc.org/en/Journals/JournalIssues/RA ↗
http://www.rsc.org/ ↗ - DOI:
- 10.1039/c6ra17864b ↗
- Languages:
- English
- ISSNs:
- 2046-2069
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8036.750300
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 1266.xml